期刊文献+
共找到71篇文章
< 1 2 4 >
每页显示 20 50 100
Deep Learning Enabled Microarray Gene Expression Classification for Data Science Applications
1
作者 Areej A.Malibari Reem M.Alshehri +5 位作者 Fahd N.Al-Wesabi Noha Negm Mesfer Al Duhayyim Anwer Mustafa Hilal Ishfaq Yaseen Abdelwahed Motwakel 《Computers, Materials & Continua》 SCIE EI 2022年第11期4277-4290,共14页
In bioinformatics applications,examination of microarray data has received significant interest to diagnose diseases.Microarray gene expression data can be defined by a massive searching space that poses a primary cha... In bioinformatics applications,examination of microarray data has received significant interest to diagnose diseases.Microarray gene expression data can be defined by a massive searching space that poses a primary challenge in the appropriate selection of genes.Microarray data classification incorporates multiple disciplines such as bioinformatics,machine learning(ML),data science,and pattern classification.This paper designs an optimal deep neural network based microarray gene expression classification(ODNN-MGEC)model for bioinformatics applications.The proposed ODNN-MGEC technique performs data normalization process to normalize the data into a uniform scale.Besides,improved fruit fly optimization(IFFO)based feature selection technique is used to reduce the high dimensionality in the biomedical data.Moreover,deep neural network(DNN)model is applied for the classification of microarray gene expression data and the hyperparameter tuning of the DNN model is carried out using the Symbiotic Organisms Search(SOS)algorithm.The utilization of IFFO and SOS algorithms pave the way for accomplishing maximum gene expression classification outcomes.For examining the improved outcomes of the ODNN-MGEC technique,a wide ranging experimental analysis is made against benchmark datasets.The extensive comparison study with recent approaches demonstrates the enhanced outcomes of the ODNN-MGEC technique in terms of different measures. 展开更多
关键词 BIOINFORMATICS data science microarray gene expression data classification deep learning metaheuristics
在线阅读 下载PDF
Optimizing Cancer Classification and Gene Discovery with an Adaptive Learning Search Algorithm for Microarray Analysis
2
作者 Chiwen Qu Heng Yao +1 位作者 Tingjiang Pan Zenghui Lu 《Journal of Bionic Engineering》 2025年第2期901-930,共30页
DNA microarrays, a cornerstone in biomedicine, measure gene expression across thousands to tens of thousands of genes. Identifying the genes vital for accurate cancer classification is a key challenge. Here, we presen... DNA microarrays, a cornerstone in biomedicine, measure gene expression across thousands to tens of thousands of genes. Identifying the genes vital for accurate cancer classification is a key challenge. Here, we present Fs-LSA (F-score based Learning Search Algorithm), a novel gene selection algorithm designed to enhance the precision and efficiency of target gene identification from microarray data for cancer classification. This algorithm is divided into two phases: the first leverages F-score values to prioritize and select feature genes with the most significant differential expression;the second phase introduces our Learning Search Algorithm (LSA), which harnesses swarm intelligence to identify the optimal subset among the remaining genes. Inspired by human social learning, LSA integrates historical data and collective intelligence for a thorough search, with a dynamic control mechanism that balances exploration and refinement, thereby enhancing the gene selection process. We conducted a rigorous validation of Fs-LSA’s performance using eight publicly available cancer microarray expression datasets. Fs-LSA achieved accuracy, precision, sensitivity, and F1-score values of 0.9932, 0.9923, 0.9962, and 0.994, respectively. Comparative analyses with state-of-the-art algorithms revealed Fs-LSA’s superior performance in terms of simplicity and efficiency. Additionally, we validated the algorithm’s efficacy independently using glioblastoma data from GEO and TCGA databases. It was significantly superior to those of the comparison algorithms. Importantly, the driver genes identified by Fs-LSA were instrumental in developing a predictive model as an independent prognostic indicator for glioblastoma, underscoring Fs-LSA’s transformative potential in genomics and personalized medicine. 展开更多
关键词 gene selection Learning search algorithm gene expression data classification
暂未订购
A Survey on Acute Leukemia Expression Data Classification Using Ensembles
3
作者 Abdel Nasser H.Zaied Ehab Rushdy Mona Gamal 《Computer Systems Science & Engineering》 SCIE EI 2023年第11期1349-1364,共16页
Acute leukemia is an aggressive disease that has high mortality rates worldwide.The error rate can be as high as 40%when classifying acute leukemia into its subtypes.So,there is an urgent need to support hematologists... Acute leukemia is an aggressive disease that has high mortality rates worldwide.The error rate can be as high as 40%when classifying acute leukemia into its subtypes.So,there is an urgent need to support hematologists during the classification process.More than two decades ago,researchers used microarray gene expression data to classify cancer and adopted acute leukemia as a test case.The high classification accuracy they achieved confirmed that it is possible to classify cancer subtypes using microarray gene expression data.Ensemble machine learning is an effective method that combines individual classifiers to classify new samples.Ensemble classifiers are recognized as powerful algorithms with numerous advantages over traditional classifiers.Over the past few decades,researchers have focused a great deal of attention on ensemble classifiers in a wide variety of fields,including but not limited to disease diagnosis,finance,bioinformatics,healthcare,manufacturing,and geography.This paper reviews the recent ensemble classifier approaches utilized for acute leukemia gene expression data classification.Moreover,a framework for classifying acute leukemia gene expression data is proposed.The pairwise correlation gene selection method and the Rotation Forest of Bayesian Networks are both used in this framework.Experimental outcomes show that the classification accuracy achieved by the acute leukemia ensemble classifiers constructed according to the suggested framework is good compared to the classification accuracy achieved in other studies. 展开更多
关键词 LEUKEMIA classification ENSEMBLE rotation forest pairwise correlation bayesian networks gene expression data MICROARRAY gene selection
在线阅读 下载PDF
Enhancing Cancer Classification through a Hybrid Bio-Inspired Evolutionary Algorithm for Biomarker Gene Selection 被引量:1
4
作者 Hala AlShamlan Halah AlMazrua 《Computers, Materials & Continua》 SCIE EI 2024年第4期675-694,共20页
In this study,our aim is to address the problem of gene selection by proposing a hybrid bio-inspired evolutionary algorithm that combines Grey Wolf Optimization(GWO)with Harris Hawks Optimization(HHO)for feature selec... In this study,our aim is to address the problem of gene selection by proposing a hybrid bio-inspired evolutionary algorithm that combines Grey Wolf Optimization(GWO)with Harris Hawks Optimization(HHO)for feature selection.Themotivation for utilizingGWOandHHOstems fromtheir bio-inspired nature and their demonstrated success in optimization problems.We aimto leverage the strengths of these algorithms to enhance the effectiveness of feature selection in microarray-based cancer classification.We selected leave-one-out cross-validation(LOOCV)to evaluate the performance of both two widely used classifiers,k-nearest neighbors(KNN)and support vector machine(SVM),on high-dimensional cancer microarray data.The proposed method is extensively tested on six publicly available cancer microarray datasets,and a comprehensive comparison with recently published methods is conducted.Our hybrid algorithm demonstrates its effectiveness in improving classification performance,Surpassing alternative approaches in terms of precision.The outcomes confirm the capability of our method to substantially improve both the precision and efficiency of cancer classification,thereby advancing the development ofmore efficient treatment strategies.The proposed hybridmethod offers a promising solution to the gene selection problem in microarray-based cancer classification.It improves the accuracy and efficiency of cancer diagnosis and treatment,and its superior performance compared to other methods highlights its potential applicability in realworld cancer classification tasks.By harnessing the complementary search mechanisms of GWO and HHO,we leverage their bio-inspired behavior to identify informative genes relevant to cancer diagnosis and treatment. 展开更多
关键词 Bio-inspired algorithms BIOINFORMATICS cancer classification evolutionary algorithm feature selection gene expression grey wolf optimizer harris hawks optimization k-nearest neighbor support vector machine
在线阅读 下载PDF
基于Fiedler向量的基因表达谱数据分类方法 被引量:4
5
作者 王年 庄振华 +1 位作者 唐俊 苏亮亮 《中国生物工程杂志》 CAS CSCD 北大核心 2010年第12期82-86,共5页
尝试将一种基于图的Fiedler向量的聚类算法引入到基因表达谱数据的肿瘤分类中来。该方法将分属不同类的所有样本通过高斯权构造Laplace完全图,经SVD分解后获得Fiedler向量,利用各样本所对应的Fiedler向量分量的符号差异来进行基因表达... 尝试将一种基于图的Fiedler向量的聚类算法引入到基因表达谱数据的肿瘤分类中来。该方法将分属不同类的所有样本通过高斯权构造Laplace完全图,经SVD分解后获得Fiedler向量,利用各样本所对应的Fiedler向量分量的符号差异来进行基因表达谱数据的分类。通过模拟数据仿真实验和对白血病两个亚型(ALL与AML)及结肠癌真实数据实验,证明了这一方法的有效性。 展开更多
关键词 分类 fiedler向量 基因表达数据
原文传递
Gene Expression Data Classification Using Consensus Independent Component Analysis 被引量:7
6
作者 Chun-Hou Zheng De-Shuang Huang +1 位作者 Xiang-Zhen Kong Xing-Ming Zhao 《Genomics, Proteomics & Bioinformatics》 SCIE CAS CSCD 2008年第2期74-82,共9页
We propose a new method for tumor classification from gene expression data, which mainly contains three steps. Firstly, the original DNA microarray gene expression data are modeled by independent component analysis (... We propose a new method for tumor classification from gene expression data, which mainly contains three steps. Firstly, the original DNA microarray gene expression data are modeled by independent component analysis (ICA). Secondly, the most discriminant eigenassays extracted by ICA are selected by the sequential floating forward selection technique. Finally, support vector machine is used to classify the modeling data. To show the validity of the proposed method, we applied it to classify three DNA microarray datasets involving various human normal and tumor tissue samples. The experimental results show that the method is efficient and feasible. 展开更多
关键词 independent component analysis feature selection support vector machine gene expression data
在线阅读 下载PDF
Cancer classification based on microarray gene expression data using a principal component accumulation method 被引量:2
7
作者 LIU JingJing CAI WenSheng SHAO XueGuang 《Science China Chemistry》 SCIE EI CAS 2011年第5期802-811,共10页
The classification of cancer is a major research topic in bioinformatics. The nature of high dimensionality and small size associated with gene expression data,however,makes the classification quite challenging. Altho... The classification of cancer is a major research topic in bioinformatics. The nature of high dimensionality and small size associated with gene expression data,however,makes the classification quite challenging. Although principal component analysis (PCA) is of particular interest for the high-dimensional data,it may overemphasize some aspects and ignore some other important information contained in the richly complex data,because it displays only the difference in the first twoor three-dimensional PC subspaces. Based on PCA,a principal component accumulation (PCAcc) method was proposed. It employs the information contained in multiple PC subspaces and improves the class separability of cancers. The effectiveness of the present method was evaluated by four commonly used gene expression datasets,and the results show that the method performs well for cancer classification. 展开更多
关键词 cancer classification principal component analysis principal component accumulation gene expression data
原文传递
The expression and antigenicity identification of recombinant rat TGF-β1 in bacteria 被引量:1
8
作者 GaoCF KongXT 《Cell Research》 SCIE CAS CSCD 2001年第2期95-100,共6页
In order to study structure-function details of TGF-beta1, the recombinant mature form of rat TGF-beta1 was expressed in bacteria. Synthesis of the 112 amino-acid carboxyl-terminal part of TGF-beta1 (amino acid 279-39... In order to study structure-function details of TGF-beta1, the recombinant mature form of rat TGF-beta1 was expressed in bacteria. Synthesis of the 112 amino-acid carboxyl-terminal part of TGF-beta1 (amino acid 279-390) was controlled by an inducible gene expression system based on bacteriophage T7 RNA polymerase. This system allowed an active and selective synthesis of recombinant TGF-beta1. The molecular weight of expressed TGF-alpha1 monomer determined on SDS-polyacrylamide gel under reducing conditions was about 13 kD. Serial detergent washes combined with a single gel-filtration purification step were sufficient to purify the expression product to homogeneity. Amino-terminal sequencing revealed that the N-terminal of the recombinant protein was identical to the published data. In Western blot analysis the recombinant polypeptide showed excellent antigenicity against polyclonal TGF-beta1 antibody. The mature recombinant rat TGF-beta1 expressed in this study provides a useful tool for future detailed structural and functional studies. 展开更多
关键词 Amino Acid Sequence Animals Base Sequence EPITOPES Escherichia coli gene expression Regulation Bacterial genetic vectors Molecular Sequence data Plasmids Protein Structure Tertiary Rats Recombinant Proteins Research Support Non-U.S. Gov't Transformation genetic Transforming Growth Factor beta
在线阅读 下载PDF
Predictive Analysis of Microarray Data
9
作者 Paulo C.Marques F. Carlos A.de B.Pereira 《Open Journal of Genetics》 2014年第1期63-68,共6页
Microarray gene expression data are analyzed by means of a Bayesian nonparametric model, with emphasis on prediction of future observables, yielding a method for selection of differentially expressed genes and the cor... Microarray gene expression data are analyzed by means of a Bayesian nonparametric model, with emphasis on prediction of future observables, yielding a method for selection of differentially expressed genes and the corresponding classifier. 展开更多
关键词 Bayesian Nonparametrics Dirichlet Process Microarray data Differential gene expression classification
暂未订购
基于差异共表达邻接网络的癌症致病基因预测算法
10
作者 李志杰 廖旭红 +1 位作者 李青蓝 刘丽 《计算机科学》 北大核心 2025年第5期161-170,共10页
癌症是人类健康的第一杀手。随着测序技术的快速发展,积累了海量的癌症基因表达数据,利用计算方法进行致病基因预测成为癌症研究领域新的热点。然而,目前致病基因预测大多基于基因相互作用网络等,很少考虑网络局部连接与基因差异表达间... 癌症是人类健康的第一杀手。随着测序技术的快速发展,积累了海量的癌症基因表达数据,利用计算方法进行致病基因预测成为癌症研究领域新的热点。然而,目前致病基因预测大多基于基因相互作用网络等,很少考虑网络局部连接与基因差异表达间的潜在联系。针对上述问题,首先利用患病前后的基因表达差异数据,通过互信息计算基因间的相关性并构建邻接网络,然后设计特征向量模型用于癌症致病基因预测。向量特征包括候选基因及其近邻的差异表达信息。从TCGA,OMIM和GEO等公共数据库获取癌症相关的致病与非致病基因以及患病前后基因差异表达数据进行实验,利用邻接网络中基因及其近邻的差异表达信息进行癌症致病基因预测(Differential Information of Gene and Nearest Neighbor for Cancer Pathogenic Gene Prediction,DICPG)。实验结果表明,DICPG癌症基因分类模型的生物学意义明显,分类精度和AUC等性能指标优于同类方法。 展开更多
关键词 基因差异表达数据 邻接网络 候选基因 基因特征向量 癌症致病基因预测
在线阅读 下载PDF
A new distributed feature selection technique for classifying gene expression data
11
作者 Sarah M.Ayyad Ahmed I.Saleh Labib M.Labib 《International Journal of Biomathematics》 SCIE 2019年第4期79-109,共31页
Classification of gene expression data is a pivotal research area that plays a substantial role in diagnosis and prediction of diseases. Generally, feature selection is one of the extensively used techniques in data m... Classification of gene expression data is a pivotal research area that plays a substantial role in diagnosis and prediction of diseases. Generally, feature selection is one of the extensively used techniques in data mining approaches, especially in classification. Gene expression data are usually composed of dozens of samples characterized by thousands of genes. This increases the dimensionality coupled with the existence of irrelevant and redundant features. Accordingly, the selection of informative genes (features) becomes difficult, which badly affects the gene classification accuracy. In this paper, we consider the feature selection for classifying gene expression microarray datasets. The goal is to detect the most possibly cancer-related genes in a distributed manner, which helps in effectively classifying the samples. Initially, the available huge amount of considered features are subdivided and distributed among several processors. Then, a new filter selection method based on a fuzzy inference system is applied to each subset of the dataset. Finally, all the resulted features are ranked, then a wrapper-based selection method is applied. Experimental results showed that our proposed feature selection technique performs better than other techniques since it produces lower time latency and improves classification performance. 展开更多
关键词 Feature selection gene expression dimensionality reduction MICROARRAY data classification DISTRIBUTED learning Mathematics SUBJECT classification
原文传递
基于输出不一致测度的极限学习机集成的基因表达数据分类 被引量:43
12
作者 陆慧娟 安春霖 +2 位作者 马小平 郑恩辉 杨小兵 《计算机学报》 EI CSCD 北大核心 2013年第2期341-348,共8页
选择性集成学习已经成为分析基因表达数据、获取生物学信息的有力工具.为了更好地挖掘基因表达数据,利用极限学习机的集成,克服单个ELM用于数据分类时性能欠稳定的缺点,文中提出了一种基于输出不一致测度的ELM相异性集成算法(D-D-ELM).... 选择性集成学习已经成为分析基因表达数据、获取生物学信息的有力工具.为了更好地挖掘基因表达数据,利用极限学习机的集成,克服单个ELM用于数据分类时性能欠稳定的缺点,文中提出了一种基于输出不一致测度的ELM相异性集成算法(D-D-ELM).算法首先以输出不一致测度为标准对多个ELM模型进行相异性判断,其次根据ELM的平均分类精度剔除掉相应的模型,最后对筛选后的分类模型用多数投票法进行集成.算法被运用到Breast、Leukemia、Colon、Heart基因表达数据上,并通过理论和实验得到验证.实验结果的统计学分析表明D-D-ELM能够以更少的模型数量达到较稳定的分类精度. 展开更多
关键词 极限学习机 基因表达数据 集成算法 输出不一致测度 分类
在线阅读 下载PDF
肿瘤基因表达谱分类特征基因选取问题及分析方法研究 被引量:45
13
作者 李颖新 李建更 阮晓钢 《计算机学报》 EI CSCD 北大核心 2006年第2期324-330,共7页
对肿瘤分类特征基因选取问题的研究是发现肿瘤特异表达基因、研究肿瘤基因表达模式的重要手段.文中基于多类别肿瘤基因表达谱数据集,从研究肿瘤与正常组织的分类入手,对肿瘤分类特征基因选取问题进行分析和研究.首先对基于Relief算法的... 对肿瘤分类特征基因选取问题的研究是发现肿瘤特异表达基因、研究肿瘤基因表达模式的重要手段.文中基于多类别肿瘤基因表达谱数据集,从研究肿瘤与正常组织的分类入手,对肿瘤分类特征基因选取问题进行分析和研究.首先对基于Relief算法的特征选取策略加以改进生成候选特征集合;然后以支持向量机作为分类器对其分类性能进行检验以选取分类特征基因;最后结合分类模型,利用灵敏度分析方法进行特征基因的精确搜索以滤除冗余.基于该方法文中选出了52个具有良好分类性能的特征基因作为肿瘤的基因特征,并对其表达行为进行了简要分析. 展开更多
关键词 肿瘤 基因表达 特征基因 组织分类 特征选取 支持向量机
在线阅读 下载PDF
基因表达谱聚类/分类技术研究及展望 被引量:6
14
作者 李杰 唐降龙 +1 位作者 王亚东 李霞 《生物工程学报》 CAS CSCD 北大核心 2005年第4期667-673,共7页
随着人类及多种模式生物全基因组测序基本完成,人类基因组计划的研究进入后基因组时代.后基因组时代研究的焦点已经从测序转向功能研究。聚类/分类技术作为分析基因表达谱和识别基因功能的重要工具之一,近年来获得很大的发展。对目前基... 随着人类及多种模式生物全基因组测序基本完成,人类基因组计划的研究进入后基因组时代.后基因组时代研究的焦点已经从测序转向功能研究。聚类/分类技术作为分析基因表达谱和识别基因功能的重要工具之一,近年来获得很大的发展。对目前基因表达谱聚类/分类技术及它们的发展,进行了综述性的研究,分析了它们的优缺点,结合我们的研究,提出了解决问题的思路和方法,为基因表达谱的进一步研究提供了新的途径。 展开更多
关键词 基因表达谱 分类 聚类 基因调控网络 逆向工程
在线阅读 下载PDF
嵌入代价敏感的极限学习机相异性集成的基因表达数据分类 被引量:7
15
作者 安春霖 陆慧娟 +1 位作者 魏莎莎 杨小兵 《计算机科学》 CSCD 北大核心 2014年第12期211-215,共5页
极限学习机的相异性集成算法(Dissimilarity Based Ensemble of Extreme Learning Machine,D-ELM)在基因表达数据分类中能够得到较稳定的分类效果,然而这种分类算法是基于分类精度的,当所给样本的误分类代价不相等时,不能直接实现代价... 极限学习机的相异性集成算法(Dissimilarity Based Ensemble of Extreme Learning Machine,D-ELM)在基因表达数据分类中能够得到较稳定的分类效果,然而这种分类算法是基于分类精度的,当所给样本的误分类代价不相等时,不能直接实现代价敏感分类过程中的最小平均误分类代价的要求。通过在分类过程中引入概率估计以及误分类代价和拒识代价重新构造分类结果,提出了基于相异性集成极限学习机的代价敏感算法(CS-D-ELM)。该算法被运用到基因表达数据集上,得到了较好的分类效果。 展开更多
关键词 极限学习机 相异性集成 代价敏感 基因表达数据 分类
在线阅读 下载PDF
基于人工蜂群和SVM的基因表达数据分类 被引量:10
16
作者 叶明全 高凌云 万春圆 《山东大学学报(工学版)》 CAS 北大核心 2018年第3期10-16,共7页
基因表达数据存在高维、小样本、高噪声等特性,使得相应的肿瘤分类诊断面临着一定的挑战。为了实现更加精确的分类准确率,利用人工蜂群(artificial bee colony,ABC)算法对支持向量机(support vector machine,SVM)的核函数参数和惩罚因... 基因表达数据存在高维、小样本、高噪声等特性,使得相应的肿瘤分类诊断面临着一定的挑战。为了实现更加精确的分类准确率,利用人工蜂群(artificial bee colony,ABC)算法对支持向量机(support vector machine,SVM)的核函数参数和惩罚因子进行优化,采用准确率作为分类模型的适应度函数,提出一种基于ABC和SVM的基因表达数据分类方法 ABC-SVM。在6种公开的肿瘤基因表达数据集上进行试验,并对比分析其他的分类方法。结果表明,在筛选得到的较少信息基因基础上,ABC-SVM可获得更高的肿瘤分类准确率,对肿瘤样本类型进行更有效的分类预测。 展开更多
关键词 人工蜂群 支持向量机 基因表达数据 智能优化 肿瘤分类 生物信息学
原文传递
基于加权极限学习机的肿瘤基因表达谱数据分类 被引量:4
17
作者 姜琳颖 余东海 石鑫 《东北大学学报(自然科学版)》 EI CAS CSCD 北大核心 2017年第6期798-803,共6页
基因表达谱数据一般来源于临床试验,而在临床试验中,试验样本的类分布情况是不确定的,这就使得表达谱数据往往具有比较明显的不平衡性.采用加权极限学习机来对不平衡基因表达谱数据进行分类,为了减少因为不平衡数据引起的分类误差,一个... 基因表达谱数据一般来源于临床试验,而在临床试验中,试验样本的类分布情况是不确定的,这就使得表达谱数据往往具有比较明显的不平衡性.采用加权极限学习机来对不平衡基因表达谱数据进行分类,为了减少因为不平衡数据引起的分类误差,一个临时的权重被分配给每一个样本以增强少样本类的影响,同时减少多样本类的影响,进而提高肿瘤分类的准确率.实验结果表明,所提方法能够提高少样本类的识别率,从而提高分类器的总体性能. 展开更多
关键词 基因 表达谱数据 加权极限学习机 不平衡性 肿瘤分类
在线阅读 下载PDF
一种改进的基因表达数据分类方法 被引量:3
18
作者 蔡立军 沈小乔 +1 位作者 林亚平 蒋林波 《湖南大学学报(自然科学版)》 EI CAS CSCD 北大核心 2007年第3期79-82,共4页
从分类算法和特征基因选择两个方面研究基因表达数据的分类,将传统的Support Vector Machines(SVM)算法和K-nearest neighbor(KNN)算法两者结合成为一种应用于基因表达数据分类的算法,并针对基因表达数据分类数据集“样本少,维数高”的... 从分类算法和特征基因选择两个方面研究基因表达数据的分类,将传统的Support Vector Machines(SVM)算法和K-nearest neighbor(KNN)算法两者结合成为一种应用于基因表达数据分类的算法,并针对基因表达数据分类数据集“样本少,维数高”的特点,提出了一种改进的基于相关性的递归特征消除算法(简称为C-RFE),消除了数据冗余.实验结果表明,新方法可有效提高分类准确率和特征选取的效率. 展开更多
关键词 基因表达数据分类 SVM KNN 特征选择
在线阅读 下载PDF
支持向量机在基因表达数据分类中的应用研究 被引量:7
19
作者 武振宇 李康 《中国卫生统计》 CSCD 北大核心 2007年第1期8-11,共4页
目的探讨支持向量机在基因表达数据分类研究中的应用条件和效果。方法使用支持向量机软件包,通过实际基因表达数据考核其应用效果,并通过模拟试验进一步验证和研究在含有大量无差异表达基因情况下对分类产生的影响。结果对四种疾病的真... 目的探讨支持向量机在基因表达数据分类研究中的应用条件和效果。方法使用支持向量机软件包,通过实际基因表达数据考核其应用效果,并通过模拟试验进一步验证和研究在含有大量无差异表达基因情况下对分类产生的影响。结果对四种疾病的真实基因表达数据的分类取得了良好的效果,模拟试验则显示了支持向量机对分类具有较高的准确性,但随无差异基因数量的增加其分类效果呈明显下降的趋势;在类间分离一定的情况下,差异表达基因数目较多、基因之间具有较高的相关性时,更容易获得好的分类效果。结论支持向量机在解决小样本、非线性及高维问题中表现出许多潜在的优势,可以有效地用于分析基因表达数据的分类问题。 展开更多
关键词 支持向量机 基因表达数据 分类研究 模拟试验
在线阅读 下载PDF
癌症基因分类的Laplace谱方法 被引量:2
20
作者 王年 庄振华 +2 位作者 范益政 李学俊 王继 《电子学报》 EI CAS CSCD 北大核心 2011年第7期1594-1597,共4页
本文尝试着将图的Laplace谱理论应用于癌症基因表达谱数据的分类上.计算出训练集中每个类的均值作为类中心,选出与类中心欧式距离最小的若干样本用laplace矩阵构造完全图,记为代表该类的标准图.用待测样本依次替换标准图中所有的点,将... 本文尝试着将图的Laplace谱理论应用于癌症基因表达谱数据的分类上.计算出训练集中每个类的均值作为类中心,选出与类中心欧式距离最小的若干样本用laplace矩阵构造完全图,记为代表该类的标准图.用待测样本依次替换标准图中所有的点,将生成的新图与标准图进行特征点匹配,并计算匹配点数总和.将待测样本划分为总匹配点数最多的那个类.通过对白血病两个亚型(ALL与AML)与结肠癌数据进行留一法实验,验证了本文方法的有效性. 展开更多
关键词 分类 基因表达谱数据 LAPLACE谱
在线阅读 下载PDF
上一页 1 2 4 下一页 到第
使用帮助 返回顶部