基于支持向量机的特征提取方法研究被引量：1

Research on Uncorrelated Linear Discriminant Analysis Based on Support Vector Machine

下载PDF

导出

摘要基因表达数据的一个重要应用是给组织样本进行分类。在基因表达数据中,基因的数量相对于数据样本的个数通常比较多;也就是说,可以得到变量数(基因数)远远大于样本数的数据矩阵。过高的维数(变量或基因数)将给分类问题带来极大的挑战。本文提出结合一种新的特征提取方法——非相关线性判别式分析方法(ULDA)和支持向量机(SVM)分类算法,对结肠癌组织样本进行分类识别。并同其它方法作了比较研究,结果表明了该方法的可行性和有效性。 One important application of microarray gene expression data is classification of tissue samples. In gene expression da- ta, the number of genes is usually very high compared to the number of data samples; that is, we can obtain the data matrix with the number of variables （genes）far exceeding the number of samples. Too high dimension （the number of variables or genes） makes the task of classification quite challenging. This paper presents that a new feature extraction method ULDA and SVM are combined to classify colon tissue samples, Compared to other methods, the effect of classification is improved, the results prove the feasibility and effectiveness of this method,

作者张小丹吕建平

机构地区苏州大学电子信息学院

出处《计算机与现代化》 2008年第8期104-106,109,共4页 Computer and Modernization

关键词非相关线性判别分析支持向量机基因表达谱特征提取分类 uncorrelated linear discriminant analysis SVM gene expression profiling feature extraction classification

分类号 TP391.4 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献13

1Ward J J, McGuffin L J, Buxton B F, et al. Secondary structure prediction with support vector machines [ J ]. Bioinformatics, 2003,19 ( 13 ) : 1650-1655.
2Alon U, Barkai N, Notterman D A, et al. Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays [C]//Proc. Natl Academy of Science, 1999,96:6745- 6750.
3A Ben-Dor, Bruhn L,Friedman N,et al. Tissue classification with gene expression profiles[ J]. Journal of Computational Biology, 2000,7:559-584.
4Belhumeour P N, Hespanha J P, Kriegman D J. Eigenfaces vs. fisherfaces:recognition using class specific linear projection[J]. IEEE Trans. Pattern Analysis and Machine Intelligence, 1997,19(7) :711-720.
5Brown M P S, Grundy W N, Lin D,et al. Knowledge-based analysis of microarray gene expression data by using support vector machines [ C ]//Proc. Nat'1 Academy of Science, 2000,97:262-267.
6Zhao Y, Pinilla C, Valmori D, et al. Application of support vector machines for T-cell epitopes prediction[ J ]. Bioinformatics, 2003,19(15) :1978-1984.
7Golub G H,Van Loan C F. Matrix Computatlons(Thlrd Edition) [ M ]. Baltimore and London, The Johns Hopkins Univ. Press, 1996.
8Dudoit S, Fridlyand J, Speed T P. Comparison of discrimination methods for the classification of tumors using gene expression data[J]. J. Am. Statistical Assoc., 2002,97: 77-87.
9Howland P, Jeon M, Park H. Structure preserving dimension reduction for clustered text data based on the generalized singular value decomposition[J]. SIAM J. Matrix Analysis Applications, 2003,25 ( 1 ) : 165-179.
10Furey T S, Cristianini N, Duffy N, et al. Support vector machine classification and validation of cancer tissue sampies using microarray expression data [ J ]. Bioinformatics, 2000,16(10) : 906-914.

二级参考文献7

1Golub TR, Slonim DK, Tamayo P, et al. Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring[J]. Science, 1999,286:531-537.
2Alizadeh AA, Eisen MB, Davis RE, et al. Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling[J]. Nature, 2000,403:503-511.
3Cheeseman P, Stutz J. Advances in knowledge discovery and data mining[J]. Bayesian classification(Autoclass): theory and results, 1996.153-180. AAAI Press/MIT Press, Cambridge,MA.
4Wallace CS, Dowe DL. MML clustering of multi-state, Poisson, von Mises circular and Gaussian distributions[J]. Statistics and Computing, 2000,10:73-83.
5Eisen MB, Spellman PT, Brown P, et al. Cluster analysis and display of genome-wide expression patterns[J]. Proc Natl Acad Sci USA, 1998,95:14863-14868.
6Stamatopoulos K, Kosmas C, Belessi C, et al. Molecular insights into the immunopathogenesis of follicular lymphoma[J]. Immunol Today, 2000,21:298-305.
7Staudt LM, Dent AL, Shaffer AL, et al. Regulation of lymphocyte cell fate decisions and lymphomagenesis by BCL-6[J]. Int Rev Immunol, 1999,18:381-403.

共引文献43

1RUAN Xiaogang,LI Yingxin,LI Jiangeng,GONG Daoxiong,WANG Jinlian.Tumor-specific gene expression patterns with gene expression profiles[J].Science China(Life Sciences),2006,49(3):293-304. 被引量：2
2李颖新,朱云华,阮晓钢.基于支持向量机的肿瘤亚型识别[J].计算机工程与应用,2004,40(17):1-3. 被引量：2
3李颖新,阮晓钢.基于基因表达谱的肿瘤亚型识别与分类特征基因选取研究[J].电子学报,2005,33(4):651-655. 被引量：18
4李颖新,刘全金,阮晓钢.急性白血病的基因表达谱分析与亚型分类特征的鉴别[J].中国生物医学工程学报,2005,24(2):240-244. 被引量：19
5王磊,王文云,王乔.基于软件构件技术的KDD算法的设计与实现[J].计算机工程与设计,2005,26(9):2467-2469. 被引量：2
6李颖新,阮晓钢.基于支持向量机的肿瘤分类特征基因选取[J].计算机研究与发展,2005,42(10):1796-1801. 被引量：50
7刘全金,李颖新,朱云华,阮晓钢.基于BP神经网络的肿瘤特征基因选取[J].计算机工程与应用,2005,41(34):184-186. 被引量：6
8刘全金,李颖新,阮晓钢.基于基因表达谱的结肠癌特征基因选取[J].昆明理工大学学报（理工版）,2006,31(1):89-92. 被引量：4
9阮晓钢,李颖新,李建更,龚道雄,王金莲.基于基因表达谱的肿瘤特异基因表达模式研究[J].中国科学（C辑）,2006,36(1):86-96. 被引量：5
10韦振中,黄廷磊.基于支持向量机和遗传算法的特征选择[J].广西工学院学报,2006,17(2):18-21. 被引量：12

同被引文献3

1李颖新,刘全金,阮晓钢.急性白血病的基因表达谱分析与亚型分类特征的鉴别[J].中国生物医学工程学报,2005,24(2):240-244. 被引量：19
2刘全金,李颖新.Boosting算法在基因表达谱样本分类中的应用[J].计算机工程与应用,2008,44(14):228-230. 被引量：2
3游伟,李树涛,谭明奎.基于SVM-RFE-SFS的基因选择方法[J].中国生物医学工程学报,2010,29(1):93-99. 被引量：11

引证文献1

1李烨,王永丽,贺国平.基于支持向量机的结肠癌信息基因提取[J].山东科技大学学报（自然科学版）,2012,31(3):84-89. 被引量：3

二级引证文献3

1冯菁华,高媛,陈玕,彭翼飞,郑文岭,马文丽.ppGalNac-T10对人结直肠癌细胞株LoVo的生物学特性的影响[J].生命科学研究,2012,16(5):428-432.
2宋彩风,刘伟锋,王延江.基于稀疏学习的人脸表情识别[J].山东科技大学学报（自然科学版）,2013,32(3):28-34. 被引量：2
3汤亚东,谢鹭,陈兰明.改进CKSAAP结合RFE算法预测蛋白质棕榈酰化位点[J].计算机工程与应用,2019,55(5):143-148.

1张小丹,吕建平.基于SVM的非相关线性判别分析算法研究[J].计算机工程与应用,2008,44(4):227-229. 被引量：4
2沈海生.医院样本信息管理系统的设计与实现[J].信息与电脑（理论版）,2013,0(8):25-27.
3徐兵,于骏一.车间作业调度问题的染色体非完整表示方法[J].吉林大学学报（工学版）,2003,33(4):48-50. 被引量：1
4李艳芳,高大启.Fisher线性判别式阈值优化方法研究[J].计算机应用与软件,2016,33(6):141-145. 被引量：2
5肖晓明,陈志兴,高平安.动态确定基因数的遗传算法路径规划[J].计算机应用研究,2009,26(7):2469-2470. 被引量：3
6张德丰.聚类与动态RBF网络的模式识别应用研究[J].计算机工程与应用,2009,45(16):204-207. 被引量：2
7张德丰,周灵,孙亚民.基于动态径向基神经网络的人脸识别算法研究[J].计算机工程与应用,2012,48(2):203-206.
8黄紫成.主成分分析和支持向量机在微阵列数据分析的应用[J].现代计算机（中旬刊）,2013(9):26-28.
9张爱华,杨凤霞,王润东.基于脉象信号的亚健康状态的识别[J].兰州理工大学学报,2006,32(6):82-84. 被引量：17
10曹元大,蔡刿.组播QoS路由的遗传算法研究[J].计算机工程,2004,30(7):80-81. 被引量：6

计算机与现代化

2008年第8期

浏览历史

内容加载中请稍等...

基于支持向量机的特征提取方法研究被引量：1

参考文献13

二级参考文献7

共引文献43

同被引文献3

引证文献1

二级引证文献3

相关作者

相关机构

相关主题

浏览历史

基于支持向量机的特征提取方法研究 被引量：1

参考文献13

二级参考文献7

共引文献43

同被引文献3

引证文献1

二级引证文献3

相关作者

相关机构

相关主题

浏览历史

基于支持向量机的特征提取方法研究被引量：1