
Splice Site Identification Based on Hidden Markov Support Vector Machines (HM-SVM)

Cited by: 1
Abstract: Accurate prediction of splice sites is the first step in the systematic study of eukaryotic genes. To obtain more accurate predictions, this paper applies Hidden Markov Support Vector Machines (HM-SVM), a recent discriminative learning method for label sequences, to splice site identification. Exploiting the sequence conservation found in the vicinity of splice sites, joint kernel learning is incorporated into a maximum-margin classifier and combined with the HM-SVM working set optimization algorithm to build a robust classifier. Experimental results show that this approach achieves a higher splice site recognition rate than commonly used machine learning methods.
Source: Control & Automation (《微计算机信息》), PKU Core Journal (北大核心), 2006, No. 12S, pp. 240-242 (3 pages)
Keywords: Hidden Markov Support Vector Machines; splice site; identification
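The abstract describes a sequence classifier whose discriminant scores a whole label sequence through a joint feature map, with decoding over label sequences. The sketch below illustrates that general idea only and is not the paper's method. It assumes a linear joint feature map (nucleotide-label emission counts plus label-label transition counts) in place of the joint kernel, and it swaps in a structured perceptron update for the HM-SVM working set optimization. The names `viterbi`, `train`, and `predict`, the two-label scheme, and the toy sequences are all hypothetical.

```python
import numpy as np

NUC = {"A": 0, "C": 1, "G": 2, "T": 3}  # nucleotide -> feature index


def viterbi(emit, trans):
    """Return the label sequence with the highest total score.

    emit:  (T, K) array, score of each label at each position.
    trans: (K, K) array, score of moving from label i to label j.
    """
    T, K = emit.shape
    score = emit[0].copy()
    back = np.zeros((T, K), dtype=int)
    for t in range(1, T):
        cand = score[:, None] + trans       # cand[i, j]: best path ending in i, then i -> j
        back[t] = cand.argmax(axis=0)       # best previous label for each current label
        score = cand.max(axis=0) + emit[t]
    path = [int(score.argmax())]
    for t in range(T - 1, 0, -1):           # follow back-pointers to recover the path
        path.append(int(back[t, path[-1]]))
    return path[::-1]


def train(seqs, labels, n_labels=2, epochs=300):
    """Structured perceptron (an assumed stand-in for the HM-SVM optimizer).

    On each mistake the weights move by Phi(x, gold) - Phi(x, pred), where Phi
    counts (label, nucleotide) and (previous label, label) pairs.
    """
    W_emit = np.zeros((n_labels, len(NUC)))
    W_trans = np.zeros((n_labels, n_labels))
    for _ in range(epochs):
        mistakes = 0
        for seq, gold in zip(seqs, labels):
            gold = list(gold)
            x = [NUC[c] for c in seq]
            pred = viterbi(W_emit[:, x].T, W_trans)
            if pred == gold:
                continue
            mistakes += 1
            for t in range(len(x)):
                W_emit[gold[t], x[t]] += 1
                W_emit[pred[t], x[t]] -= 1
                if t > 0:
                    W_trans[gold[t - 1], gold[t]] += 1
                    W_trans[pred[t - 1], pred[t]] -= 1
        if mistakes == 0:                   # every training sequence decoded correctly
            break
    return W_emit, W_trans


def predict(seq, W_emit, W_trans):
    """Decode the highest-scoring label sequence for a nucleotide string."""
    x = [NUC[c] for c in seq]
    return viterbi(W_emit[:, x].T, W_trans)
```

As a usage illustration under the same assumptions, calling `train(["AAGTAA", "CAGTCC"], [[0, 0, 1, 1, 0, 0], [0, 0, 1, 1, 0, 0]])` fits weights on two invented sequences in which label 1 marks a GT dinucleotide, and `predict` then decodes a label sequence for any string over A, C, G, T. A margin-based HM-SVM would replace the perceptron update with a constrained quadratic program over a working set of violated label sequences.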

References (10)

  • 1. Wen Fang, Lu Xin, Sun Zhirong, Li Yanda. Splice site identification based on support vector machines (SVM) [J]. Acta Biophysica Sinica, 1999, 15(4): 733-739. Cited by: 19
  • 2. Sun Y-F, Fan X-D, et al. Identifying splicing sites in eukaryotic RNA: support vector machine approach. Computers in Biology and Medicine, 2003, 33: 17-29.
  • 3. Barmak Modrek, Christopher Lee. A genomic view of alternative splicing. Nature Genetics, 2002, 30: 13-19.
  • 4. Barmak Modrek, Alissa Resch, et al. Genome-wide detection of alternative splicing in expressed sequences of human genes. Nucleic Acids Research, 2001, 29: 2850-2859.
  • 5. Y-H Huang, Y-T Chen. PALS db: Putative Alternative Splicing database. Nucleic Acids Research, 2002, 30: 186-190.
  • 6. Vapnik V. Statistical Learning Theory. John Wiley, New York, 1998.
  • 7. Xia Huiyu, Zhou Qing, Li Yanda. Application of hidden Markov models to splice site identification [J]. Journal of Tsinghua University (Science and Technology), 2002, 42(9): 1214-1217. Cited by: 9
  • 8. Yan Chun, Du Yaohua, Gao Qingbin, Wang Zhengzhi. Support vector machine based identification of splice sites in human 5' untranslated regions [J]. Acta Biophysica Sinica, 2005, 21(4): 284-288. Cited by: 6
  • 9. Mao Liqun. Extracting mouth-shape information from continuous speech using HMM [J]. Control & Automation (微计算机信息), 2006(01Z): 201-202. Cited by: 5
  • 10. Yasemin Altun, Ioannis Tsochantaridis, Thomas Hofmann. Hidden Markov Support Vector Machines. Proceedings of the Twentieth International Conference on Machine Learning (ICML 2003), Washington DC, 2003.


