IDENTIFICATION OF 5'UTRs SPLICE SITES IN HUMAN GENE BASED ON SUPPORT VECTOR MACHINE

Cited by: 6
Abstract  Identifying splice sites in the non-coding regions of genes is one of the most challenging problems in gene recognition, particularly for splice sites in human 5' untranslated regions (5'UTRs). Unlike ordinary splice sites, splice sites in 5'UTRs are not flanked by a transition from coding to non-coding sequence, so conventional splice site prediction algorithms perform poorly in untranslated regions. This paper uses a support vector machine (SVM) to identify splice sites in 5'UTRs. To improve recognition accuracy, the kernel function parameters are selected with a method based on a matrix similarity measure, which determines suitable kernel parameters simply and quickly and thereby improves the recognition performance of the kernel. Experiments show that the SVM, after parameter selection, identifies 5'UTR splice sites well.
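The abstract does not say which matrix similarity measure or which kernel the authors used. As a minimal sketch of the general idea, the example below assumes an RBF kernel and uses kernel-target alignment (the Frobenius cosine between the kernel matrix and the ideal label matrix) as the similarity measure; both choices, the one-hot encoding, the toy sequence windows, and all function names are illustrative assumptions rather than the paper's method.

```python
import math
import random


def one_hot(seq):
    """Encode a DNA string as a flat vector, 4 bits per base (A, C, G, T)."""
    table = {"A": (1, 0, 0, 0), "C": (0, 1, 0, 0),
             "G": (0, 0, 1, 0), "T": (0, 0, 0, 1)}
    return [bit for base in seq for bit in table[base]]


def rbf_kernel_matrix(X, gamma):
    """K[i][j] = exp(-gamma * ||x_i - x_j||^2)."""
    n = len(X)
    K = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i, n):
            d2 = sum((a - b) ** 2 for a, b in zip(X[i], X[j]))
            K[i][j] = K[j][i] = math.exp(-gamma * d2)
    return K


def alignment(K, y):
    """Frobenius cosine between K and the ideal matrix y y^T (labels +1/-1)."""
    n = len(y)
    num = sum(K[i][j] * y[i] * y[j] for i in range(n) for j in range(n))
    norm_k = math.sqrt(sum(v * v for row in K for v in row))
    return num / (norm_k * n)  # the Frobenius norm of y y^T is n


def select_gamma(X, y, grid):
    """Return the gamma in `grid` whose kernel matrix best matches y y^T."""
    scores = {g: alignment(rbf_kernel_matrix(X, g), y) for g in grid}
    return max(scores, key=scores.get), scores


# Hypothetical toy data: 8-base windows with GT (donor-like) or CA at
# positions 3-4. Real 5'UTR splice site windows would replace these.
rng = random.Random(0)


def random_window(k):
    return "".join(rng.choice("ACGT") for _ in range(k))


positives = [w[:3] + "GT" + w[5:] for w in (random_window(8) for _ in range(20))]
negatives = [w[:3] + "CA" + w[5:] for w in (random_window(8) for _ in range(20))]
X = [one_hot(s) for s in positives + negatives]
y = [1] * len(positives) + [-1] * len(negatives)

best_gamma, scores = select_gamma(X, y, [0.001, 0.01, 0.1, 1.0, 10.0])
print("selected gamma:", best_gamma)
```

Because the score only needs the kernel matrix and the labels, no SVM has to be trained for each candidate value, which is what makes this kind of selection fast. The selected value could then be passed to an SVM trainer, for example `sklearn.svm.SVC(kernel="rbf", gamma=best_gamma)`.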
Source  Acta Biophysica Sinica (《生物物理学报》), CAS, CSCD, PKU Core; 2005, No. 4, pp. 284-288 (5 pages)
Funding  National Natural Science Foundation of China (60471003)
Keywords  5'UTRs splice sites; identification; support vector machine; kernel; parameter selection
References (12)

  • 1. Davuluri RV, Suzuki Y, Sugano S, Zhang MQ. CART classification of human 5'UTR sequences. Genome Res, 2000, 10: 1807-1816.
  • 2. Meijer HA, Thomas AAM. Control of eukaryotic protein synthesis by upstream open reading frames in the 5'-untranslated region of an mRNA. Biochem J, 2002, 367: 1-11.
  • 3. Brunak S, Engelbrecht J, Knudsen S. Prediction of human mRNA donor and acceptor sites from the DNA sequence. J Mol Biol, 1991, 220: 49-65.
  • 4. Reese MG, Kulp D, Tammana H, Haussler D. Genie-gene finding in Drosophila melanogaster. Genome Res, 2000, 10: 529-538.
  • 5. Pertea M, Lin XY, Salzberg SL. GeneSplicer: a new computational method for splice prediction. Nucleic Acids Res, 2001, 29(5): 1185-1190.
  • 6. Burge C, Karlin S. Prediction of complete gene structure in human genomic DNA. J Mol Biol, 1997, 268: 78-94.
  • 7. Eden E, Brunak S. Analysis and recognition of 5'UTR intron splice sites in human pre-mRNA. Nucleic Acids Res, 2004, 32(3): 1131-1142.
  • 8. Vapnik VN. The nature of statistical learning theory. NY: Springer-Verlag, 1995.
  • 9. Zien A, Rätsch G, Mika S, Schölkopf B, Lengauer T, Müller KR. Engineering support vector machine kernels that recognize translation initiation sites. Bioinformatics, 2000, 16(9): 799-807.
  • 10. Zhang SW, Pan Q, Zhang HC, Zhang YL, Wang HY. Classification of protein quaternary structure with support vector machine. Bioinformatics, 2003, 19(18): 2390-2396.
