期刊文献+

基于特征Boosting的真核启动子预测方法

Eukaryotic promoter prediction based on feature-boosting
在线阅读 下载PDF
导出
摘要 提出了一个新的启动子检测方法,它基于以下假设:启动子是由一些词模式决定的且不同的启动子由不同的词决定。通过计算散度距离选择最可能的特征并用feature-boosting构造一系列的弱分类器。一定数目的弱分类器可构造一强分类器,这样就可以达到一个较好的性能。和其他分类器不同的是,采用了不同的训练和分类策略。对大型基因序列实验结果和一些较好的算法比较显示该方法预测启动子区域是有效的,且具有较好的敏感性和特异性。 A new method is presented for promoter prediction based on the following hypothesis:Promoter is determined by some word patterns and different promoters are determined by different words.Most potential features are selected by divergence distance to build a sequence of weak classifier by feature-boosting.A number of weak classifiers construct a strong classifier,which can achieve a better performance.Different from other classifier,a different training and classifying strategy is adopted.Experimental results on large genomic sequences and comparisons with several excellent algorithms show that the algorithm is efficient with higher sensitivity and specificity in predicting promoter regions.
出处 《计算机工程与应用》 CSCD 北大核心 2009年第4期180-182,195,共4页 Computer Engineering and Applications
基金 国家自然科学基金No.60772028~~
关键词 DNA序列分析 启动子预测 词模式 特征boosting DNA sequence analysis promoter prediction word patterns feature-boosting
  • 相关文献

参考文献11

  • 1Lander E S.Initial sequencing and analysis of the human genome[J]. Nature, 2001,409(6822) : 860-921.
  • 2Wu S H,Xie X D,Liew A W,et al.Eukaryotic promoter prediction based on relative entropy and positional information [J].Physical Review E,2007,75(041908): 1-7.
  • 3Knudsen S.Promoter2.0:For the recognition of Po Ⅲ promoter sequenees[J].Bioinformatics, 1999,15 : 356-361.
  • 4Down T A,Hubbard T J.Computational detection and location of transcription start sites in mammalian genomic DNA [J].Genome Res,2002,12:458-461.
  • 5Bajic V B,Seah S H,Chong A,et al.Computer model for recognition of functional transcription start sites in polymerase Ⅱ promoters of vertebrates[J].Journal of Molecular Graphics & Modeling,2003, 21:323-332.
  • 6Prestridge D S,Burks C.The density of transcription elements in promoter and non-promoter sequences[J].Hum Mol Genet, 1993,2 :1449-1453.
  • 7Cross S H,Clark V H,Bird A P.Isolation of CpG islands from large genomic clones[J].Nuclcic Acids Res, 1999,27:2099-2107.
  • 8Davuluri R V,Grosse I,Zhang M Q.Computational identification of promoters and first exons in the human genome[J].Nat Genet,2001, 29:412-417.
  • 9Scherf M,Klingenhoff A,Werner T.Highly specific localization of promoter regions in large genomic sequence by Promoterlnspector: A novel context analysis approach[J].J Mol Boil,2000,297:599-606.
  • 10Sergios T,Konstantines K.Pattern recognition[M].San Diego,USA: Academic Press, 2003.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部