期刊文献+

基于密度K中心方法的核酸序列聚类

Cluster of Nucleic Acid Sequences Based on Density K-medoids Method
在线阅读 下载PDF
导出
摘要 针对传统K中心聚类算法存在的初始化敏感、聚类结果多样化等问题,提出一种基于密度的K中心聚类方案,并与序列比对、动态规划等方法有机地融合在一起,实现了对核酸序列的聚类分析。实验表明,该方案与传统K中心聚类算法相比较,初始化较理想,迭代次数较少,聚类效果更优。 Due to the disadvantages of initialization and result in the K-medoids clustering algorithm, a new density-based K-medoids clustering is described. And it combines sequence alignment, dynamic programming and other theories, accomplishes the clustering analysis in the nucleic acid sequences. Experiments prove that this method has better initialization, less iterative times and satisfying results compared with the ordinary K-medoids clustering.
出处 《计算机工程》 EI CAS CSCD 北大核心 2006年第19期280-282,共3页 Computer Engineering
基金 科技部"新城疫防制技术平台"基金资助项目 江苏省动物预防医学重点实验室开放课题资助项目(K04005)
关键词 K中心聚类 直接密度可达 序列比对 动态规划 生物信息学 K-medoids cluster Direct arrived density Sequence alignment Dynamic programming Bioinformatics
  • 相关文献

参考文献6

二级参考文献16

  • 1[1]Henikoff S, Henikoff J G. Quantitative Comparative Research of the Similarity of Amino Acid Sequences. In Proceedings of the 89th National Academic Science Conference of USA. New York, 1992-10: 915-919
  • 2[2]Han J,Kamber M.Data Miniing:Concepts and Techniques.The Morgan Kaufmann Series in Data Management System. Jim Gray:Morgan Kaufmann Publishing,2001
  • 3[3]Dayhoff M O, Schwartz R M, Orcutt B C. A Model of Evolutionary Change in Proteins,in Atlas of Protein Sequences and Structure. Washing DC:National Biomedical Research Foundation,1978:345-358
  • 4[4]Jain A K,Murty M N,Flynn P J.Data Clustering:A survey.ACM Computer Survey,1999-2
  • 5[5]Bellman R.Dynamic Programming Treatment of the Traveling Salesm- an Problem.Journal of the ACM,1962-1
  • 6[6]Zhang Chunting.The Current Status and The Prospect of Bioinform- atics.http://tubic.tju.edu.cn,2002
  • 7[1]Yona G, Linial N, Linial M. ProtoMap:automatic classification of protein sequences,a hierarchy of protein families,and local maps of the protein space[J]. Proteins:Structure,Function and Genetics,1999,37(3):360-378.
  • 8[2]Abascal F, Valencia A. Clustering of proximal sequence space for the identification of protein families[J]. Bioinformatics,2002,18(7):908-921.
  • 9[3]Liu HA, Califano A. CASTOR:clustering algorithm for sequence taxonomical organization and relationships[J]. Journal of Computational Biology ,2003,10(1):21-45.
  • 10[4]Enright AJ, Ouzounis CA. GeneRAGE:a robust algorithm for sequence clustering and domain detection[J].Bioinformatics,2000,16(5):451-457.

共引文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部