期刊文献+

基于集成分类器的凋谢蛋白亚细胞定位预测方法

Ensemble classifier based predicting method of apoptosis protein subcellular localization
原文传递
导出
摘要 凋谢蛋白亚细胞定位预测是研究凋谢蛋白生物功能的1种重要的方法,也是生物信息学研究的重要领域之一。提高凋谢蛋白亚细胞定位预测模型准确性和实用性是该研究的重点。在本研究中,提出了以模糊K近邻分类算法作为基础分类器的集成分类算法。以蛋白质序列内不同间隔的二肽组成表示基本的蛋白质序列的特征集合,采用二进制粒子群算法作为特征选择方法提取能够有效的蛋白质序列特征。这些经过特征选择后的蛋白质序列特征作为集成分类算法中每一个基础分类器的输入向量。经过在2个常用的数据集上使用Jackknife测试,本文算法在CL317数据集上取得了91.5%的预测准确率,在ZW225数据集上取得了88.0%的准确率。与前人报道的算法预测结果比较,本文方法取得了较好的准确率。与使用相同数据集的已经报道凋谢蛋白亚细胞定位预测算法相比,本研究方法取得了预测准确率。 Predicting subcellular localization ofapoptosis protein is an important method to identify the biological function of a newly found protein sequence. Meanwhile, the research field is a hot field in bioinformatics. The prediction model with more accuracy, the model is better. In this study, an ensemble classifier based approach to predict algorithm is presented which aim to prediction apoptosis protein subcellular localization with feature selection. The binary particle swarm optimization is used as feature selection method, and the fuzzy K nearest neighbor classifier is applied as base classifier in ensemble one. Jackknife test is used to validate the performance of this approach. On the dataset of CL317, the accuracy rate is 91.5%, and accuracy rate is 88.0% on the dataset of ZW225. Compared with the results of reported piror works with the same datasets, presented algorithm achieves promising results.
作者 李爱明 魏蓉
出处 《计算机与应用化学》 CAS CSCD 北大核心 2010年第5期645-648,共4页 Computers and Applied Chemistry
关键词 凋谢蛋白 亚细胞定位预测 二进制粒子群算法 特征选择 apoptosis protein, prediction of subcellular location, binary particle swarm optimization, feature selection
  • 相关文献

参考文献5

二级参考文献54

  • 1李凤敏,李前忠.蛋白质亚细胞定位的识别[J].生物物理学报,2004,20(4):297-306. 被引量:11
  • 2黄静,石峰,周怀北.利用支持向量机和蛋白质非稳定性指标预测凋亡蛋白类型[J].生物信息学,2005,3(3):121-123. 被引量:3
  • 3张振慧,王正华,王勇献.利用分组重量编码预测细胞凋亡蛋白的亚细胞定位[J].生物物理学报,2006,22(4):275-282. 被引量:5
  • 4[9]Du QS,Jiang ZQ,He WZ,Li DP,Chou KC.Amino acid principal component analysis (AAPCA) and its applications in protein structural class prediction.Journal of Biomolecular Structure and Dynamics,2006,23:635~640
  • 5[10]Kurgan L,Hornaeian L.Prediction of structural classes for protein sequences and domains-impact of prediction algorithms,sequence representation and homology,and test procedures on accuracy.Pattern Recognition Letter,2006,39:2323~2343
  • 6[11]Kedarisetti KD,Kurgan LA,Dick S.Classifier ensembles for protein structural class prediction with varying homology.Biochem Biophys Res Comraun,2006,348:981~988
  • 7[12]Chou KC,Cai YD.Predicting protein structural class by functional domain composition.Biochemical and Biophysical Research Communications (Corrigendum:ibid.,2005,Vol.329,1362) 2004,321:1007~1009
  • 8[13]Can Y,Liu S,Zhang L,Qin J,Wang J,Tang K.Prediction of protein structural class with Rough Sets.BMC Bioinfor-matics,2006,7:20~25
  • 9[14]Cai YD,Liu XJ,Xu X,Zhou GP.Support vector machines for predicting protein structural class.BMC Bioinformatics,2001,2:3~7
  • 10[15]Feng KY,Cai YD,Chon KC.Boosting classifier for predicting protein domain structural class.Biochemical and Biophysical Research Communications,2005,334:213~217

共引文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部