Comparative Study of Methods for Determining the Optimal Number of Clusters Based on Affinity Propagation Clustering (cited by: 30)
Abstract: In cluster analysis, determining the optimal number of clusters is key to clustering quality. This paper proposes clustering the samples with the affinity propagation (AP) algorithm, which gives good clustering results, and then applying six clustering validity indices to the clustering results to determine the optimal number of clusters. The validity indices are analyzed in detail, and the method of using the IGP index to determine the optimal number of clusters is improved. The performance of the indices is compared experimentally on eight datasets. The analysis and the experimental results show that, with affinity propagation clustering, the IGP index performs best at determining the optimal number of clusters.
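To make the role of a validity index concrete, the following is a minimal Python sketch of the basic in-group proportion (IGP) as usually defined following Kapp and Tibshirani: for each cluster, the fraction of its points whose nearest neighbor carries the same cluster label. Several things here are assumptions and are not taken from the abstract: the toy points, the hard-coded candidate labelings (which stand in for affinity propagation runs that yield 2, 3, and 4 clusters), and in particular the `pick_k` rule, which simply prefers the largest number of clusters whose mean IGP ties the best score. The paper's improved IGP-based selection method is not described in the abstract and is not reproduced here.

```python
import math


def nearest_neighbor(points, j):
    """Index of the point closest to points[j] (Euclidean), excluding j itself."""
    best, best_d = None, math.inf
    for i, p in enumerate(points):
        if i == j:
            continue
        d = math.dist(points[j], p)
        if d < best_d:  # strict "<": ties go to the lowest index
            best, best_d = i, d
    return best


def in_group_proportion(points, labels):
    """Per-cluster IGP: share of a cluster's points whose nearest neighbor has the same label."""
    nn = [nearest_neighbor(points, j) for j in range(len(points))]
    igp = {}
    for u in set(labels):
        members = [j for j, lab in enumerate(labels) if lab == u]
        same = sum(1 for j in members if labels[nn[j]] == u)
        igp[u] = same / len(members)
    return igp


def mean_igp(points, labels):
    """Average of the per-cluster IGP values."""
    scores = in_group_proportion(points, labels)
    return sum(scores.values()) / len(scores)


def pick_k(points, candidates, tol=1e-9):
    """Illustrative rule (an assumption, not the paper's): largest k whose mean IGP ties the best."""
    means = {k: mean_igp(points, labels) for k, labels in candidates.items()}
    best = max(means.values())
    return max(k for k, m in means.items() if m >= best - tol)


# Toy data (hypothetical): three well-separated groups of three points each.
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0),
          (10.0, 10.0), (10.0, 11.0), (11.0, 10.0),
          (20.0, 0.0), (20.0, 1.0), (21.0, 0.0)]

# Hard-coded labelings standing in for clusterings with 2, 3, and 4 clusters.
candidates = {
    2: [0, 0, 0, 1, 1, 1, 1, 1, 1],
    3: [0, 0, 0, 1, 1, 1, 2, 2, 2],
    4: [0, 0, 0, 1, 1, 1, 2, 2, 3],
}

for k, labels in candidates.items():
    print(k, mean_igp(points, labels))
print("selected k:", pick_k(points, candidates))
```

On this toy data the over-split labeling (4 clusters) is penalized, because the singleton cluster's nearest neighbor lies in a different cluster, while the 2- and 3-cluster labelings both score a mean IGP of 1.0. This shows why the plain IGP needs some additional selection rule to separate coarse partitions, which is the gap the tie-breaking assumption in `pick_k` fills in this sketch.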
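Source: Computer Science (《计算机科学》), indexed in CSCD and the Peking University Core Journals list (北大核心), 2011, No. 2, pp. 225-228 (4 pages)
Funding: Supported by the National 863 Program of China (2007AA1Z158) and the National Natural Science Foundation of China (60703106).
Keywords: affinity propagation; number of clusters; clustering validity index; cluster analysis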

