期刊文献+

基于有效性指标的聚类算法选择 被引量:9

Selection of Clustering Algorithms based on a Validity Index
在线阅读 下载PDF
导出
摘要 为数据集选择合适的聚类算法是获得高质量聚类结果的前提和保障.提出了基于有效性指标的聚类算法选择方法,通过对不同聚类算法的聚类结果的质量评价为数据集选择最适合的聚类算法.该方法的优点是在对数据集的情况了解甚少的情况下,也能有效地保障聚类质量.实验结果表明本文方法十分有效,为实验数据集正确选择出最适合的聚类算法,并获得了高质量的聚类结果. It is the precondition and guarantee for high clustering quality that are to select a proper clustering algorithm for a given dataset. The algorithm selection method based on a validity index is proposed. In this method, a proper algorithm is selected for a dataset through the quality evaluation of clustering results from different clustering algorithms. The proposed method has the advantage of guaranteeing clustering quality when there is less prior knowledge about the dataset. The experimental results show that the proposed method is effective and correctly finds out the proper clustering algorithm suiting for the experimental dataset.
作者 王开军 李晓
出处 《四川师范大学学报(自然科学版)》 CAS CSCD 北大核心 2011年第6期915-918,共4页 Journal of Sichuan Normal University(Natural Science)
基金 国家自然科学基金(60502047) 福建省教育厅基金(JA09043)资助项目
关键词 聚类算法选择 有效性指标 Silhouette指标 selection of clustering algorithms validity index Silhouette index
  • 相关文献

参考文献16

  • 1Xa R, Wunsch D 2nd. Survey of clustering algorithms [ J]. IEEE Transactions on Neural Networks,2005,16 (3) :645 - 678.
  • 2Nirkhi S, Hande K. A Survey on clustering algorithms for web applications [ C ]//The 2008 International Conference on Semantic Web and Web Services. Las Vegas : Nevada ,2008 :124 - 129.
  • 3Frey B J, I)ueck D. Clustefng by passing messages between data points[ J ]. Science,2007,315 (5814) :972 - 976.
  • 4Richards A L, Holmans P, O' Donovan M C, et al. A comparison of four clustering methods for brain expression microarray data[J]. BMC Bioixffonnatics ,2008,9:490.
  • 5王娜,杜海峰,庄健,余进涛,王孙安.三种典型的基于图分割的谱聚类方法比较[J].系统仿真学报,2009,21(11):3316-3320. 被引量:2
  • 6任春美 孔磊.数据挖掘中聚类分析技术方法的比较研究[J],科技信息[J].学术研究,2008,:24-192,191.
  • 7Thalamuthu A, Mukhopadhyay I, Zheng X, et ',tl. Evaluation and comparison of gene clustering methods in mieroarray analysis [ J ]. Bioinformaties,2006,22 ( 19 ) : 2405 - 2412.
  • 8Chen G, Jaradat S A, Banerjee N, et al. Evaluation and comparison of clustering algorithms in anglyzing ES cell gene expression data [ J ]. Statistica Sinica,2002 ,12 :241 - 262.
  • 9Halkidi M. Vazirgiannis M. A data set oriented approach for clustering algorithm selection[J]. Lecture Notes in Computer Science, 2001,2168 : 165 - 179.
  • 10易辉,宋晓峰.基于稳定性的基因表达数据聚类算法选择[J].计算机与应用化学,2008,25(5):615-618. 被引量:1

二级参考文献40

  • 1夏怡凡,朱允民,黄宽娜.信息估计融合中的观测量化解决方案[J].四川大学学报(工程科学版),2009,41(2):211-215. 被引量:2
  • 2殷爱茹,李栋,黄亚楼.基因表达数据聚类有效性分析的EFOM法[J].计算机工程与应用,2005,41(17):53-55. 被引量:4
  • 3袁方,周志勇,宋鑫.初始聚类中心优化的k-means算法[J].计算机工程,2007,33(3):65-66. 被引量:157
  • 4MacQucen J B. Some Methods for Classification and Analysis of Multivariate Observations [C]// Proceedings of 5th Berkeley Symposium on Mathematical Statistics and Probability. 1, Berkeley. CA, USA: University of California Press, 1967: 281-297.
  • 5Duda R O, Hart P E, Stork D G. Pattern Classification [M].北京:机械工业出版社.2004.
  • 6Wu Z, Leathy R. An Optimal Graph Theoretic Approach to Data Clustering Theory and its Application to Image Segmentation [J]. IEEE Trans on PAMI (S0162-8828), 1993, 15(11): 1101-1113.
  • 7Sarkar S, Boyer K L. Quantitative Measures of Change Based on Feature Organization: Eigenvalues and Eigenvectors [C]// Proc IEEE Conf Computer Vision and Pattern Recognition, 1996. USA: IEEE, 1996.
  • 8Shi J, Malik J. Normalized Cuts and Image Segmentation [J]. IEEE Tram on PAMI (S0162-8828), 2000, 22(8): 888-905.
  • 9Ding C, Ren X F, Zha H, et al. Spectral Min-max Cut for Graph Partitioning and Data Clustering [C]//Proc of the IEEE Intl Conf on Data Mining. USA: IEEE CS, 2001:107-114.
  • 10Verma D, Meilai M. A Comparison of Spectral Clustering Algorithms [R]// UW CSE. Technical Report. [2003-05-01]. Seattle, USA: University of Washington, 2003: 1 - 18.

共引文献148

同被引文献104

引证文献9

二级引证文献32

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部