期刊文献+

基于有效性指标的聚类集成学习方法 被引量:1

A CLUSTERING ENSEMBLE LEARNING METHOD BASED ON VALIDITY INDEX
在线阅读 下载PDF
导出
摘要 学习器间的差异性是影响集成学习效果的一个关键因素。目前针对分类集成的研究较多,针对聚类集成的研究则相对较少。基于聚类问题的本质特点,提出一种新的聚类集成学习方法,利用聚类有效性指标度量不同聚类结果性能上的差异,根据有效性指标的评价值为聚类结果分配权值,通过加权投票的决策方法进行聚类集成并确定最佳聚类数。理论研究和实验结果证明了新的聚类集成学习方法的可行性和高效性。 The diversity of learners is an important factor to the performance of ensemble learning.Until now,a large amount of researches in classified ensembles has been done,while limit work focuses on the clustering ensemble.Based on the essential characteristic of clustering,this paper proposes a new method of clustering ensemble learning,which measures the diversity in performance of different clustering results by using cluster validity index.The weights for each cluster result are constituted according to the validity index evaluation.And then the clustering ensemble is conducted and the optimal clustering number is determined both by the decision-making approach of weighted-voting.Theoretical research and experimental results demonstrate the feasibility and high efficiency of the new clustering ensemble leaning method.
作者 王海波 徐涛
出处 《计算机应用与软件》 CSCD 北大核心 2012年第9期45-49,70,共6页 Computer Applications and Software
基金 国家自然科学基金项目(61139002) 中国民用航空局科技项目(MHRD201006 MHRD201101)
关键词 聚类 集成学习 有效性指标 差异性 Clustering ,Ensemble learning ,Validity index ,Diversity
  • 相关文献

参考文献15

  • 1Dienerich T. Machine learning research : four current directions [ J ]. AI Magazine, 1997,18 (4) : 97 - 136.
  • 2Hanan G A, Mohamed S K. On voting-based consensus of cluster en- sembles[ J]. Pattern Recognition ,2010,43 ( 5 ) : 1943 - 1953.
  • 3Kerr G, Ruskin H G, Crane M. Techniques for clustering gene expres- sion data[ J]. Computers in Biology and Medicine, 2008,38 ( 3 ) : 283 - 293.
  • 4Strehl A, Ghosh J. Cluster ensembles-a knowledge reuse framework for combining multiple partitions [ J ]. Journal of Machine Learning Re- search ,2002,3 ( 3 ) :583 - 617.
  • 5Krogh A, Vedelsby J. Neural network ensembles, cross validation, and active learning [ J ]. Advances in Neural Information Processing Sys- tems, 1995,17 (s) :23 l - 238.
  • 6Hadjitodorov S T, Kuneheva L l, Todorova L. Moderate diversity for bet- ter cluster ensembles[ J]. Information Fusion ,2006,7 ( 3 ) :264 - 275.
  • 7唐伟,周志华.基于Bagging的选择性聚类集成[J].软件学报,2005,16(4):496-502. 被引量:99
  • 8李巍,郭强,曹华.具有成功率约束的最优匹配问题[J].计算机工程与应用,2011,47(4):33-35. 被引量:1
  • 9张惟皎,刘春煌,李芳玉.聚类质量的评价方法[J].计算机工程,2005,31(20):10-12. 被引量:62
  • 10Dunn J. Well separated, clusters and optimal fuzzy partitions [ J ]. Jour- nal of Cybernetics, 1974, 4 ( 1 ) :95 - 104.

二级参考文献43

  • 1郭强.分配问题的一种新的迭代算法[J].系统工程与电子技术,2004,26(12):1915-1916. 被引量:11
  • 2CALINSKI R,HARABASZ J.A dendrite method for cluster analysis[J].Communications in Statistics,1974,3(1):1 -27.
  • 3DAVIES D L,BOULDIN D W.A cluster separation measure[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,1979,1(2):224-227.
  • 4DUDOIT S,FRIDLYAND J.A prediction-based resampling method for estimating the number of clusters in a dataset[J].Genome Biology,2002,3(7):1-21.
  • 5DIMITRIADOU E,DOLNICAR S,WEINGESSEL A.An examination of indexes for determining the number of cluster in binary data sets[J].Psychometrika,2002,67(1):137-160.
  • 6KAPP A V,TIBSHIRANI R.Are clusters found in one dataset present in another dataset?[J].Biostatistics,2007,8(1):9-31.
  • 7ROUSSEEUW P J.Silhouettes:a graphical aid to the interpretation and validation of cluster analysis[J].Journal of Computational and Applied Mathematics,1987,20(1):53 -65.
  • 8DEMB(E)L(E) D,KASTNER P.Fuzzy C-means method for clustering microarray data[J].Bioinformatics,2003,19(8):973-980.
  • 9Gale D, Shapley L S.College admissions and the stability of marriage[J].American Mathematical Monthly, 1962,69: 9-15.
  • 10Mcvitie D G, Wilson L B.The stable marriage problem[J].Communications of the ACM, 1971,14:486-492.

共引文献305

同被引文献3

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部