期刊文献+

一种面向分类属性数据的聚类融合算法研究 被引量:7

Clustering ensemble algorithm for categorical data
在线阅读 下载PDF
导出
摘要 为了解决单一聚类算法存在结果不准确和随机性大,且现有算法对分类数据聚类时将其转换成数值型会产生误差等问题,提出了一种面向分类属性数据的聚类融合算法。算法利用原有分类属性值的差异产生聚类成员,然后采用相似度方法进行划分,通过寻求目标函数最小的划分来简化聚类过程。算法在UCI数据集上进行了验证,结果表明算法的效率和精度都优于现有算法,说明算法的设计和更新策略是有效的。 In order to prevent the inaccuracy and randomness of single clustering algorithm,and error of existing clustering algorithm transferring categorical data into numerical data for clustering,this paper proposed the clustering ensemble for catego-rical data.The algorithm produced clustering memberships by values of categorical data,and then used similarity degree to partition dataset,which reduced the process of clustering by minimizing the objective function.Finally,applied the algorithm into UCI dataset.The results show its efficiency and accuracy are better than existing algorithms,the design and refreshing methods are effective.
出处 《计算机应用研究》 CSCD 北大核心 2011年第5期1671-1673,共3页 Application Research of Computers
基金 国家自然科学基金资助项目(70801007 70940008) 国家教育部博士点基金资助项目(200801510001) 国家教育部科学技术研究重点资助项目(209030) 国家科技支撑计划资助项目(2009BAG13A03) 中央高校基本科研业务费专项资金资助项目(2009QN085)
关键词 聚类融合 分类属性数据 数据挖掘 相似度 clustering ensemble categorical data data mining similarity degree
  • 相关文献

参考文献9

  • 1EVERITT B S, LANDAU S, LEESE M. Cluster analysis[M]. 4th ed. London: Arnold, 2001.
  • 2JAIN A K, MURTY M N, FLYNN P J. Data clustering: a review [J]. ACM Computing Surveys, 1999,31 ( 3 ) :264-323.
  • 3FRED A L. Finding consistent clusters in data partitions [ C ]//Proc of the 2nd International Workshop on Multiple Classifier Systems. Cambridge: Springer, 2001 : 309-318.
  • 4STREHL A, GHOSH J. Cluster ensembles: a knowledge reuse frame-work for combining multiple partitions [ J ]. Journal of Machine Learning Research, 2003,3(3):583-617.
  • 5HE Zeng-you, XU Xiao-fei, DENG Sheng-chun. A cluster ensemble method for clustering categorical data [ J ]. Information Fusion,2005, 6(2) :143-151.
  • 6FRED A, JAIN A K. Data clustering using evidence accumulation [ C]//Proc of the 16th International Conference on Pattern Recognition. Washington DC : IEEE Computer Society,2002 : 276-280.
  • 7LI Tao-ying, CHEN Yan. Fuzzy clustering ensemble algorithm for partitioning categorical data[ C ]//Proc of the 2nd International Conference on Business Intelligent and Financial Engineering. Washington DC : IEEE Computer Society,2009 : 170-174.
  • 8杨善林,李永森,胡笑旋,潘若愚.K-MEANS算法中的K值优化问题研究[J].系统工程理论与实践,2006,26(2):97-101. 被引量:197
  • 9FORSYTH R. UCI machine learning repository[ DB/OL]. ( 1990-05- 15 ). http ://archive. ies. uci. edu/ml/datasets/Zoo.

二级参考文献6

  • 1Treshansky A,McGraw R.An overview of clustering algorithms[A].Proceedings of SPIE,The International Society for Optical Engineering[C].2001(4367):41-51.
  • 2Clausi D A.K-means Iterative Fisher (KIF) unsupervised clustering algorithm applied to image texture segmentation[J].Pattern Recognition,2002,35:1959-1972.
  • 3Bezdek J C,Pal N R.Some new indexes of cluster validity[J].IEEE Transactions on Systems,Man,and Cybernetics _ Part B:Cybernetics,1998,28(3):301-315.
  • 4Ramze R M,Lelieveldt B P F,Reiber J H C.A new cluster validity indexes for the fuzzy c-mean[J].Pattern Recognition Letters,1998,19:237-246.
  • 5范九伦,裴继红,谢维信.聚类有效性函数:熵公式[J].模糊系统与数学,1998,12(3):68-74. 被引量:19
  • 6于剑,程乾生.模糊聚类方法中的最佳聚类数的搜索范围[J].中国科学(E辑),2002,32(2):274-280. 被引量:131

共引文献196

同被引文献62

  • 1陈孝新.熵权法在股票市场的应用[J].商业研究,2004(16):139-140. 被引量:9
  • 2樊爱军,雷宪章,刘红超,李兴源.研究大规模互联电网区域间振荡的特征值分析方法[J].电网技术,2005,29(17):35-39. 被引量:33
  • 3阳琳贇,王文渊.聚类融合方法综述[J].计算机应用研究,2005,22(12):8-10. 被引量:28
  • 4张旭,沈沉,梅生伟,陈颖.小干扰稳定特征向量和相关因子的分布式算法[J].电力系统自动化,2007,31(14):7-11. 被引量:11
  • 5Han Jiawei,Kamber M.数据挖掘概念与技术[M].范明,孟小峰,译.北京:机械工业出版社,2008.
  • 6JAIN A K. Data clustering: 50 years beyond K-means[ J]. Pattern Recognition Letters ,2010,31 ( 8 ) :651-666.
  • 7JAIN A K, DUBES R C. Algorithms for clustering data[ M]. New Jersey : Prentice-Hall, 1988.
  • 8STREHL A, GHOSH J. Cluster ensembles:a knowledge reuse frame- work for combining multiple partitions [ J ]. Journal of Machine Learning Research ,2002,3( 1 ) :583-617.
  • 9LI Tao, OGIHARA M, MA Sheng. On combining multiple cluster- ings: an overview and a new perspective[ J]. Applied Intelligence, 2010,33(2) :207-219.
  • 10VEGA-PONS S, RUIZ-SHULCLOPER J. A survey of clustering en- semble algorithms [ J]. International Journal of Pattern Recogni- tion and Artificial Intelligence ,2011,25(3 ) :337-372.

引证文献7

二级引证文献23

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部