期刊文献+

基于遗传算法的高维子空间聚类算法设计

Research on high dimensional sub-space clustering algorithm based on genetic algorithm
在线阅读 下载PDF
导出
摘要 针对高维空间数据的特点,为了降低"维数灾难效应"对聚类结果的影响,提出并实现了一种新的基于遗传算法的子空间聚类算法,通过特征选择方法并结合遗传算法的全局搜索能力对所有的特征子空间进行搜索;采用实数制编码方式对解空间进行编码,并设计一种基于距离和信息熵的适应度评估函数来对聚类结果和子空间所包含的特征维进行评估。最后,通过人工数据与真实数据等几组实验验证了算法的高效性和鲁棒性。实验结果表明,本文提出的新算法能够有效地进行高维数据聚类,降低"维数灾效应"的影响。 In view of the characteristics of high dimensional spatial data and in order to reduce the curse of dimensionality effect on clustering results, this paper proposed and implemented a new sub-space clustering algorithm based on genetic algorithm, with the feature-choice method and the combination with global searching ability of genetic algorithm to search all of the feature sub-spaces. A real number system encoding method is adopted to encode the solution space, and a fitness evaluation function based on the distance and information entropy is designed to canT on evaluation on the clustering results and the characteristic dimension out of sub-space. Finally, a series of experiments of artificial data and real data were used to verify the high-efficiency and robustness of the algorithm. The results demonstrate that the new proposed algorithm can effectively carry out the high-dimensional data clustering and reduce the influence on the curse of dimensionality effect.
作者 黄白梅 章政
出处 《电子设计工程》 2013年第5期180-183,共4页 Electronic Design Engineering
基金 湖北省教育厅科学技术研究项目(Q20091112)
关键词 遗传算法 高维空间 聚类 特征维 genetic algorithm high-dimensional space clustering feature dimension
  • 相关文献

参考文献2

二级参考文献22

  • 1钟将,吴中福,吴开贵,欧灵.基于人工免疫网络的动态聚类算法[J].电子学报,2004,32(8):1268-1272. 被引量:23
  • 2任江涛,黄焕宇,孙婧昊,印鉴.基于遗传算法及聚类的基因表达数据特征选择[J].计算机科学,2006,33(9):155-156. 被引量:4
  • 3Parsons L, Haque E, Huan Liu. Subspace Clustering for High Dimensional Data: A Review [J]. SIGKDD Explorations, 2004,6(1):90 105.
  • 4Maulik U,Bandyopadhyay S. Genetic Algorithm-Based Clustering Technique [J]. Pattern Recognition, 2000, 33 (9) :1455-1465.
  • 5http://archive, ics. uci. edu/ml/datasets/Wine.
  • 6A D Gordon.Classification[M].Chapman & HalL/CRC,Boca Raton,FL,2 Edition,1999.163-175.
  • 7Lin Yu Tseng,Shiueng Bien Yang.A genetic clustering algorithm for data with non-spherical-shape clusters[J].Pattern Recognition,2000,33:1251-1259.
  • 8Othman R M,Deris S,Illias R M,et al.Automatic clustering of gene ontology by genetic algorithm[J].International Journal of Information Technology,2006,3(1):37-46.
  • 9Sung-Hae Jun.A hybrid genetic algorithm and new criterion for determining the number of clusters[J].International Journal of Soft Computing,2006,1 (4):313-318.
  • 10Swagatam Das,Archana Chowdhury,Ajith Abraham.A bacterial evolutionary algorithm for automatic data clustering[A].Evolutionary Computation,CEC' 09 IEEE Congress on[C].IEEE,2009.2403-2410.

共引文献26

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部