A hierarchical clustering algorithm of categorical attributive data using quantum mechanism

Abstract: Inspired by the properties of quantum mechanics in physics and combined with the idea of agglomerative hierarchical clustering, this paper proposes CQHC, a quantum mechanism-based hierarchical clustering algorithm for categorical attribute data. The algorithm introduces a new dissimilarity measure and the concept of a clustering measure scale step (step), and redefines the cluster validity function CVF on the basis of a compactness index, AIAD, and a dispersion index, AIED. CQHC first partitions the data samples at different granularity levels to generate initial clusters, then dynamically merges those initial clusters, using CVF as the evaluation criterion, to complete the clustering. Simulation experiments use two real data sets: the linearly separable soybean disease data set and the linearly inseparable zoo data set. The results indicate that, compared with several existing algorithms, CQHC achieves higher clustering accuracy and accurately detects the optimal number of clusters, and is therefore effective and feasible.
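The merge-and-evaluate loop described in the abstract can be illustrated with a small sketch. This is not the CQHC algorithm: the abstract does not give the quantum-inspired dissimilarity measure, the scale step, or the formulas for AIAD, AIED, and CVF. The sketch below therefore substitutes simple matching dissimilarity for the paper's measure and the mean silhouette score for CVF, and it starts from singleton clusters rather than from a granularity-based initial partition. All function names (`mismatch`, `avg_link`, `silhouette`, `agglomerate`) and the toy data are assumptions made for illustration.

```python
from itertools import combinations

def mismatch(a, b):
    """Simple matching dissimilarity: fraction of attributes that differ.
    Stand-in for the paper's dissimilarity measure, which the abstract does not define."""
    return sum(x != y for x, y in zip(a, b)) / len(a)

def avg_link(c1, c2):
    """Mean dissimilarity between all cross-cluster pairs (average linkage)."""
    return sum(mismatch(a, b) for a in c1 for b in c2) / (len(c1) * len(c2))

def silhouette(clusters):
    """Mean silhouette score, used here as a stand-in for CVF.
    It rewards low within-cluster and high between-cluster dissimilarity."""
    scores = []
    for idx, own in enumerate(clusters):
        for p_i, p in enumerate(own):
            if len(own) == 1:
                scores.append(0.0)  # usual convention for singleton clusters
                continue
            a = sum(mismatch(p, q) for q_i, q in enumerate(own) if q_i != p_i) / (len(own) - 1)
            b = min(avg_link([p], other) for j, other in enumerate(clusters) if j != idx)
            scores.append((b - a) / max(a, b) if max(a, b) > 0 else 0.0)
    return sum(scores) / len(scores)

def agglomerate(points):
    """Merge the two closest clusters repeatedly and keep the best-scoring partition."""
    clusters = [[p] for p in points]  # assumption: start from singletons
    best, best_score = [list(c) for c in clusters], silhouette(clusters)
    while len(clusters) > 2:
        i, j = min(combinations(range(len(clusters)), 2),
                   key=lambda ij: avg_link(clusters[ij[0]], clusters[ij[1]]))
        merged = clusters[i] + clusters[j]
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)] + [merged]
        score = silhouette(clusters)
        if score > best_score:
            best, best_score = [list(c) for c in clusters], score
    return best, best_score

# Toy categorical records (hypothetical): two groups that share no attribute values.
data = [("a", "x", "p"), ("a", "x", "q"), ("a", "y", "p"),
        ("b", "z", "r"), ("b", "z", "s"), ("b", "w", "r")]
best, score = agglomerate(data)
print(len(best), "clusters selected")
```

Keeping the partition with the highest validity score, rather than stopping at a preset number of clusters, mirrors the abstract's claim that the validity function is what determines the number of clusters.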

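Authors: ZHAO Zhengtian (赵正天), ZHAO Xiaoqiang (赵小强), LI Wei (李炜), DUAN Xiaoyan (段晓燕), LU Yong (卢勇)

Affiliations: College of Electrical and Information Engineering, Lanzhou University of Technology; Department of Electronic and Electrical Engineering, Lanzhou Petrochemical College of Vocational Technology; Electrical and Instrumentation Division, PetroChina Lanzhou Petrochemical Company

Source: Journal of Lanzhou University of Technology (《兰州理工大学学报》), 2009, No. 5, pp. 89-94 (6 pages). Indexed in CAS and the Peking University Core Journals list.

Funding: Natural Science Foundation of Gansu Province (0809RJZA005)

Keywords: categorical attribute; quantum mechanism; hierarchical agglomeration; clustering measure scale step; cluster validity function

Classification code: TP301.6 [Automation and Computer Technology: Computer System Architecture]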

References (12)

1. SANGUTHEVAR R. Efficient parallel hierarchical-clustering algorithms [J]. IEEE Transactions on Parallel and Distributed Systems, 2005, 16(6): 497-502.
2. HUANG Zhexue, MICHAEL K N. A fuzzy k-modes algorithm for clustering categorical data [J]. IEEE Trans on Fuzzy Systems, 1999, 7(4): 446-452.
3. HUANG Zhexue. A fast clustering algorithm to cluster very large categorical data sets in data mining [C]//Proceedings of the SIGMOD Workshop on Research Issues on Data Mining and Knowledge Discovery. New York: ACM Press, 1997: 1-8.
4. HUANG Zhexue. Extensions to the k-means algorithm for clustering large data sets with categorical values [J]. Data Mining and Knowledge Discovery, 1998, 2(3): 283-304.
5. CHEN Ning, CHEN An, ZHOU Longxiang. A fuzzy K-Prototypes clustering algorithm for mixed numeric and categorical data (in English) [J]. Journal of Software, 2001, 12(8): 1107-1119.
6. LI Zhihua, WANG Shitong. A fuzzy clustering algorithm for categorical attribute data based on quantum mechanism [J]. Journal of System Simulation, 2008, 20(8): 2119-2122.
7. ZHAO Zhengtian, ZHAO Xiaoqiang, LI Wei. An improved clustering algorithm for categorical attribute data based on quantum mechanism [J]. Journal of Lanzhou University of Technology, 2009, 35(3): 98-102.
8. AHMAD A, DEY L. A method to compute distance between two categorical values of same attribute in unsupervised learning for categorical data set [J]. Pattern Recognition Letters, 2007, 28(1): 110-118.
9. KIM D W, LEE K H, LEE D. On cluster validity index for estimation of the optimal number of fuzzy clusters [J]. Pattern Recognition, 2004, 37(10): 2009-2025.
10. KIM M, RAMAKRISHNA R S. New indices for cluster validity assessment [J]. Pattern Recognition Letters, 2005, 26(15): 2353-2363.
