A Gravitational Clustering Algorithm Based on Density (一种基于密度的引力聚类算法)

Abstract: To address the drawbacks of traditional distance-based clustering algorithms, this paper brings the ideas of universal gravitation and Newton's second law of motion into the clustering process and proposes an improved density-based gravitational clustering algorithm, GCABD (Gravitational Clustering Algorithm Based on Density). The algorithm determines the number of clusters in the target data set automatically, finds clusters of arbitrary shape, and filters out noisy data. Experimental results show that GCABD achieves better clustering quality and precision than the typical K-means algorithm.
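The abstract does not spell out the steps of GCABD, so the following is only a minimal sketch of a generic gravitational clustering loop, not the authors' algorithm. It illustrates the three properties the abstract names: points attract each other under an inverse-square force (with unit mass, Newton's second law makes acceleration equal to force), points that come within a merge radius join the same cluster so the number of clusters is never supplied, and groups that stay too small are reported as noise. The function name and the parameters `g`, `decay`, `eps`, `min_size`, and `iterations` are assumptions chosen for illustration.

```python
import math


def gravitational_clustering(points, g=0.01, decay=0.05, eps=0.1,
                             min_size=3, iterations=50):
    """Illustrative gravitational clustering of 2-D points with unit mass.

    Returns one label per point: a cluster id >= 0, or -1 for noise.
    All parameters are hypothetical tuning knobs, not values from the paper.
    """
    pos = [[float(x), float(y)] for x, y in points]
    n = len(pos)
    parent = list(range(n))  # union-find forest of merged points

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for _ in range(iterations):
        moves = [[0.0, 0.0] for _ in range(n)]
        for i in range(n):
            for j in range(i + 1, n):
                dx = pos[j][0] - pos[i][0]
                dy = pos[j][1] - pos[i][1]
                dist = math.hypot(dx, dy)
                if dist <= eps:
                    # Close enough: treat the two points as one cluster.
                    parent[find(i)] = find(j)
                    continue
                # Inverse-square pull; with unit mass a = F, and the
                # displacement per step is taken proportional to a.
                # The cap keeps a single pair from overshooting.
                step = min(g / dist ** 2, dist / 2.0)
                ux, uy = dx / dist, dy / dist
                moves[i][0] += step * ux
                moves[i][1] += step * uy
                moves[j][0] -= step * ux
                moves[j][1] -= step * uy
        # Apply all displacements at once (synchronous update).
        for p, m in zip(pos, moves):
            p[0] += m[0]
            p[1] += m[1]
        g *= 1.0 - decay  # weaken gravity so distant groups stop drifting

    sizes = {}
    for i in range(n):
        root = find(i)
        sizes[root] = sizes.get(root, 0) + 1

    ids = {}
    labels = []
    for i in range(n):
        root = find(i)
        if sizes[root] < min_size:
            labels.append(-1)  # too few members: report as noise
        else:
            labels.append(ids.setdefault(root, len(ids)))
    return labels


sample = [(0, 0), (0.05, 0), (0, 0.05),
          (5, 5), (5.05, 5), (5, 5.05),
          (10, 0)]
print(gravitational_clustering(sample))  # [0, 0, 0, 1, 1, 1, -1]
```

In this toy run the two tight groups become clusters 0 and 1 without the cluster count being given, and the isolated point is labelled -1. A density-based variant such as the one the paper proposes would presumably weight the attraction by local density, but that detail is not recoverable from the abstract.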

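Authors: Zhang Tianwu (张天伍), Li Weiping (李卫平)

Affiliations: Department of Computer Science and Engineering, Henan Institute of Engineering; Department of Computing Science, College of Information and Business, Zhongyuan University of Technology

Source: Henan Science (《河南科学》), 2008, No. 11, pp. 1400-1404 (5 pages)

Keywords: data mining; clustering analysis; clustering algorithm; gravitation

Classification: TP311 [Automation and Computer Technology: Computer Software and Theory]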
