期刊文献+

将KFCM算法应用于源代码挖掘的研究 被引量:3

Research on applying KFCM algorithm to source code mining
在线阅读 下载PDF
导出
摘要 为解决软件工程数据量大、属性多且多为离散型数据的特点,提高软件工程数据的挖掘效率,寻求更快速、高效的聚类算法,提出了将基于核函数的模糊聚类算法应用于源代码挖掘;同时采用TF-IDF方法对离散型文本数据进行处理,解决了核模糊聚类算法不能对文本数据直接进行聚类的问题。将遗传算法与KFCM算法相结合,克服了KFCM只能求解局部极小值的问题。实验结果表明,改进的KFCM算法对软件工程数据的挖掘有很好的聚类效果,且有较高的效率。 It provides that Kernelized fuzzy C-means uses on the research of source code mining for solving the significant number of quantities,multiple attributes and most of the attributes are discrete data and improving the efficiency of mining software engineering data,also seeking faster and more effective cluster approaches;meanwhile,to solve the problem that the KFCM algorithm can not cluster text data directly,the TF-IDF method is used to process the discrete text data.Then we integrate KFCM and genetic algorithm to overcome the defect of only being able to obtain the local minimum value by KFCM.Finally,the experimental results illustrate the improved KFCM algorithm can achieve good clustering performance and high efficiency for software engineering data mining.
作者 孟美芝 张阳
出处 《计算机工程与设计》 CSCD 北大核心 2010年第10期2249-2252,共4页 Computer Engineering and Design
关键词 源代码挖掘 特征空间 核函数 遗传算法 目标函数 source code mining feature space kernel function genetic algorithm object function
  • 相关文献

参考文献15

  • 1Han JW, Kambr M.Data mining concepts and techniques[M]. Beijing:Higher Education Press,2001:145-176.
  • 2苏绍勇,潘金贵.数据挖掘在软件维护中的应用[J].计算机科学,2005,32(10):245-248. 被引量:3
  • 3David Binkley.Source code analysis:A road map[C].Future of Software Engineering,2007:104-119.
  • 4钟智,尹云飞,张师超.软件系统层次的数据挖掘方法[J].计算机科学,2005,32(2):202-205. 被引量:2
  • 5Antonellis P, Antoniou D,Kanellopoulos Y, et al.A data mining methodology for evaluating maintainability according to ISO/ IEC-9126 software engineering-product quality standard [C]. Special Session on System Quality and Maintainability,Organized in Conjunction with the 11th European Conference on Software Maintenance and Reengineering,2007.
  • 6Yiannis Kanellopoulos, Christos Tjortjis. Data mining source code to facilitate program comprehension:Experiments on clustering data retrieved from C++ programs[C].Proc IEEE 12th Int'l Workshop Program Comprehension, IEEE Comp Soc Press, 2004:214-223.
  • 7Dimitris Rousidis, Christos Tjortjis. Clustering data retrieved from Java source code to support software maintenance:A case study[C].Proc IEEE 9th European Conf Software Maintenance and Reengineering,2005:276-279.
  • 8Yiannis Kanellopoulos, Thimios Dimopulos. Mining source code elements for comprehending object-oriented systems and evaluating their maintainability[J].SIGKDD Explorations,2006,8(1):33-40.
  • 9Girolami M.Mercer kernel based clustering in feature space[J]. IEEE Transactions on Neural Network,2002,13 (3):780-784.
  • 10张莉,周伟达,焦李成.核聚类算法[J].计算机学报,2002,25(6):587-590. 被引量:197

二级参考文献47

共引文献236

同被引文献16

  • 1王超,姜威.基于K近邻加权的混合C均值聚类算法[J].计算机工程与应用,2006,42(30):84-87. 被引量:2
  • 2Shao Bin, Xin Hongwei. A real-time computer vision assessment and control of thermal comfort for group-housed pigs [ J ]. Computer and E- lectronics in Agriculture, 2008,62( 1 ) :15 -21.
  • 3Wang ZQ. Geo-statistics and Its Application in Ecology[ M ]. Beijing: Science Press, 1999.
  • 4Wu Y, et al. Brain MRI segmentation using KFCM and Chan-Vese model[ M ]. Transactions of Tianjin University, Springer, 2011,17 : 215 -219.
  • 5曲福恒,崔广才,李岩芳,等.模糊聚类算法及其应用[M].北京:国防工业出版社.2011:68-71.
  • 6Hidetomo Ichihashi, Katsuhiro Honda. FCM Clustering from the View Point of Iteratively Reweighted Least Squares[ C]. IEEE International Conference on Fuzzy Systems, 2005:873 -878.
  • 7Tara Saikumar, Anoop BK, Murthy PS. Robust Adaptive Threshold Algorithm based on Kernel Fuzzy Clustering on Image segmentation [J]. Computer Science & Information Technology (CS & IT) ,2012: 99 - 103.
  • 8Ortega R A, Santibanez 0 A. Determination of management zones in corn based on soil fertility [ J ]. Computers and Electronics in Agricul- ture,2007 (58) :48-59.
  • 9毛澄映,卢炎生,胡小华.数据挖掘技术在软件工程中的应用综述[J].计算机科学,2009,36(5):1-6. 被引量:21
  • 10陶新民,徐晶,付强,刘兴丽.基于样本密度KFCM新算法及其在故障诊断的应用[J].振动与冲击,2009,28(8):61-64. 被引量:14

引证文献3

二级引证文献21

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部