期刊文献+

软件工程数据挖掘研究进展 被引量:25

Software Engineering Data Mining: A Survey
在线阅读 下载PDF
导出
摘要 随着计算机软件的规模不断扩大,手工获取、开发和维护软件所需的信息越来越困难。数据挖掘技术可从软件工程数据中自动发现所需信息,加快软件开发进程。对软件工程数据挖掘的研究进展进行了综述。概述了软件工程数据挖掘的基本概念与技术挑战;详细评述了在软件工程各个阶段,数据挖掘技术所能发现的信息/知识,以及获取这些信息/知识的意义、难点、步骤和方法,重点介绍了数据预处理和数据表示方法;对软件工程数据挖掘研究的发展趋势进行了展望。 With the rapid enlargement of software scale, to retrieve manually the relevant information of software development and maintenance is becoming more and more difficult. Data mining technology can help to discover useful information from software engineering data automatically, which thus speeds up the process of software development. This paper surveys the state of the art techniques of software engineering data mining. First, it presents basic concepts and technical challenges of software engineering data mining. Then, it discusses the details of data mining at different phases of software engineering, including motivation, problems, procedures and approaches, specifically, it emphasizes the methods of data pre-processing development of software engineering data mining technology. and representation. Finally, it gives a vision of future
出处 《计算机科学与探索》 CSCD 2012年第1期1-31,共31页 Journal of Frontiers of Computer Science and Technology
基金 国家自然科学基金(60873040 60873070)~~
关键词 软件工程 数据挖掘 数据表示 数据预处理 机器学习 software engineering data mining data representation data pre-processing machine learning
  • 相关文献

参考文献120

  • 1Tan Pangning. Introduction to data mining[M]. Upper Saddle River, NJ, USA: Pearson Education, 2006.
  • 2Xie Tao, Thummalapenta S, Lo D, et al. Data mining for software engineering[J]. Computer, 2009, 42: 55-62.
  • 3Wheeler D. Linux kernel 2.6: It's worth more! 2004.
  • 4Royce W W. Managing the development of large software systems: concepts and techniques[C]//Proceedings of the 9th International Conference on Software Engineering (ICSE '87), Monterey, CA, USA, 1987. Los Alamitos, CA,USA: IEEE Computer Society, 1987: 328-338.
  • 5Ko A J, DeLine R, Venolia G. Information needs in collocated software development teams[C]//Proceedings of the 29th International Conference on Software Engineering (ICSE '07), Minneapolis, MN, USA, 2007. Washington, DC, USA: IEEE Computer Society, 2007: 344-353.
  • 6Han Jiawei, Kamber M, Pei Jian. Data mining: concepts and techniques[M]. [S.l.]: Morgan Kaufmann, 2005.
  • 7Deerwester S, Dumais S T, Furnas G W, et al. Indexing by latent semantic analysis[J]. Journal of the American Society for Information Science, 1990, 41(6): 391--407.
  • 8Hofmann T. Probabilistic latent semantic indexing[C]// Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '99), Berkeley, CA, USA, 1999. New York, NY, USA: ACM, 1999: 50-57.
  • 9Blei D M, Ng A Y, Jordan M I. Latent Dirichlet allocation[J]. The Journal of Machine Learning Research, 2003, 3: 993-1022.
  • 10Rish I. An empirical study of the naive Bayes classifier[C]// Proceedings of International Joint Conference on Artificial Intelligence (IJCAI), Seattle, USA, 2001: 41-46.

同被引文献298

  • 1李新,张晓静,米燕涛.软件开发过程中的数据挖掘[J].石家庄职业技术学院学报,2007,19(2):31-33. 被引量:8
  • 2赵志升,罗德林,李海英.数据挖掘技术与应用[J].河北北方学院学报(自然科学版),2006,22(6):63-66. 被引量:10
  • 3张尧学.透明计算:概念、结构和示例[J].电子学报,2004,32(F12):169-174. 被引量:48
  • 4张增敏,谢嘉,李长河,隋连升.数据挖掘技术在变电站设备及缺陷管理系统中的应用[J].山东农业大学学报(自然科学版),2006,37(4):642-646. 被引量:4
  • 5王国胤,张清华,胡军.粒计算研究综述[J].智能系统学报,2007,2(6):8-26. 被引量:116
  • 6Srivastava J,Cooley R,Deshpande M,et al.Web usage mining:Discovery and applications of usage patterns from web data[J].ACM SIGKDD Explorations Newsletter,2000,1 (2):12-23.
  • 7王青,伍书剑,李明树.软件缺陷预测技术.软件学报,2008,19(7):1565—1580.http://www.jos.org.cn/1000—9825/19/1565.htm.
  • 8Hall T, Beecham S, Bowes D, Gray D, Counsell S. A systematic literature review on fault prediction performance in software engineering. IEEE Trans. on Software Engineering, 2012,38(6): 1276-1304. [doi: 10.1109/TSE.2011.103 ].
  • 9Radjenovic D, Hericko M, Torkar R, Zivkovic A. Software fault prediction metrics: A systematic literature review. Information and Software Technology, 2013,55(8): 1397-1418. [doi: 10.1016/j.infsof.2013.02.009].
  • 10Akiyama E. An example of software system debugging. In: Proc. of the Int'1 Federation of Information Proc. Societies Congress. New York: Springer Science and Business Media, 1971. 353-359.

引证文献25

二级引证文献280

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部