Cross-project software defect prediction based on multi-source data sets


Abstract: Cross-project defect prediction (CPDP) uses one or more source projects to build a defect prediction model and applies that model to a target project. The data distributions of the source and target projects usually differ substantially, which makes it difficult to construct an effective defect prediction model. To alleviate negative transfer between the source and target projects in CPDP, this paper proposes a multi-source integrated transfer adaptive boosting (TrAdaBoost) algorithm, MSITrA. The algorithm uses an existing two-stage data filtering algorithm to select, from multiple source projects, the source data related to the target project, and then uses the integrated TrAdaBoost algorithm proposed in the paper to build a CPDP model. Experimental results on 15 public data sets from the PROMISE repository show that: 1) the proposed cross-project defect prediction model performs best among all tested CPDP methods; 2) in the within-project defect prediction (WPDP) experiment, the proposed CPDP method achieves better results than the tested WPDP method.
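As a rough illustration of the boosting step the abstract builds on, the sketch below shows one weight update of the classic TrAdaBoost scheme (as commonly described in the transfer-learning literature), not the paper's integrated multi-source variant, whose details the abstract does not give. The function name `tradaboost_reweight`, its arguments, and the error clamp are assumptions made for this example. Misclassified source instances are down-weighted by a fixed factor, while misclassified target instances are up-weighted according to the weak learner's error on the target data.

```python
import math


def tradaboost_reweight(w_src, w_tgt, miss_src, miss_tgt, n_rounds):
    """One weight update in the style of classic TrAdaBoost (illustrative only).

    w_src, w_tgt: current instance weights for source and target data.
    miss_src, miss_tgt: 1 if the weak learner misclassified the instance, else 0.
    n_rounds: total number of boosting rounds planned.
    """
    n = len(w_src)
    # Fixed factor in (0, 1] that shrinks misclassified source weights.
    beta_src = 1.0 / (1.0 + math.sqrt(2.0 * math.log(n) / n_rounds))
    # Weighted error of the weak learner, measured on target data only.
    eps = sum(w * m for w, m in zip(w_tgt, miss_tgt)) / sum(w_tgt)
    # Assumption: clamp the error so beta_tgt stays in (0, 1).
    eps = min(max(eps, 1e-10), 0.499)
    beta_tgt = eps / (1.0 - eps)
    # Source: misclassified instances lose weight.
    new_src = [w * beta_src ** m for w, m in zip(w_src, miss_src)]
    # Target: misclassified instances gain weight, as in AdaBoost.
    new_tgt = [w * beta_tgt ** (-m) for w, m in zip(w_tgt, miss_tgt)]
    # Renormalize so all weights sum to 1.
    total = sum(new_src) + sum(new_tgt)
    return [w / total for w in new_src], [w / total for w in new_tgt]


# Toy example: two source and three target instances, equal starting weights.
src, tgt = tradaboost_reweight(
    [0.2, 0.2], [0.2, 0.2, 0.2], [1, 0], [1, 0, 0], n_rounds=10
)
```

In this toy run the misclassified source instance ends up with less weight than the correctly classified one, and the misclassified target instance ends up with more, which is the mechanism that lets source data unlike the target fade out over successive rounds.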

Authors: Huang Junfu, Wang Yawen, Gong Yunzhan, Jin Dahai

Affiliation: School of Computer Science (National Pilot Software Engineering School)

Source: The Journal of China Universities of Posts and Telecommunications (中国邮电高校学报（英文版）), 2021, Issue 4, pp. 75-87 (13 pages). Indexed in EI and CSCD.

Keywords: cross-project defect prediction; multi-source; transfer adaptive boosting; ensemble learning

Classification: TP311.5 [Automation and Computer Technology: Computer Software and Theory]

