A transfer cost-sensitive boosting approach for cross-project defect prediction

被引:105
|
作者
Ryu, Duksan [1 ]
Jang, Jong-In [1 ]
Baik, Jongmoon [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Sch Comp, 291 Daehak Ro,373-1 Guseong Dong, Daejeon 305701, South Korea
基金
新加坡国家研究基金会;
关键词
Boosting; Class imbalance; Cost-sensitive learning; Cross-project defect prediction; Software defect prediction; Transfer learning;
D O I
10.1007/s11219-015-9287-1
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Software defect prediction has been regarded as one of the crucial tasks to improve software quality by effectively allocating valuable resources to fault-prone modules. It is necessary to have a sufficient set of historical data for building a predictor. Without a set of sufficient historical data within a company, cross-project defect prediction (CPDP) can be employed where data from other companies are used to build predictors. In such cases, a transfer learning technique, which extracts common knowledge from source projects and transfers it to a target project, can be used to enhance the prediction performance. There exists the class imbalance problem, which causes difficulties for the learner to predict defects. The main impacts of imbalanced data under cross-project settings have not been investigated in depth. We propose a transfer cost-sensitive boosting method that considers both knowledge transfer and class imbalance for CPDP when given a small amount of labeled target data. The proposed approach performs boosting that assigns weights to the training instances with consideration of both distributional characteristics and the class imbalance. Through comparative experiments with the transfer learning and the class imbalance learning techniques, we show that the proposed model provides significantly higher defect detection accuracy while retaining better overall performance. As a result, a combination of transfer learning and class imbalance learning is highly effective for improving the prediction performance under cross-project settings. The proposed approach will help to design an effective prediction model for CPDP. The improved defect prediction performance could help to direct software quality assurance activities and reduce costs. Consequently, the quality of software can be managed effectively.
引用
收藏
页码:235 / 272
页数:38
相关论文
共 50 条
  • [21] eCPDP : Early Cross-Project Defect Prediction
    Kwon, Sunjae
    Ryu, Duksan
    Baik, Jongmoon
    2021 IEEE 21ST INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY (QRS 2021), 2021, : 470 - 481
  • [22] Cross-Project Defect Prediction: A Literature Review
    Pal, Sourabh
    Sillitti, Alberto
    IEEE ACCESS, 2022, 10 : 118697 - 118717
  • [23] Heterogeneous Cross-Project Defect Prediction Using Encoder Networks and Transfer Learning
    Haque, Radowanul
    Ali, Aftab
    McClean, Sally
    Cleland, Ian
    Noppen, Joost
    IEEE ACCESS, 2024, 12 : 409 - 419
  • [24] Cross-Project Transfer Learning on Lightweight Code Semantic Graphs for Defect Prediction
    Fang, Dingbang
    Liu, Shaoying
    Li, Yang
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2023, 33 (07) : 1095 - 1117
  • [25] Misclassification Cost-Sensitive Software Defect Prediction
    Xu, Ling
    Wang, Bei
    Liu, Ling
    Zhou, Mo
    Liao, Shengping
    Yan, Meng
    2018 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IRI), 2018, : 256 - 263
  • [26] Cross-Project and Within-Project Semisupervised Software Defect Prediction: A Unified Approach
    Wu, Fei
    Jing, Xiao-Yuan
    Sun, Ying
    Sun, Jing
    Huang, Lin
    Cui, Fangyi
    Sun, Yanfei
    IEEE TRANSACTIONS ON RELIABILITY, 2018, 67 (02) : 581 - 597
  • [27] A Novel Cross-Project Software Defect Prediction Algorithm Based on Transfer Learning
    Shiqi Tang
    Song Huang
    Changyou Zheng
    Erhu Liu
    Cheng Zong
    Yixian Ding
    Tsinghua Science and Technology, 2022, 27 (01) : 41 - 57
  • [28] CSSG: A cost-sensitive stacked generalization approach for software defect prediction
    Eivazpour, Zeinab
    Keyvanpour, Mohammad Reza
    SOFTWARE TESTING VERIFICATION & RELIABILITY, 2021, 31 (05):
  • [29] An Evaluation of Cross-Project Defect Prediction Approaches on Cross-Personalized Defect Prediction
    Amasaki, Sousuke
    Aman, Hirohisa
    Yokogawa, Tomoyuki
    PRODUCT-FOCUSED SOFTWARE PROCESS IMPROVEMENT, PROFES 2022, 2022, 13709 : 433 - 448
  • [30] A two-phase transfer learning model for cross-project defect prediction
    Liu, Chao
    Yang, Dan
    Xia, Xin
    Yan, Meng
    Zhang, Xiaohong
    INFORMATION AND SOFTWARE TECHNOLOGY, 2019, 107 : 125 - 136