A transfer cost-sensitive boosting approach for cross-project defect prediction

Cited by: 105
Authors
Ryu, Duksan [1]
Jang, Jong-In [1]
Baik, Jongmoon [1]
Affiliations
[1] Korea Adv Inst Sci & Technol, Sch Comp, 291 Daehak Ro,373-1 Guseong Dong, Daejeon 305701, South Korea
Funding
National Research Foundation, Singapore;
Keywords
Boosting; Class imbalance; Cost-sensitive learning; Cross-project defect prediction; Software defect prediction; Transfer learning;
DOI
10.1007/s11219-015-9287-1
Chinese Library Classification number
TP31 [Computer Software];
Discipline classification code
081202 ; 0835 ;
Abstract
Software defect prediction has been regarded as one of the crucial tasks to improve software quality by effectively allocating valuable resources to fault-prone modules. It is necessary to have a sufficient set of historical data for building a predictor. Without a set of sufficient historical data within a company, cross-project defect prediction (CPDP) can be employed where data from other companies are used to build predictors. In such cases, a transfer learning technique, which extracts common knowledge from source projects and transfers it to a target project, can be used to enhance the prediction performance. There exists the class imbalance problem, which causes difficulties for the learner to predict defects. The main impacts of imbalanced data under cross-project settings have not been investigated in depth. We propose a transfer cost-sensitive boosting method that considers both knowledge transfer and class imbalance for CPDP when given a small amount of labeled target data. The proposed approach performs boosting that assigns weights to the training instances with consideration of both distributional characteristics and the class imbalance. Through comparative experiments with the transfer learning and the class imbalance learning techniques, we show that the proposed model provides significantly higher defect detection accuracy while retaining better overall performance. As a result, a combination of transfer learning and class imbalance learning is highly effective for improving the prediction performance under cross-project settings. The proposed approach will help to design an effective prediction model for CPDP. The improved defect prediction performance could help to direct software quality assurance activities and reduce costs. Consequently, the quality of software can be managed effectively.
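The abstract describes boosting that weights training instances by both distributional characteristics and class imbalance, but it gives no formulas. The following is a minimal, generic sketch of that idea and not the authors' algorithm: the function name `reweight`, the `similarity` scores (assumed to lie in [0, 1] and to measure how closely a source instance resembles the target project), and the cost values `cost_defect` and `cost_clean` are all hypothetical choices made for illustration.

```python
import math


def reweight(weights, y_true, y_pred, similarity,
             cost_defect=2.0, cost_clean=1.0):
    """One AdaBoost-style reweighting round (illustrative only).

    Misclassified instances gain weight; the gain is scaled by an
    assumed misclassification cost (higher for the defective class,
    label 1) and by an assumed similarity to the target project.
    Correctly classified instances lose weight.
    """
    total = sum(weights)
    # Weighted error of the current weak learner.
    err = sum(w for w, t, p in zip(weights, y_true, y_pred) if t != p) / total
    # Clamp to avoid division by zero or log of zero.
    err = min(max(err, 1e-10), 1 - 1e-10)
    alpha = 0.5 * math.log((1 - err) / err)

    updated = []
    for w, t, p, s in zip(weights, y_true, y_pred, similarity):
        if t != p:
            cost = cost_defect if t == 1 else cost_clean
            w *= math.exp(alpha * cost * s)
        else:
            w *= math.exp(-alpha)
        updated.append(w)

    # Normalise so the weights form a distribution again.
    norm = sum(updated)
    return [w / norm for w in updated], alpha


# Hypothetical example: five instances, one defective (label 1).
new_weights, alpha = reweight(
    weights=[0.2] * 5,
    y_true=[1, 0, 0, 0, 0],
    y_pred=[0, 1, 0, 0, 0],
    similarity=[1.0] * 5,
)
```

In this sketch a missed defect ends up with more weight than a false alarm, and lowering an instance's similarity score reduces how much extra weight it receives when misclassified.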
Pages: 235-272
Page count: 38
Related papers
50 entries in total
  • [41] Cost-sensitive transfer kernel canonical correlation analysis for heterogeneous defect prediction
    Li, Zhiqiang
    Jing, Xiao-Yuan
    Wu, Fei
    Zhu, Xiaoke
    Xu, Baowen
    Ying, Shi
    Automated Software Engineering, 2018, 25 (02) : 201 - 245
  • [43] Cross-project smell-based defect prediction
    Sotto-Mayor, Bruno
    Kalech, Meir
    Soft Computing, 2021, 25 (22) : 14171 - 14181
  • [44] Multi-Objective Cross-Project Defect Prediction
    Canfora, Gerardo
    De Lucia, Andrea
    Di Penta, Massimiliano
    Oliveto, Rocco
    Panichella, Annibale
    Panichella, Sebastiano
    2013 IEEE Sixth International Conference on Software Testing, Verification and Validation (ICST 2013), 2013 : 252 - 261
  • [45] Boosting cost-sensitive trees
    Ting, KM
    Zheng, ZJ
    Discovery Science, 1998, 1532 : 244 - 255
  • [46] Improve cross-project just-in-time defect prediction with dynamic transfer learning
    Dai, Hongming
    Xi, Jianqing
    Dai, Hong-Liang
    Journal of Systems and Software, 2025, 219
  • [47] Discriminating features-based cost-sensitive approach for software defect prediction
    Ali, Aftab
    Khan, Naveed
    Abu-Tair, Mamun
    Noppen, Joost
    McClean, Sally
    McChesney, Ian
    Automated Software Engineering, 2021, 28 (02)
  • [48] An Empirical Study on Combining Source Selection and Transfer Learning for Cross-Project Defect Prediction
    Wen, Wanzhi
    Zhang, Bin
    Gu, Xiang
    Ju, Xiaolin
    2019 IEEE 1st International Workshop on Intelligent Bug Fixing (IBF '19), 2019 : 29 - 38
  • [50] Cost-sensitive Dictionary Learning for Software Defect Prediction
    Niu, Liang
    Wan, Jianwu
    Wang, Hongyuan
    Zhou, Kaiwei
    Neural Processing Letters, 2020, 52 (03) : 2415 - 2449