Using Class Imbalance Learning for Software Defect Prediction

被引:389
|
作者
Wang, Shuo [1 ]
Yao, Xin [1 ]
机构
[1] Univ Birmingham, Sch Comp Sci, CERCIA, Birmingham B15 2TT, W Midlands, England
基金
英国工程与自然科学研究理事会;
关键词
Class imbalance learning; ensemble learning; negative correlation learning; software defect prediction; STATIC CODE ATTRIBUTES; NEURAL-NETWORKS; MACHINE;
D O I
10.1109/TR.2013.2259203
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
To facilitate software testing, and save testing costs, a wide range of machine learning methods have been studied to predict defects in software modules. Unfortunately, the imbalanced nature of this type of data increases the learning difficulty of such a task. Class imbalance learning specializes in tackling classification problems with imbalanced distributions, which could be helpful for defect prediction, but has not been investigated in depth so far. In this paper, we study the issue of if and how class imbalance learning methods can benefit software defect prediction with the aim of finding better solutions. We investigate different types of class imbalance learning methods, including resampling techniques, threshold moving, and ensemble algorithms. Among those methods we studied, AdaBoost.NC shows the best overall performance in terms of the measures including balance, G-mean, and Area Under the Curve (AUC). To further improve the performance of the algorithm, and facilitate its use in software defect prediction, we propose a dynamic version of AdaBoost. NC, which adjusts its parameter automatically during training. Without the need to pre-define any parameters, it is shown to be more effective and efficient than the original AdaBoost. NC.
引用
收藏
页码:434 / 443
页数:10
相关论文
共 50 条
  • [31] Handling Class-Imbalance with KNN (Neighbourhood) Under-Sampling for Software Defect Prediction
    Goyal, Somya
    ARTIFICIAL INTELLIGENCE REVIEW, 2022, 55 (03) : 2023 - 2064
  • [32] MAHAKIL: Diversity Based Oversampling Approach to Alleviate the Class Imbalance Issue in Software Defect Prediction
    Benni, Kwabena Ebo
    Keung, Jacky
    Phannachitta, Passakorn
    Monden, Akito
    Mensah, Solomon
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2018, 44 (06) : 534 - 550
  • [33] A Survey on Software Defect Prediction Using Deep Learning
    Akimova, Elena N.
    Bersenev, Alexander Yu
    Deikov, Artem A.
    Kobylkin, Konstantin S.
    Konygin, Anton, V
    Mezentsev, Ilya P.
    Misilov, Vladimir E.
    MATHEMATICS, 2021, 9 (11)
  • [34] An Empirical Study on Data Sampling Methods in Addressing Class Imbalance Problem in Software Defect Prediction
    Odejide, Babajide J.
    Bajeh, Amos O.
    Balogun, Abdullateef O.
    Alanamu, Zubair O.
    Adewole, Kayode S.
    Akintola, Abimbola G.
    Salihu, Shakirat A.
    Usman-Hamza, Fatima E.
    Mojeed, Hammed A.
    SOFTWARE ENGINEERING PERSPECTIVES IN SYSTEMS, VOL. 1, 2022, 501 : 594 - 610
  • [35] Software defect prediction using learning to rank approach
    Ali Bou Nassif
    Manar Abu Talib
    Mohammad Azzeh
    Shaikha Alzaabi
    Rawan Khanfar
    Ruba Kharsa
    Lefteris Angelis
    Scientific Reports, 13
  • [36] Software Defect Density Prediction Using Deep Learning
    Alghanim, Firas
    Azzeh, Mohammad
    El-Hassan, Ammar
    Qattous, Hazem
    IEEE ACCESS, 2022, 10 : 114629 - 114641
  • [37] Handling Class-Imbalance with KNN (Neighbourhood) Under-Sampling for Software Defect Prediction
    Somya Goyal
    Artificial Intelligence Review, 2022, 55 : 2023 - 2064
  • [38] Handling class overlap and imbalance using overlap driven under-sampling with balanced random forest in software defect prediction
    Dar, Abdul Waheed
    Farooq, Sheikh Umar
    INNOVATIONS IN SYSTEMS AND SOFTWARE ENGINEERING, 2024,
  • [39] Class Imbalance in Software Fault Prediction Data Set
    Arun, C.
    Lakshmi, C.
    ARTIFICIAL INTELLIGENCE AND EVOLUTIONARY COMPUTATIONS IN ENGINEERING SYSTEMS, 2020, 1056 : 745 - 757
  • [40] COSTE: Complexity-based OverSampling TEchnique to alleviate the class imbalance problem in software defect prediction
    Feng, Shuo
    Keung, Jacky
    Yu, Xiao
    Xiao, Yan
    Bennin, Kwabena Ebo
    Kabir, Md Alamgir
    Zhang, Miao
    INFORMATION AND SOFTWARE TECHNOLOGY, 2021, 129