Software Defect Prediction Using an Intelligent Ensemble-Based Model

被引:10
|
作者
Ali, Misbah [1 ]
Mazhar, Tehseen [1 ]
Arif, Yasir [2 ]
Al-Otaibi, Shaha [3 ]
Ghadi, Yazeed Yasin [4 ]
Shahzad, Tariq [5 ]
Khan, Muhammad Amir [6 ]
Hamam, Habib [5 ,7 ,8 ,9 ]
机构
[1] Virtual Univ Pakistan, Dept Comp Sci, Lahore 55150, Pakistan
[2] Global Inst, Dept Comp Sci, Lahore 54000, Pakistan
[3] Princess Nourah bint Abdulrahman Univ, Coll Comp & Informat Sci, Dept Informat Syst, POB 84428, Riyadh 11671, Saudi Arabia
[4] Al Ain Univ, Dept Comp Sci & Software Engn, Abu Dhabi, U Arab Emirates
[5] Univ Johannesburg, Sch Elect Engn, Dept Elect & Elect Engn Sci, ZA-2006 Johannesburg, South Africa
[6] Univ Teknol MARA, Coll Comp Informat & Math, Sch Comp Sci, Shah Alam 40450, Selangor, Malaysia
[7] Univ Moncton, Fac Engn, Moncton, NB E1A3 E9, Canada
[8] Int Inst Technol & Management IITG, Commune Dakanda 1989, Libreville, Gabon
[9] Bridges Acad Excellence, Tunis 1002, Centre Ville, Tunisia
关键词
Machine learning; Bayes methods; software defect prediction; ensemble classification; heterogeneous classifiers; random forest; support vector machine; naive Bayes; QUALITY;
D O I
10.1109/ACCESS.2024.3358201
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Software defect prediction plays a crucial role in enhancing software quality while achieving cost savings in testing. Its primary objective is to identify and send only defective modules to the testing stage. This research introduces an intelligent ensemble-based software defect prediction model that combines diverse classifiers. The proposed model employs a two-stage prediction process to detect defective modules. In the first stage, four supervised machine learning algorithms are employed: Random Forest, Support Vector Machine, Naive Bayes, and Artificial Neural Network. These algorithms are optimized through iterative parameter optimization to achieve the highest accuracy possible. In the second stage, the predictive accuracy of the individual classifiers is integrated into a voting ensemble to make the final predictions. This ensemble approach further improves the accuracy and reliability of the defect predictions. Seven historical defect datasets from the NASA MDP repository, namely CM1, JM1, MC2, MW1, PC1, PC3, and PC4, were utilized to implement and evaluate the proposed defect prediction system. The results demonstrate that each dataset's proposed intelligent system achieved remarkable accuracy, outperforming twenty state-of-the-art defect prediction techniques, including base classifiers and ensemble methods.
引用
收藏
页码:20376 / 20395
页数:20
相关论文
共 50 条
  • [1] Software Defect Prediction Using an Intelligent Ensemble-Based Model (vol 12, pg 20376, 2024)
    Ali, Misbah
    Mazhar, Tehseen
    Arif, Yasir
    Al-Otaibi, Shaha
    Ghadi, Yazeed Yasin
    Shahzad, Tariq
    Khan, Muhammad Amir
    Hamam, Habib
    IEEE ACCESS, 2024, 12 : 164843 - 164843
  • [2] The Use of Ensemble-Based Data Preprocessing Techniques for Software Defect Prediction
    Gao, Kehan
    Khoshgoftaar, Taghi M.
    Napolitano, Amri
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2014, 24 (09) : 1229 - 1253
  • [3] Cost-sensitive and ensemble-based prediction model for outsourced software project risk prediction
    Hu, Yong
    Feng, Bin
    Mo, Xizhu
    Zhang, Xiangzhou
    Ngai, E. W. T.
    Fan, Ming
    Liu, Mei
    DECISION SUPPORT SYSTEMS, 2015, 72 : 11 - 23
  • [4] Data and Ensemble Machine Learning Fusion Based Intelligent Software Defect Prediction System
    Abbas, Sagheer
    Aftab, Shabib
    Khan, Muhammad Adnan
    Ghazal, Taher M.
    Al Hamadi, Hussam
    Yeun, Chan Yeob
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 75 (03): : 6083 - 6100
  • [5] Ensemble learning based software defect prediction
    Dong, Xin
    Liang, Yan
    Miyamoto, Shoichiro
    Yamaguchi, Shingo
    JOURNAL OF ENGINEERING RESEARCH, 2023, 11 (04): : 377 - 391
  • [6] Software Defect Prediction Based Ensemble Approach
    Harikiran J.
    Chandana B.S.
    Srinivasarao B.
    Raviteja B.
    Reddy T.S.
    Computer Systems Science and Engineering, 2023, 45 (03): : 2313 - 2331
  • [7] Prediction and Analysis of PM 2.5 Concentration in Seoul Using Ensemble-based Model
    Ryu, Minji
    Son, Sanghun
    Kim, Jinsoo
    KOREAN JOURNAL OF REMOTE SENSING, 2022, 38 (06) : 1191 - 1205
  • [8] An ensemble-based model for predicting agile software development effort
    Malgonde, Onkar
    Chari, Kaushal
    EMPIRICAL SOFTWARE ENGINEERING, 2019, 24 (02) : 1017 - 1055
  • [9] An ensemble-based model for predicting agile software development effort
    Onkar Malgonde
    Kaushal Chari
    Empirical Software Engineering, 2019, 24 : 1017 - 1055
  • [10] Software Defect Prediction Using Heterogeneous Ensemble Classification Based on Segmented Patterns
    Alsawalqah, Hamad
    Hijazi, Neveen
    Eshtay, Mohammed
    Faris, Hossam
    Al Radaideh, Ahmed
    Aljarah, Ibrahim
    Alshamaileh, Yazan
    APPLIED SCIENCES-BASEL, 2020, 10 (05):