Software Defect Prediction Using an Intelligent Ensemble-Based Model

被引:10
|
作者
Ali, Misbah [1 ]
Mazhar, Tehseen [1 ]
Arif, Yasir [2 ]
Al-Otaibi, Shaha [3 ]
Ghadi, Yazeed Yasin [4 ]
Shahzad, Tariq [5 ]
Khan, Muhammad Amir [6 ]
Hamam, Habib [5 ,7 ,8 ,9 ]
机构
[1] Virtual Univ Pakistan, Dept Comp Sci, Lahore 55150, Pakistan
[2] Global Inst, Dept Comp Sci, Lahore 54000, Pakistan
[3] Princess Nourah bint Abdulrahman Univ, Coll Comp & Informat Sci, Dept Informat Syst, POB 84428, Riyadh 11671, Saudi Arabia
[4] Al Ain Univ, Dept Comp Sci & Software Engn, Abu Dhabi, U Arab Emirates
[5] Univ Johannesburg, Sch Elect Engn, Dept Elect & Elect Engn Sci, ZA-2006 Johannesburg, South Africa
[6] Univ Teknol MARA, Coll Comp Informat & Math, Sch Comp Sci, Shah Alam 40450, Selangor, Malaysia
[7] Univ Moncton, Fac Engn, Moncton, NB E1A3 E9, Canada
[8] Int Inst Technol & Management IITG, Commune Dakanda 1989, Libreville, Gabon
[9] Bridges Acad Excellence, Tunis 1002, Centre Ville, Tunisia
关键词
Machine learning; Bayes methods; software defect prediction; ensemble classification; heterogeneous classifiers; random forest; support vector machine; naive Bayes; QUALITY;
D O I
10.1109/ACCESS.2024.3358201
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Software defect prediction plays a crucial role in enhancing software quality while achieving cost savings in testing. Its primary objective is to identify and send only defective modules to the testing stage. This research introduces an intelligent ensemble-based software defect prediction model that combines diverse classifiers. The proposed model employs a two-stage prediction process to detect defective modules. In the first stage, four supervised machine learning algorithms are employed: Random Forest, Support Vector Machine, Naive Bayes, and Artificial Neural Network. These algorithms are optimized through iterative parameter optimization to achieve the highest accuracy possible. In the second stage, the predictive accuracy of the individual classifiers is integrated into a voting ensemble to make the final predictions. This ensemble approach further improves the accuracy and reliability of the defect predictions. Seven historical defect datasets from the NASA MDP repository, namely CM1, JM1, MC2, MW1, PC1, PC3, and PC4, were utilized to implement and evaluate the proposed defect prediction system. The results demonstrate that each dataset's proposed intelligent system achieved remarkable accuracy, outperforming twenty state-of-the-art defect prediction techniques, including base classifiers and ensemble methods.
引用
收藏
页码:20376 / 20395
页数:20
相关论文
共 50 条
  • [21] Multi-model ensemble-based probabilistic prediction of tropical cyclogenesis using TIGGE model forecasts
    Neeru Jaiswal
    C. M. Kishtawal
    Swati Bhomia
    P. K. Pal
    Meteorology and Atmospheric Physics, 2016, 128 : 601 - 611
  • [22] An Ensemble Oversampling Model for Class Imbalance Problem in Software Defect Prediction
    Huda, Shamsul
    Liu, Kevin
    Abdelrazek, Mohamed
    Ibrahim, Amani
    Alyahya, Sultan
    Al-Dossari, Hmood
    Ahmad, Shafiq
    IEEE ACCESS, 2018, 6 : 24184 - 24195
  • [23] Ensemble-based software fault prediction with two staged data pre-processing
    Kulkarni, Shubham P.
    Patel, Sanjeev
    INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2023, 72 (03) : 212 - 222
  • [24] Model Ensemble-Based Prognostic Framework for Fatigue Crack Growth Prediction
    Hoang-Phuong Nguyen
    Zio, Enrico
    Liu, Jie
    2017 2ND INTERNATIONAL CONFERENCE ON SYSTEM RELIABILITY AND SAFETY (ICSRS), 2017, : 327 - 331
  • [25] Ensemble-based deep learning techniques for customer churn prediction model
    Subramanian, R. Siva
    Yamini, B.
    Sudha, Kothandapani
    Sivakumar, S.
    KYBERNETES, 2024,
  • [26] Handling Imbalanced Data using Ensemble Learning in Software Defect Prediction
    Malhotra, Ruchika
    Jain, Juhi
    PROCEEDINGS OF THE CONFLUENCE 2020: 10TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING, 2020, : 300 - 304
  • [27] Software Defect Prediction Using Ensemble Learning: A Systematic Literature Review
    Matloob, Faseeha
    Ghazal, Taher M.
    Taleb, Nasser
    Aftab, Shabib
    Ahmad, Munir
    Khan, Muhammad Adnan
    Abbas, Sagheer
    Soomro, Tariq Rahim
    IEEE ACCESS, 2021, 9 : 98754 - 98771
  • [28] Prediction of Stock Market Using an Ensemble Learning-based Intelligent Model
    Faghihi-Nezhad, Mohammad-Taghi
    Minaei-Bidgoli, Behrouz
    INDUSTRIAL ENGINEERING AND MANAGEMENT SYSTEMS, 2018, 17 (03): : 479 - 496
  • [29] eDiaPredict: An Ensemble-based Framework for Diabetes Prediction
    Singh, Ashima
    Dhillon, Arwinder
    Kumar, Neeraj
    Hossain, M. Shamim
    Muhammad, Ghulam
    Kumar, Manoj
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 17 (02)
  • [30] Ensemble-based prediction of RNA secondary structures
    Nima Aghaeepour
    Holger H Hoos
    BMC Bioinformatics, 14