A Feature Selection-Based K-NN Model for Fast Software Defect Prediction

被引:4
|
作者
Awotunde, Joseph Bamidele [1 ]
Misra, Sanjay [2 ]
Adeniyi, Abidemi Emmanuel [2 ]
Abiodun, Moses Kazeem [1 ,3 ]
Kaushik, Manju [4 ]
Lawrence, Morolake Oladayo [5 ]
机构
[1] Univ Ilorin, Dept Comp Sci, Ilorin, Nigeria
[2] Ostfold Univ Coll, Dept Comp Sci & Commun, Halden, Norway
[3] Landmark Univ, Dept Comp Sci, Omu Aran, Nigeria
[4] Amity Univ, Amity Inst Informat Technol, Jaipur, Rajasthan, India
[5] Baze Univ, Dept Comp Sci, Abuja, Nigeria
关键词
Software defect prediction; Machine learning; Extreme gradient boost; Feature selection; Prediction; Software development life cycle;
D O I
10.1007/978-3-031-10542-5_4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Software Defect Prediction (SDP) is an advanced technological method of predicting software defects in the software development life cycle. Various research works have been previously being done on SDP but the performance of these methods varied from several datasets, hence, making them inconsistent for SDP in the unknown software project. But the hybrid technique using feature selection enabled with machine learning for SDP can be very efficient as it takes the advantage of various methods to come up with better prediction accuracy for a given dataset when compared with an individual classifier. The major issues with individual ML-based models for SDP are the long detection time, vulnerability of the software project, and high dimensionality of the feature parameters. Therefore, this study proposes a hybrid model using a feature selection enabled Extreme Gradient Boost (XGB) classifier to address these mentioned challenges. The cleaned NASA MDP datasets were used for the implementation of the proposed model, and various performance metrics like F-score, accuracy, and MCC were used to reveal the performance of the model. The results of the proposed model when compared with state-of-the-art methods without feature selection perform better in terms of the metrics used. The results reveal that the proposed model outperformed all other prediction techniques.
引用
收藏
页码:49 / 61
页数:13
相关论文
共 50 条
  • [1] Combining feature selection with feature weighting for k-NN classifier
    Bao, YG
    Du, XY
    Ishii, N
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2002, 2002, 2412 : 461 - 468
  • [2] A many objective based feature selection model for software defect prediction
    Mao, Qi
    Zhang, Jingbo
    Zhao, Tianhao
    Cai, Xingjuan
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (19):
  • [3] Exploring the Feature Selection of the EEG Signal Time and Frequency Domain Features for k-NN and Weighted k-NN
    Diah, Theresia K.
    Faqih, Akhmad
    Kusumoputro, Benyamin
    PROCEEDINGS OF 2019 IEEE R10 HUMANITARIAN TECHNOLOGY CONFERENCE (IEEE R10 HTC 2019), 2019, : 196 - 199
  • [4] DBSCAN Parameter Selection Based on K-NN
    Delgado, Leonardo
    Morales, Eduardo F.
    ADVANCES IN COMPUTATIONAL INTELLIGENCE (MICAI 2021), PT I, 2021, 13067 : 187 - 198
  • [5] Induction motor fault diagnosis based on the k-NN and optimal feature selection
    Nguyen, Ngoc-Tu
    Lee, Hong-Hee
    INTERNATIONAL JOURNAL OF ELECTRONICS, 2010, 97 (09) : 1071 - 1081
  • [6] Feature Selection by Using DE Algorithm and k-NN Classifier
    Senel, Fatih Ahmet
    Yuksel, Asim Sinan
    Yigit, Tuncay
    ARTIFICIAL INTELLIGENCE AND APPLIED MATHEMATICS IN ENGINEERING PROBLEMS, 2020, 43 : 886 - 893
  • [7] Efficient Selection Algorithm for Fast k-NN Search on GPUs
    Tang, Xiaoxin
    Huang, Zhiyi
    Eyers, David
    Mills, Steven
    Guo, Minyi
    2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2015, : 397 - 406
  • [8] Software Defect Prediction Scheme Based on Feature Selection
    Wang, Pei
    Jin, Cong
    Jin, Shu-Wei
    2012 INTERNATIONAL SYMPOSIUM ON INFORMATION SCIENCE AND ENGINEERING (ISISE), 2012, : 477 - 480
  • [9] Bagging-based ensemble model and algorithm of k-NN prediction
    He, Liang
    Song, Qin-Bao
    Hai, Zhen
    Shen, Jun-Yi
    Kongzhi yu Juece/Control and Decision, 2010, 25 (01): : 48 - 52
  • [10] Fast k-means based on k-NN Graph
    Deng, Cheng-Hao
    Zhao, Wan-Lei
    2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2018, : 1220 - 1223