An Empirical Study for Enhanced Software Defect Prediction Using a Learning-Based Framework

Authors
Kamal Bashir
Tianrui Li
Chubato Wondaferaw Yohannese
Affiliations
[1] Southwest Jiaotong University, School of Information Science and Technology
[2] Karary University, Department of Information Technology, College of Computer Science and Information Technology
Source
International Journal of Computational Intelligence Systems | 2018 / Vol. 12
Keywords
Software defect prediction; Feature selection; Data sampling; Noise filtering
DOI
Not available
Abstract
The objective of software defect prediction (SDP) is to identify defect-prone modules. This is achieved by constructing prediction models from datasets obtained by mining historical software repositories. However, data mined from these repositories often suffer from high dimensionality, class imbalance, and mislabeled instances, which degrade classification performance and increase model complexity. To mitigate these effects, this paper proposes an integrated preprocessing framework in which feature selection (FS), data balancing (DB), and noise filtering (NF) techniques are combined to address the factors that degrade learning performance. We apply the proposed framework to three types of software metrics, namely static code metrics (SCM), object-oriented metrics (OOM), and combined metrics (CombM), and build models under four scenarios (S): (S1) original data; (S2) FS subsets; (S3) FS subsets after DB using random undersampling (RUS) and the synthetic minority oversampling technique (SMOTE); and (S4) FS subsets after DB (RUS and SMOTE) and NF using the iterative partitioning filter (IPF) and iterative noise filtering based on the fusion of classifiers (INFFC). Empirical results show that (1) the integrated preprocessing of FS, DB, and NF improves the performance of all the models built for SDP; (2) for all FS methods, all the models improve progressively from S2 through S4 on all the software metrics; (3) model performance under S4 is statistically significantly better than under S3 for all the software metrics; and (4) achieving optimal model performance for SDP requires appropriate implementation of the proposed framework. The results also validate the effectiveness of our proposal and provide guidelines for obtaining quality training data that enhances model performance for SDP.
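The S2 to S4 progression described in the abstract can be illustrated with a minimal, standard-library-only sketch. This is not the authors' implementation: the function names (`select_features`, `random_under_sample`, `knn_noise_filter`, `preprocess`) and the toy data are hypothetical, the feature ranking is a simple class-mean-gap filter chosen for brevity, and the noise filter is a single-pass nearest-neighbour vote standing in for IPF and INFFC, which are iterative, classifier-based filters. Only RUS is shown for the balancing step; SMOTE is omitted.

```python
import random
from statistics import mean, pstdev

def select_features(X, y, k):
    """FS stand-in (assumption): keep the k features with the largest class-mean gap."""
    scores = []
    for j in range(len(X[0])):
        col = [row[j] for row in X]
        pos = [v for v, label in zip(col, y) if label == 1]
        neg = [v for v, label in zip(col, y) if label == 0]
        spread = pstdev(col) or 1.0  # avoid dividing by zero on constant features
        scores.append((abs(mean(pos) - mean(neg)) / spread, j))
    return sorted(j for _, j in sorted(scores, reverse=True)[:k])

def random_under_sample(X, y, rng):
    """DB step: RUS, dropping majority-class rows until both classes are equal in size."""
    pos = [i for i, label in enumerate(y) if label == 1]
    neg = [i for i, label in enumerate(y) if label == 0]
    minority, majority = (pos, neg) if len(pos) <= len(neg) else (neg, pos)
    kept = sorted(minority + rng.sample(majority, len(minority)))
    return [X[i] for i in kept], [y[i] for i in kept]

def knn_noise_filter(X, y, n_neighbors=3):
    """NF stand-in (assumption): drop rows whose label disagrees with most neighbours."""
    def dist(a, b):
        return sum((u - v) ** 2 for u, v in zip(a, b))
    keep = []
    for i, row in enumerate(X):
        others = sorted((dist(row, X[j]), j) for j in range(len(X)) if j != i)
        votes = [y[j] for _, j in others[:n_neighbors]]
        if votes.count(y[i]) * 2 > len(votes):  # strict majority agrees with the label
            keep.append(i)
    return [X[i] for i in keep], [y[i] for i in keep]

def preprocess(X, y, k=2, seed=0):
    """Chain the three steps in the order of scenarios S2 -> S3 -> S4."""
    rng = random.Random(seed)
    cols = select_features(X, y, k)                     # S2: FS subset
    X_fs = [[row[j] for j in cols] for row in X]
    X_bal, y_bal = random_under_sample(X_fs, y, rng)    # S3: FS + DB
    X_clean, y_clean = knn_noise_filter(X_bal, y_bal)   # S4: FS + DB + NF
    return cols, X_clean, y_clean

# Toy data (hypothetical): feature 1 is uninformative; the last row is a
# clean-looking module mislabeled as defective (label 1).
X = [
    [1.0, 5.0, 1.2], [1.1, 5.1, 0.9], [0.9, 4.9, 1.0], [1.2, 5.0, 1.1],
    [0.8, 5.2, 0.8], [1.0, 4.8, 1.0], [1.3, 5.0, 1.2], [0.7, 5.1, 0.9],
    [9.0, 5.0, 9.1], [9.2, 5.1, 8.8], [8.9, 4.9, 9.0], [9.1, 5.0, 9.2],
    [1.0, 5.0, 1.0],
]
y = [0] * 8 + [1] * 5

cols, X_clean, y_clean = preprocess(X, y)
print("selected features:", cols)
print("class counts after S4:", y_clean.count(0), y_clean.count(1))
```

On this toy set the uninformative feature is dropped, the majority class is reduced to the size of the minority class, and the mislabeled row is removed by the neighbour vote. A classifier would then be trained on `X_clean` and `y_clean`; how the paper pairs specific FS, DB, and NF methods is not stated in the abstract and is not assumed here.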
Pages: 282 - 298
Page count: 16
Related papers
50 in total
  • [41] Deep Learning-Based Defect Prediction for Mobile Applications
    Jorayeva, Manzura
    Akbulut, Akhan
    Catal, Cagatay
    Mishra, Alok
    SENSORS, 2022, 22 (13)
  • [42] Software defect prediction using learning to rank approach
    Nassif, Ali Bou
    Talib, Manar Abu
    Azzeh, Mohammad
    Alzaabi, Shaikha
    Khanfar, Rawan
    Kharsa, Ruba
    Angelis, Lefteris
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [43] Performing Software Defect Prediction Using Deep Learning
    Gurung, Saksham
    Communications in Computer and Information Science, 2022, 1697 CCIS : 319 - 331
  • [44] Using Class Imbalance Learning for Software Defect Prediction
    Wang, Shuo
    Yao, Xin
    IEEE TRANSACTIONS ON RELIABILITY, 2013, 62 (02) : 434 - 443
  • [45] A Survey on Software Defect Prediction Using Deep Learning
    Akimova, Elena N.
    Bersenev, Alexander Yu
    Deikov, Artem A.
    Kobylkin, Konstantin S.
    Konygin, Anton, V
    Mezentsev, Ilya P.
    Misilov, Vladimir E.
    MATHEMATICS, 2021, 9 (11)
  • [46] Software defect prediction using learning to rank approach
    Ali Bou Nassif
    Manar Abu Talib
    Mohammad Azzeh
    Shaikha Alzaabi
    Rawan Khanfar
    Ruba Kharsa
    Lefteris Angelis
    Scientific Reports, 13
  • [47] Software Defect Density Prediction Using Deep Learning
    Alghanim, Firas
    Azzeh, Mohammad
    El-Hassan, Ammar
    Qattous, Hazem
    IEEE ACCESS, 2022, 10 : 114629 - 114641
  • [48] Multi-graph learning-based software defect location
    Yin, Ying
    Shi, Yucen
    Zhao, Yuhai
    Wahab, Fazal
    JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2024, 36 (04)
  • [49] Using Coding-Based Ensemble Learning to Improve Software Defect Prediction
    Sun, Zhongbin
    Song, Qinbao
    Zhu, Xiaoyan
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 42 (06) : 1806 - 1817
  • [50] Software Defect Prediction Method Based on Stable Learning
    Fan, Xi
    Mao, Jingen
    Lian, Liangjue
    Yu, Li
    Zheng, We
    Ge, Yun
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 78 (01) : 65 - 84