Studying the effectiveness of deep active learning in software defect prediction

Cited by: 2
Authors
Feyzi F. [1 ]
Daneshdoost A. [1 ]
Affiliations
[1] Faculty of Engineering, University of Guilan, Rasht
Keywords
active learning; bug prediction; code metrics; deep learning
DOI
10.1080/1206212X.2023.2252117
Abstract
Accurate prediction of defective software modules is important for prioritizing quality assurance efforts, allocating testing resources sensibly, reducing costs, and improving software quality. Several studies have used machine learning to predict software defects. However, complex structures and imbalanced class distributions in software defect data make it challenging to learn an effective defect prediction model. This article proposes two deep learning-based defect prediction models that use static code metrics. Pool-based active learning is employed to enhance the learning process and improve the performance of the proposed models. The study also investigates whether active learning can reduce the need for large amounts of labeled data when building deep learning models. To deal with the imbalanced distribution of software modules between the defective and non-defective classes, Near-Miss under-sampling and KNN, with different numbers of neighbors, are used. They were chosen for their good performance on binary classification problems. Experiments are performed on two well-known, publicly available datasets: the GitHub Bug Dataset and the public Unified Bug Dataset for Java projects. The evaluation results show that the proposed models are effective in comparison with traditional machine learning algorithms. On the Unified Bug Dataset, F-measure and AUC improved by 13 and 11 percent, respectively, at the file level, and by 14 and 11 percent, respectively, at the class level. © 2023 Informa UK Limited, trading as Taylor & Francis Group.
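The two techniques named in the abstract, pool-based active learning and Near-Miss under-sampling, can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not state which query strategy or which Near-Miss variant was used, so the sketch assumes least-confidence uncertainty sampling and the NearMiss-1 rule; the function names `least_confidence_query` and `near_miss_1` are hypothetical.

```python
import math


def least_confidence_query(probs, batch_size):
    """Pick the pool items the model is least sure about.

    probs: predicted defect probabilities for the unlabeled pool.
    Returns the indices of the batch_size items closest to 0.5.
    (Assumed query strategy; the abstract does not name one.)
    """
    ranked = sorted(range(len(probs)), key=lambda i: abs(probs[i] - 0.5))
    return ranked[:batch_size]


def near_miss_1(majority, minority, n_keep, k=3):
    """NearMiss-1 style under-sampling (assumed variant).

    Keeps the n_keep majority-class samples whose mean Euclidean
    distance to their k nearest minority-class samples is smallest.
    Returns the kept indices in ascending order.
    """
    def mean_knn_dist(x):
        nearest = sorted(math.dist(x, m) for m in minority)[:k]
        return sum(nearest) / len(nearest)

    ranked = sorted(range(len(majority)),
                    key=lambda i: mean_knn_dist(majority[i]))
    return sorted(ranked[:n_keep])


# Toy data: predicted probabilities for five unlabeled modules.
pool_probs = [0.95, 0.52, 0.10, 0.45, 0.80]
to_label = least_confidence_query(pool_probs, 2)

# Toy data: non-defective (majority) and defective (minority) modules
# described by two code metrics each.
non_defective = [(0, 0), (5, 5), (1, 1), (9, 9)]
defective = [(1, 0), (0, 1)]
kept = near_miss_1(non_defective, defective, n_keep=2, k=2)
```

In a full pool-based loop, the modules selected in `to_label` would be labeled, moved from the pool to the training set, and the model retrained before the next query round.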
Pages: 534-552
Number of pages: 18