Studying the effectiveness of deep active learning in software defect prediction

Cited by: 2
Authors
Feyzi F. [1 ]
Daneshdoost A. [1 ]
Affiliation
[1] Faculty of Engineering, University of Guilan, Rasht
Keywords
Active learning; bug prediction; code metrics; deep learning
DOI
10.1080/1206212X.2023.2252117
Abstract
Accurate prediction of defective software modules is important for prioritizing quality assurance efforts, allocating testing resources sensibly, reducing costs, and improving software quality. Several studies have used machine learning to predict software defects. However, complex structures and imbalanced class distributions in software defect data make it challenging to learn an effective defect prediction model. This article proposes two deep learning-based defect prediction models that use static code metrics. Pool-based active learning is employed to enhance the learning process and improve the performance of the proposed models. In this context, the article investigates whether active learning can reduce the amount of labeled data needed to build deep learning models. To deal with the imbalanced distribution of software modules between the defective and non-defective classes, Near-Miss under-sampling and KNN, with different numbers of neighbors, are used. Both were chosen for their good performance on binary classification problems. Experiments are performed on two well-known, publicly available datasets: the GitHub Bug Dataset and the public Unified Bug Dataset for Java projects. The evaluation results show that the proposed models are more effective than traditional machine learning algorithms. On the Unified Bug Dataset, the F-measure and AUC improved by 13 and 11 percent, respectively, at the file level, and by 14 and 11 percent, respectively, at the class level. © 2023 Informa UK Limited, trading as Taylor & Francis Group.
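The two techniques named in the abstract, Near-Miss under-sampling and pool-based active learning, can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which Near-Miss variant or query strategy was used, so this sketch assumes a NearMiss-1 style rule and uncertainty sampling, and it swaps the deep learning models for a small k-nearest-neighbor scorer to stay self-contained. The function names `near_miss_1`, `knn_proba`, and `active_learning`, and the `oracle` callback, are hypothetical.

```python
import math


def near_miss_1(X, y, k=3, majority=0, minority=1):
    """NearMiss-1 style under-sampling (assumed variant).

    Keeps the majority samples whose mean distance to their k nearest
    minority samples is smallest, until both classes have equal size.
    Assumes at least one minority sample is present.
    """
    minority_pts = [x for x, label in zip(X, y) if label == minority]
    majority_pts = [x for x, label in zip(X, y) if label == majority]

    def mean_knn_dist(x):
        dists = sorted(math.dist(x, m) for m in minority_pts)
        return sum(dists[:k]) / min(k, len(dists))

    kept = sorted(majority_pts, key=mean_knn_dist)[:len(minority_pts)]
    X_bal = kept + minority_pts
    y_bal = [majority] * len(kept) + [minority] * len(minority_pts)
    return X_bal, y_bal


def knn_proba(X_train, y_train, x, k=3):
    """Fraction of the k nearest labeled samples that are defective (label 1)."""
    nearest = sorted(zip(X_train, y_train), key=lambda p: math.dist(p[0], x))[:k]
    return sum(label for _, label in nearest) / len(nearest)


def active_learning(X_pool, oracle, seed_idx, budget, k=3):
    """Pool-based active learning with uncertainty sampling (assumed strategy).

    oracle(i) returns the true label of pool item i, standing in for a
    human annotator. Returns a dict mapping queried indices to labels.
    """
    labeled = {i: oracle(i) for i in seed_idx}
    for _ in range(budget):
        candidates = [i for i in range(len(X_pool)) if i not in labeled]
        if not candidates:
            break
        X_l = [X_pool[i] for i in labeled]
        y_l = [labeled[i] for i in labeled]
        # Query the unlabeled item whose predicted probability is closest to 0.5.
        query = min(
            candidates,
            key=lambda i: abs(knn_proba(X_l, y_l, X_pool[i], k) - 0.5),
        )
        labeled[query] = oracle(query)
    return labeled
```

In use, `near_miss_1` would be applied to the labeled training data before fitting a model, and `active_learning` would be called with a pool of unlabeled modules, a small seed set, and a labeling budget. In the setting the abstract describes, the k-NN scorer would be replaced by the trained deep model's predicted probabilities.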
Pages: 534-552
Page count: 18