Studying the effectiveness of deep active learning in software defect prediction

被引：2

作者：

Feyzi F. ^{[1
]}

Daneshdoost A. ^{[1
]}

机构：

[1] Faculty of Engineering, University of Guilan, Rasht

来源：

International Journal of Computers and Applications | 2023年 / 45卷 / 7-8期

关键词：

active learning; Bug prediction; code metrics; deep learning;

D O I：

10.1080/1206212X.2023.2252117

中图分类号：

学科分类号：

摘要：

Accurate prediction of defective software modules is of great importance for prioritizing quality assurance efforts, reasonably allocating testing resources, reducing costs and improving software quality. Several studies have used machine learning to predict software defects. However, complex structures and imbalanced class distributions in software defect data make learning an effective defect prediction model challenging. In this article, two deep learning-based defect prediction models using static code metrics are proposed. In order to enhance the learning process and improve the performance of the proposed models, pool-based active learning is employed. In this regard, the possibility of using active learning to mitigate the need for a large amount of labeled data in the process of building deep learning models is investigated. To deal with imbalanced distribution of software modules between defective and non-defective classes, Near-Miss under-sampling and KNN, with different number of neighbors, are used. The reason for choosing them is their good performance in binary classification problems. Experiments are performed on two well-known, publicly available datasets, GitHub Bug Dataset and public Uniﬁed Bug Dataset for java projects. The evaluation results reveal the effectiveness of our proposed models in comparison to the traditional machine learning algorithms. In the conducted investigations on the Unified Bug Dataset, at the file level, the value of F-measure and AUC criteria have improved by 13 and 11 percent, respectively and at the class level, the values have improved by 14 and 11 percent, respectively. © 2023 Informa UK Limited, trading as Taylor & Francis Group.

引用

页码：534 / 552

页数：18

共 50 条

[1] Active Learning for Software Defect Prediction
Luo, Guangchun
Ma, Ying
Qin, Ke
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2012, E95D (06) : 1680 - 1683
[2] On the use of deep learning in software defect prediction
Giray, Gorkem
Bennin, Kwabena Ebo
Koksal, Omer
Babur, Onder
Tekinerdogan, Bedir
JOURNAL OF SYSTEMS AND SOFTWARE, 2023, 195
[3] Deep learning based software defect prediction
Qiao, Lei
Li, Xuesong
Umer, Qasim
Guo, Ping
NEUROCOMPUTING, 2020, 385 : 100 - 110
[4] Software Defect Prediction using Deep Learning
Nevendra, Meetesh
Singh, Pradeep
ACTA POLYTECHNICA HUNGARICA, 2021, 18 (10) : 173 - 189
[5] Deep Learning for Software Defect Prediction in time
Yadav, Monika
Singh, Vijendra
Rastogi, Priyanka
2018 FIFTH INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (IEEE PDGC), 2018, : 7 - 12
[6] Is deep learning good enough for software defect prediction?
Pandey, Sushant Kumar
Haldar, Arya
Tripathi, Anil Kumar
INNOVATIONS IN SYSTEMS AND SOFTWARE ENGINEERING, 2023,
[7] Performing Software Defect Prediction Using Deep Learning
Gurung, Saksham
Communications in Computer and Information Science, 2022, 1697 CCIS : 319 - 331
[8] A Survey on Software Defect Prediction Using Deep Learning
Akimova, Elena N.
Bersenev, Alexander Yu
Deikov, Artem A.
Kobylkin, Konstantin S.
Konygin, Anton, V
Mezentsev, Ilya P.
Misilov, Vladimir E.
MATHEMATICS, 2021, 9 (11)
[9] Deep Semantic Feature Learning for Software Defect Prediction
Wang, Song
Liu, Taiyue
Nam, Jaechang
Tan, Lin
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2020, 46 (12) : 1267 - 1293
[10] A Survey of Software Defect Prediction Based on Deep Learning
Meetesh Nevendra
Pradeep Singh
Archives of Computational Methods in Engineering, 2022, 29 : 5723 - 5748

← 1 2 3 4 5 →