An Imbalanced Deep Learning Model for Bug Localization

Cited by: 2
Authors
Bui Thi Mai Anh [1]
Nguyen Viet Luyen [1]
Affiliations
[1] Hanoi Univ Sci & Technol, Sch Informat & Commun Technol, Lab Intelligent Software Engn, Hanoi, Vietnam
Keywords
bug localization; deep neural network; imbalanced data-set; bootstrapping
DOI
10.1109/APSECW53869.2021.00017
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
Debugging and locating faulty source files are tedious and time-consuming tasks. To improve productivity and help developers focus on crucial files, automated bug localization models have been proposed for years. These models recommend buggy source files by ranking them according to their relevance to a given bug report. There are two significant challenges in this research field: (i) narrowing the lexical gap between bug reports, which are typically written in natural language, and source files, which are written in programming languages; (ii) reducing the impact of imbalanced data distribution on model training, since only a few source files relate to a given bug report while the majority are irrelevant. In this paper, we propose a deep neural network model that extracts essential information hidden within bug reports and source files by capturing not only lexical relations but also semantic details and domain knowledge features such as bug-fixing history and code change history. To address the skewed class distribution, we apply a focal loss function combined with a bootstrapping method that rectifies samples of the minority class within iterative training batches. We assessed the performance of our approach on six large-scale Java open-source projects. The empirical results show that the proposed method outperforms other state-of-the-art models, improving Mean Average Precision (MAP) by 3% to 11% and Mean Reciprocal Rank (MRR) by 2% to 14%.
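The abstract names a focal loss for the skewed class distribution and reports results as MAP and MRR. The sketch below illustrates those three quantities in plain Python. It is not the authors' implementation: the focal loss follows the commonly used binary formulation, the default values `gamma=2.0` and `alpha=0.25` are assumptions rather than settings taken from the paper, and the function names and the 0/1 relevance lists are hypothetical.

```python
import math


def focal_loss(p, y, gamma=2.0, alpha=0.25):
    """Binary focal loss for one (bug report, source file) pair.

    p: predicted probability that the file is buggy (positive class).
    y: ground-truth label, 1 for buggy and 0 for not relevant.
    gamma and alpha are assumed defaults, not values from the paper.
    """
    eps = 1e-12
    p = min(max(p, eps), 1.0 - eps)           # avoid log(0)
    p_t = p if y == 1 else 1.0 - p            # probability of the true class
    alpha_t = alpha if y == 1 else 1.0 - alpha
    # (1 - p_t)^gamma down-weights examples the model already gets right.
    return -alpha_t * (1.0 - p_t) ** gamma * math.log(p_t)


def reciprocal_rank(ranked_labels):
    """1 / rank of the first relevant file, or 0.0 if none is relevant."""
    for rank, relevant in enumerate(ranked_labels, start=1):
        if relevant:
            return 1.0 / rank
    return 0.0


def average_precision(ranked_labels):
    """Mean of precision@k taken at each rank k that holds a relevant file."""
    hits = 0
    total = 0.0
    for rank, relevant in enumerate(ranked_labels, start=1):
        if relevant:
            hits += 1
            total += hits / rank
    return total / hits if hits else 0.0


def mean_over_reports(metric, rankings):
    """Average a per-report metric over all bug reports."""
    return sum(metric(r) for r in rankings) / len(rankings)


# Hypothetical rankings for two bug reports: 1 marks a truly buggy file.
rankings = [[0, 1, 0, 1], [1, 0, 0, 0]]
mrr = mean_over_reports(reciprocal_rank, rankings)
map_score = mean_over_reports(average_precision, rankings)
```

With `gamma=0` and `alpha=0.5` the loss reduces to half the ordinary cross-entropy, which makes the effect of the focusing term easy to check in isolation.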
Pages: 32-40
Number of pages: 9