On the relationship between bug reports and queries for text retrieval-based bug localization

Authors
Chris Mills
Esteban Parra
Jevgenija Pantiuchina
Gabriele Bavota
Sonia Haiduc
Affiliations
[1] Florida State University
[2] Università della Svizzera italiana
Keywords
Bug localization; Query formulation; Text retrieval
DOI
Not available
Abstract
As societal dependence on software continues to grow, bugs are becoming increasingly costly in terms of financial resources as well as human safety. Bug localization is the process by which a developer identifies buggy code that needs to be fixed to make a system safer and more reliable. Unfortunately, manually attempting to locate bugs solely from the information in a bug report requires advanced knowledge of how a system is constructed and the way its constituent pieces interact. Therefore, previous work has investigated numerous techniques for reducing the human effort spent in bug localization. One of the most common approaches is Text Retrieval (TR) in which a system’s source code is indexed into a search space that is then queried for code relevant to a given bug report. In the last decade, dozens of papers have proposed improvements to bug localization using TR with largely positive results. However, several other studies have called the technique into question. According to these studies, evaluations of TR-based approaches often lack sufficient controls on biases that artificially inflate the results, namely: misclassified bugs, tangled commits, and localization hints. Here we argue that contemporary evaluations of TR approaches also include a negative bias that outweighs the previously identified positive biases: while TR approaches expect a natural language query, most evaluations simply formulate this query as the full text of a bug report. In this study we show that highly performing queries can be extracted from the bug report text, in order to make TR effective even without the aforementioned positive biases. Further, we analyze the provenance of terms in these highly performing queries to drive future work in automatic query extraction from bug reports.
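The TR setup described in the abstract (index the source code, then query the index with text from a bug report) can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a plain TF-IDF model with cosine similarity, and the corpus, file names, bug report, and extracted query are all hypothetical. It ranks the same files for the full report text and for a shorter query drawn from that text.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase, split on non-letters, and split camelCase identifiers."""
    spaced = re.sub(r"([a-z])([A-Z])", r"\1 \2", text)
    return [t for t in re.split(r"[^A-Za-z]+", spaced.lower()) if t]


def build_index(documents):
    """Build TF-IDF vectors and an IDF table for a {file name: text} corpus."""
    term_counts = {name: Counter(tokenize(text)) for name, text in documents.items()}
    doc_freq = Counter()
    for counts in term_counts.values():
        doc_freq.update(counts.keys())
    n_docs = len(documents)
    # Smoothed IDF (an assumption of this sketch, chosen to avoid zero weights).
    idf = {term: math.log((1 + n_docs) / (1 + df)) + 1.0 for term, df in doc_freq.items()}
    vectors = {
        name: {term: tf * idf[term] for term, tf in counts.items()}
        for name, counts in term_counts.items()
    }
    return vectors, idf


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank(query, vectors, idf):
    """Return file names ordered from most to least similar to the query."""
    counts = Counter(tokenize(query))
    q_vec = {t: tf * idf[t] for t, tf in counts.items() if t in idf}
    scores = {name: cosine(q_vec, vec) for name, vec in vectors.items()}
    return sorted(scores, key=lambda name: (-scores[name], name))


# Hypothetical three-file corpus, bug report, and extracted query.
corpus = {
    "CartService.java": "class CartService { void addItem(Item item) { items.add(item); } "
                        "double computeTotal() { return total; } }",
    "LoginController.java": "class LoginController { void login(User user, String password) "
                            "{ session.start(user); } }",
    "ReportPrinter.java": "class ReportPrinter { void printReport(Report report) "
                          "{ printer.print(report); } }",
}
full_report = ("When I login as a user and print the report for my order, "
               "the cart total is wrong after I add an item.")
extracted_query = "cart total add item"

vectors, idf = build_index(corpus)
print("full report:    ", rank(full_report, vectors, idf))
print("extracted query:", rank(extracted_query, vectors, idf))
```

In this toy corpus the full report shares vocabulary with every file, while the extracted query shares terms only with the cart code. How much the two rankings differ on a real system depends on the corpus and the retrieval model.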
Pages: 3086 - 3127
Page count: 41
Related papers
50 in total
  • [31] On the Effectiveness of Information Retrieval Based Bug Localization for C Programs
    Saha, Ripon K.
    Lawall, Julia
    Khurshid, Sarfraz
    Perry, Dewayne E.
    2014 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME), 2014, : 161 - 170
  • [32] SBugLocater: Bug Localization Based on Deep Matching and Information Retrieval
    Huang, Xuxiang
    Xiang, Chen
    Li, Hua
    He, Peng
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [33] Information Retrieval and Spectrum Based Bug Localization: Better Together
    Le, Tien-Duy B.
    Oentaryo, Richard J.
    Lo, David
    2015 10TH JOINT MEETING OF THE EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND THE ACM SIGSOFT SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING (ESEC/FSE 2015) PROCEEDINGS, 2015, : 579 - 590
  • [34] Automatic Classification of Bug Reports Based on Multiple Text Information and Reports' Intention
    Meng, Fanqi
    Wang, Xuesong
    Wang, Jingdong
    Wang, Peifang
    THEORETICAL ASPECTS OF SOFTWARE ENGINEERING, TASE 2022, 2022, 13299 : 131 - 147
  • [35] Bug Localization Using Revision Log Analysis and Open Bug Repository Text Categorization
    Moin, Amir H.
    Khansari, Mohammad
    OPEN SOURCE SOFTWARE: NEW HORIZONS, 2010, 319 : 188 - 199
  • [36] Information Retrieval Based Bug Localization: Research Problem, Progress, and Challenges
    Guo Z.-Q.
    Zhou H.-C.
    Liu S.-R.
    Li Y.-H.
    Chen L.
    Zhou Y.-M.
    Xu B.-W.
    Li, Yan-Hui (yanhuili@nju.edu.cn); Zhou, Yu-Ming (zhouyuming@nju.edu.cn), 1600, Chinese Academy of Sciences (31): : 2826 - 2854
  • [37] Research Progress on Software Bug Localization Technology Based on Information Retrieval
    Zhang Y.
    Liu J.-K.
    Xia X.
    Wu M.-H.
    Yan H.
    Xia, Xin (Xin.Xia@monash.edu), 1600, Chinese Academy of Sciences (31): : 2432 - 2452
  • [38] Improving Bug Localization using Structured Information Retrieval
    Saha, Ripon K.
    Lease, Matthew
    Khurshid, Sarfraz
    Perry, Dewayne E.
    2013 28TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING (ASE), 2013, : 345 - 355
  • [39] Bug Localization with Combination of Deep Learning and Information Retrieval
    An Ngoc Lam
    Anh Tuan Nguyen
    Hoan Anh Nguyen
    Nguyen, Tien N.
    2017 IEEE/ACM 25TH INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION (ICPC), 2017, : 218 - 229
  • [40] The forgotten role of search queries in IR-based bug localization: an empirical study
    Mohammad Masudur Rahman
    Foutse Khomh
    Shamima Yeasmin
    Chanchal K. Roy
    Empirical Software Engineering, 2021, 26