On the relationship between bug reports and queries for text retrieval-based bug localization

被引:0
|
作者
Chris Mills
Esteban Parra
Jevgenija Pantiuchina
Gabriele Bavota
Sonia Haiduc
机构
[1] Florida State University,
[2] Università della Svizzera italiana,undefined
来源
关键词
Bug localization; Query formulation; Text retrieval;
D O I
暂无
中图分类号
学科分类号
摘要
As societal dependence on software continues to grow, bugs are becoming increasingly costly in terms of financial resources as well as human safety. Bug localization is the process by which a developer identifies buggy code that needs to be fixed to make a system safer and more reliable. Unfortunately, manually attempting to locate bugs solely from the information in a bug report requires advanced knowledge of how a system is constructed and the way its constituent pieces interact. Therefore, previous work has investigated numerous techniques for reducing the human effort spent in bug localization. One of the most common approaches is Text Retrieval (TR) in which a system’s source code is indexed into a search space that is then queried for code relevant to a given bug report. In the last decade, dozens of papers have proposed improvements to bug localization using TR with largely positive results. However, several other studies have called the technique into question. According to these studies, evaluations of TR-based approaches often lack sufficient controls on biases that artificially inflate the results, namely: misclassified bugs, tangled commits, and localization hints. Here we argue that contemporary evaluations of TR approaches also include a negative bias that outweighs the previously identified positive biases: while TR approaches expect a natural language query, most evaluations simply formulate this query as the full text of a bug report. In this study we show that highly performing queries can be extracted from the bug report text, in order to make TR effective even without the aforementioned positive biases. Further, we analyze the provenance of terms in these highly performing queries to drive future work in automatic query extraction from bug reports.
引用
收藏
页码:3086 / 3127
页数:41
相关论文
共 50 条
  • [1] On the relationship between bug reports and queries for text retrieval-based bug localization
    Mills, Chris
    Parra, Esteban
    Pantiuchina, Jevgenija
    Bavota, Gabriele
    Haiduc, Sonia
    EMPIRICAL SOFTWARE ENGINEERING, 2020, 25 (05) : 3086 - 3127
  • [2] Are Bug Reports Enough for Text Retrieval-based Bug Localization?
    Mills, Chris
    Pantiuchina, Jevgenija
    Parra, Esteban
    Bavota, Gabriele
    Haiduc, Sonia
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME), 2018, : 381 - 392
  • [3] On the Value of Bug Reports for Retrieval-based Bug Localization
    Lawrie, Dawn
    Binkley, Dave
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME), 2018, : 524 - 528
  • [4] Using Observed Behavior to Reformulate Queries during Text Retrieval-based Bug Localization
    Chaparro, Oscar
    Florez, Juan Manuel
    Marcus, Andrian
    2017 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME), 2017, : 376 - 387
  • [5] Query Quality Prediction for Text Retrieval-based Bug Localization
    Liu, Wenjie
    Zou, Weiqin
    Chen, Bingting
    Cai, Biyu
    Zhang, Jingxuan
    2024 IEEE 24TH INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY, QRS, 2024, : 340 - 351
  • [6] On the Use of Stack Traces to Improve Text Retrieval-based Bug Localization
    Moreno, Laura
    Treadway, John Joseph
    Marcus, Andrian
    Shen, Wuwei
    2014 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME), 2014, : 151 - 160
  • [7] Using bug descriptions to reformulate queries during text-retrieval-based bug localization
    Chaparro, Oscar
    Florez, Juan Manuel
    Marcus, Andrian
    EMPIRICAL SOFTWARE ENGINEERING, 2019, 24 (05) : 2947 - 3007
  • [8] Using bug descriptions to reformulate queries during text-retrieval-based bug localization
    Oscar Chaparro
    Juan Manuel Florez
    Andrian Marcus
    Empirical Software Engineering, 2019, 24 : 2947 - 3007
  • [9] Where Should the Bugs Be Fixed? More Accurate Information Retrieval-Based Bug Localization Based on Bug Reports
    Zhou, Jian
    Zhang, Hongyu
    Lo, David
    2012 34TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE), 2012, : 14 - 24
  • [10] Are Information Retrieval-based Bug Localization Techniques Trustworthy?
    Kim, Misoo
    Lee, Eunseok
    PROCEEDINGS 2018 IEEE/ACM 40TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING - COMPANION (ICSE-COMPANION, 2018, : 248 - 249