Overfitting in Semantics-based Automated Program Repair

被引:14
|
作者
Le, Xuan-Bach D. [1 ]
Thung, Ferdian [1 ]
Lo, David [1 ]
Le Goues, Claire [2 ]
机构
[1] Singapore Management Univ, Singapore, Singapore
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
关键词
Automated Program Repair; Program Synthesis; Symbolic Execution; Patch Overfitting;
D O I
10.1145/3180155.3182536
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Existing APR techniques can be generally divided into two families: semantics- vs. heuristics-based. Semantics-based APR uses symbolic execution and test suites to extract semantic constraints, and uses program synthesis to synthesize repairs that satisfy the extracted constraints. Heuristic-based APR generates large populations of repair candidates via source manipulation, and searches for the best among them. Both families largely rely on a primary assumption that a program is correctly patched if the generated patch leads the program to pass all provided test cases. Patch correctness is thus an especially pressing concern. A repair technique may generate overfitting patches, which lead a program to pass all existing test cases, but fails to generalize beyond them. In this work, we revisit the overfitting problem with a focus on semantics-based APR techniques, complementing previous studies of the overfitting problem in heuristics-based APR. We perform our study using IntroClass and Codeflaws benchmarks, two datasets well-suited for assessing repair quality, to systematically characterize and understand the nature of overfitting in semantics-based APR. We find that similar to heuristics-based APR, overfitting also occurs in semantics-based APR in various different ways.
引用
收藏
页码:163 / 163
页数:1
相关论文
共 50 条
  • [41] Semantics-based composition of EMBOSS services
    Lamprecht A.-L.
    Naujokat S.
    Margaria T.
    Steffen B.
    Journal of Biomedical Semantics, 2 (Suppl 1)
  • [42] Semantics-based transformation of arithmetic expressions
    Martel, Matthieu
    STATIC ANALYSIS, PROCEEDINGS, 2007, 4634 : 298 - 314
  • [43] A Semantics-Based Measure of Emoji Similarity
    Wijeratne, Sanjaya
    Balasuriya, Lakshika
    Sheth, Amit
    Doran, Derek
    2017 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2017), 2017, : 646 - 653
  • [44] Proceedings of the ACM SIGPLAN Symposium on Partial Evaluation and Semantics-Based Program Manipulation: Foreword
    Gallagher, John
    Voigtländer, Janis
    Proceedings of the ACM SIGPLAN Symposium on Partial Evaluation and Semantics-Based Program Manipulation, 2010,
  • [45] Semantics-based transaction processing for real-time databases: The case of automated stock trading
    Konana, P
    Ram, S
    INFORMS JOURNAL ON COMPUTING, 1999, 11 (03) : 299 - 315
  • [46] Semantics-based protocol for business process transactions
    Kang, Dongwoo
    Lee, Sunjae
    Kim, Kwangsoo
    Lee, Jae Yeol
    SEMANTIC WEB - ASWC 2006, PROCEEDINGS, 2006, 4185 : 343 - 350
  • [47] Semantics-Based Code Search Demonstration Proposal
    Reiss, Steven P.
    2009 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE, CONFERENCE PROCEEDINGS, 2009, : 385 - 386
  • [48] A Semantics-Based Trajectory Segmentation Simplification Method
    Minshi Liu
    Guifang He
    Yi Long
    Journal of Geovisualization and Spatial Analysis, 2021, 5
  • [49] Semantics-based object caching in distributed systems
    Department of Computer Science, University of Minnesota, 200 Union Street SE, Minneapolis, MN 55455, United States
    不详
    IEEE Trans Parallel Distrib Syst, 12 (1750-1764):
  • [50] A Semantics-Based Approach to Retrieving Biomedical Information
    Andreasen, Troels
    Bulskov, Henrik
    Zambach, Sine
    Lassen, Tine
    Madsen, Bodil Nistrup
    Jensen, Per Anker
    Thomsen, Hanne Erdman
    Nilsson, Jorgen Fischer
    FLEXIBLE QUERY ANSWERING SYSTEMS, 2011, 7022 : 108 - +