Large Language Models for Automated Program Repair

被引:2
|
作者
Ribeiro, Francisco [1 ]
机构
[1] Univ Minho, HASLab INESC TEC, Braga, Portugal
关键词
automated program repair; fault localization; code generation; type systems;
D O I
10.1145/3618305.3623587
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces two methods for automated program repair (APR) utilizing pre-trained language models. The first method demonstrates program repair as a code completion task and is validated on a dataset of Java programs. The second method, Mentat, leverages OCaml's parser and type system as fault localization techniques to generate prompts for GPT-3, producing candidate patches. Evaluation results show promising repair rates, with 27% and 39.2% effectiveness, respectively. For OCaml, a comparative study employing an automated validation strategy is presented in which the technique outperforms other tools. Language models are effective at APR, enhancing bug fixing and freeing developers to focus on other critical aspects of software engineering.
引用
收藏
页码:7 / 9
页数:3
相关论文
共 50 条
  • [41] Mathematical discoveries from program search with large language models
    Romera-Paredes, Bernardino
    Barekatain, Mohammadamin
    Novikov, Alexander
    Balog, Matej
    Kumar, M. Pawan
    Dupont, Emilien
    Ruiz, Francisco J. R.
    Ellenberg, Jordan S.
    Wang, Pengming
    Fawzi, Omar
    Kohli, Pushmeet
    Fawzi, Alhussein
    NATURE, 2024, 625 (7995) : 468 - 475
  • [42] LPR: Large Language Models-Aided Program Reduction
    Zhang, Mengxiao
    Tian, Yongqiang
    Xu, Zhenyang
    Dong, Yiwen
    Tan, Shin Hwei
    Sun, Chengnian
    PROCEEDINGS OF THE 33RD ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, ISSTA 2024, 2024, : 261 - 273
  • [43] Large language models streamline automated machine learning for clinical studies
    Arasteh, Soroosh Tayebi
    Han, Tianyu
    Lotfinia, Mahshad
    Kuhl, Christiane
    Kather, Jakob Nikolas
    Truhn, Daniel
    Nebelung, Sven
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [44] Automated fact-checking of climate claims with large language models
    Leippold, Markus
    Vaghefi, Saeid Ashraf
    Stammbach, Dominik
    Muccione, Veruska
    Bingler, Julia
    Ni, Jingwei
    Senni, Chiara Colesanti
    Wekhof, Tobias
    Schimanski, Tobias
    Gostlow, Glen
    Yu, Tingyu
    Luterbacher, Juerg
    Huggel, Christian
    NPJ CLIMATE ACTION, 2025, 4 (01):
  • [45] PENTESTGPT: Evaluating and Harnessing Large Language Models for Automated Penetration Testing
    Deng, Gelei
    Liu, Yi
    Mayoral-Vilches, Victor
    Liu, Peng
    Li, Yuekang
    Xu, Yuan
    Zhang, Tianwei
    Liu, Yang
    Pinzger, Martin
    Rass, Stefan
    PROCEEDINGS OF THE 33RD USENIX SECURITY SYMPOSIUM, SECURITY 2024, 2024, : 847 - 864
  • [46] Large language models streamline automated machine learning for clinical studies
    Soroosh Tayebi Arasteh
    Tianyu Han
    Mahshad Lotfinia
    Christiane Kuhl
    Jakob Nikolas Kather
    Daniel Truhn
    Sven Nebelung
    Nature Communications, 15
  • [47] Directions Towards Efficient and Automated Data Wrangling with Large Language Models
    Zhang, Zeyu
    Groth, Paul
    Calixto, Iacer
    Schelter, Sebastian
    2024 IEEE 40TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOP, ICDEW, 2024, : 301 - 304
  • [48] Automated Analysis of Algorithm Descriptions Quality, Through Large Language Models
    Sterbini, Andrea
    Temperini, Marco
    GENERATIVE INTELLIGENCE AND INTELLIGENT TUTORING SYSTEMS, PT I, ITS 2024, 2024, 14798 : 258 - 271
  • [49] Improving requirements completeness: automated assistance through large language models
    Dipeeka Luitel
    Shabnam Hassani
    Mehrdad Sabetzadeh
    Requirements Engineering, 2024, 29 : 73 - 95
  • [50] Exploring and Unleashing the Power of Large Language Models in Automated Code Translation
    Yang, Zhen
    Liu, Fang
    Yu, Zhongxing
    Keung, Jacky Wai
    Li, Jia
    Liu, Shuo
    Hong, Yifan
    Ma, Xiaoxue
    Jin, Zhi
    Li, Ge
    arXiv,