From Commit Message Generation to History-Aware Commit Message Completion

被引:6
|
作者
Eliseeva, Aleksandra [1 ]
Sokolov, Yaroslav [2 ]
Bogomolov, Egor [3 ]
Golubev, Yaroslav [1 ]
Dig, Danny [4 ]
Bryksin, Timofey [3 ]
机构
[1] JetBrains Res, Belgrade, Serbia
[2] JetBrains, Munich, Germany
[3] JetBrains Res, Paphos, Cyprus
[4] Univ Colorado Boulder, JetBrains Res, Boulder, CO USA
关键词
D O I
10.1109/ASE56229.2023.00078
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Commit messages are crucial to software development, allowing developers to track changes and collaborate effectively. Despite their utility, most commit messages lack important information since writing high-quality commit messages is tedious and time-consuming. The active research on commit message generation (CMG) has not yet led to wide adoption in practice. We argue that if we could shift the focus from commit message generation to commit message completion and use previous commit history as additional context, we could significantly improve the quality and the personal nature of the resulting commit messages. In this paper, we propose and evaluate both of these novel ideas. Since the existing datasets lack historical data, we collect and share a novel dataset called CommitChronicle, containing 10.7M commits across 20 programming languages. We use this dataset to evaluate the completion setting and the usefulness of the historical context for state-of-the-art CMG models and GPT3.5-turbo. Our results show that in some contexts, commit message completion shows better results than generation, and that while in general GPT-3.5-turbo performs worse, it shows potential for long and detailed messages. As for the history, the results show that historical information improves the performance of CMG models in the generation task, and the performance of GPT-3.5-turbo in both generation and completion.
引用
收藏
页码:723 / 735
页数:13
相关论文
共 50 条
  • [1] KADEL: Knowledge-Aware Denoising Learning for Commit Message Generation
    Tao, Wei
    Zhou, Yucheng
    Wang, Yanlin
    Zhang, Hongyu
    Wang, Haofen
    Zhang, Wenqiang
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2024, 33 (05)
  • [2] Commit Message Matters: Investigating Impact and Evolution of Commit Message Quality
    Li, Jiawei
    Ahmed, Iftekhar
    2023 IEEE/ACM 45TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ICSE, 2023, : 806 - 817
  • [3] Evaluating Commit Message Generation: To BLEU Or Not To BLEU?
    Dey, Samanta
    Vinayakarao, Venkatesh
    Gupta, Monika
    Dechu, Sampath
    2022 ACM/IEEE 44TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: NEW IDEAS AND EMERGING RESULTS (ICSE-NIER 2022), 2022, : 31 - 35
  • [4] Delving into Commit-Issue Correlation to Enhance Commit Message Generation Models
    Wang, Liran
    Tang, Xunzhu
    He, Yichen
    Ren, Changyu
    Shi, Shuhua
    Yan, Chaoran
    Li, Zhoujun
    2023 38TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, ASE, 2023, : 710 - 722
  • [5] Commit Message Generation for Source Code Changes
    Xu, Shengbin
    Yao, Yuan
    Xu, Feng
    Gu, Tianxiao
    Tong, Hanghang
    Lu, Jian
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3975 - 3981
  • [6] Quality Assurance for Automated Commit Message Generation
    Wang, Bei
    Yan, Meng
    Liu, Zhongxin
    Xu, Ling
    Xia, Xin
    Zhang, Xiaohong
    Yang, Dan
    2021 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION AND REENGINEERING (SANER 2021), 2021, : 260 - 271
  • [7] COME: Commit Message Generation with Modification Embedding
    He, Yichen
    Wang, Liran
    Wang, Kaiyi
    Zhang, Yupeng
    Zhang, Hang
    Li, Zhoujun
    PROCEEDINGS OF THE 32ND ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, ISSTA 2023, 2023, : 792 - 803
  • [8] Context-aware Retrieval-based Deep Commit Message Generation
    Wang, Haoye
    Xia, Xin
    Lo, David
    He, Qiang
    Wang, Xinyu
    Grundy, John
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2021, 30 (04)
  • [9] Revisiting Learning-based Commit Message Generation
    Dong, Jinhao
    Lou, Yiling
    Hao, Dan
    Tan, Lin
    2023 IEEE/ACM 45TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ICSE, 2023, : 794 - 805
  • [10] On the Evaluation of Commit Message Generation Models: An Experimental Study
    Tao, Wei
    Wang, Yanlin
    Shi, Ensheng
    Du, Lun
    Han, Shi
    Zhang, Hongyu
    Zhang, Dongmei
    Zhang, Wenqiang
    2021 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME 2021), 2021, : 126 - 136