Prompt Tuning in Code Intelligence: An Experimental Evaluation

被引:6
|
作者
Wang, Chaozheng [1 ]
Yang, Yuanhang [1 ]
Gao, Cuiyun [1 ]
Peng, Yun [2 ]
Zhang, Hongyu [3 ,4 ]
Lyu, Michael R. [2 ]
机构
[1] Harbin Inst Technol, Shenzhen 518055, Peoples R China
[2] Chinese Univ Hong Kong, Hong Kong 999077, Peoples R China
[3] Univ Newcastle, Newcastle, Australia
[4] Chongqing Univ, Chongqing 400044, Peoples R China
基金
中国国家自然科学基金;
关键词
Tuning; Codes; Task analysis; Training; Predictive models; Adaptation models; Source coding; Code intelligence; prompt tuning; empirical study;
D O I
10.1109/TSE.2023.3313881
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Pre-trained models have been shown effective in many code intelligence tasks, such as automatic code summarization and defect prediction. These models are pre-trained on large-scale unlabeled corpus and then fine-tuned in downstream tasks. However, as the inputs to pre-training and downstream tasks are in different forms, it is hard to fully explore the knowledge of pre-trained models. Besides, the performance of fine-tuning strongly relies on the amount of downstream task data, while in practice, the data scarcity scenarios are common. Recent studies in the natural language processing (NLP) field show that prompt tuning, a new paradigm for tuning, alleviates the above issues and achieves promising results in various NLP tasks. In prompt tuning, the prompts inserted during tuning provide task-specific knowledge, which is especially beneficial for tasks with relatively scarce data. In this article, we empirically evaluate the usage and effect of prompt tuning in code intelligence tasks. We conduct prompt tuning on popular pre-trained models CodeBERT and CodeT5 and experiment with four code intelligence tasks including defect prediction, code search, code summarization, and code translation. Our experimental results show that prompt tuning consistently outperforms fine-tuning in all four tasks. In addition, prompt tuning shows great potential in low-resource scenarios, e.g., improving the BLEU scores of fine-tuning by more than 26% on average for code summarization. Our results suggest that instead of fine-tuning, we could adapt prompt tuning for code intelligence tasks to achieve better performance, especially when lacking task-specific data. We also discuss the implications for adapting prompt tuning in code intelligence tasks.
引用
收藏
页码:4869 / 4885
页数:17
相关论文
共 50 条
  • [21] PTE: Prompt tuning with ensemble verbalizers
    Liang, Liheng
    Wang, Guancheng
    Lin, Cong
    Feng, Zhuowen
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 262
  • [22] LION: Implicit Vision Prompt Tuning
    Wang, Haixin
    Chang, Jianlong
    Zhai, Yihang
    Luo, Xiao
    Sun, Jinan
    Lin, Zhouchen
    Tian, Qi
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024, : 5372 - 5380
  • [23] Prompt Tuning in Biomedical Relation Extraction
    He, Jianping
    Li, Fang
    Li, Jianfu
    Hu, Xinyue
    Nian, Yi
    Xiang, Yang
    Wang, Jingqi
    Wei, Qiang
    Li, Yiming
    Xu, Hua
    Tao, Cui
    JOURNAL OF HEALTHCARE INFORMATICS RESEARCH, 2024, 8 (02) : 206 - 224
  • [24] Review of Research on Adapter and Prompt Tuning
    Lin, Lingde
    Liu, Na
    Wang, Zhengan
    Computer Engineering and Applications, 59 (02): : 12 - 21
  • [25] Prompt Tuning in Biomedical Relation Extraction
    Jianping He
    Fang Li
    Jianfu Li
    Xinyue Hu
    Yi Nian
    Yang Xiang
    Jingqi Wang
    Qiang Wei
    Yiming Li
    Hua Xu
    Cui Tao
    Journal of Healthcare Informatics Research, 2024, 8 : 206 - 224
  • [26] Pro-Tuning: Unified Prompt Tuning for Vision Tasks
    Nie, Xing
    Ni, Bolin
    Chang, Jianlong
    Meng, Gaofeng
    Huo, Chunlei
    Xiang, Shiming
    Tian, Qi
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (06) : 4653 - 4667
  • [27] EXPERIMENTAL EVALUATION OF A SELF-TUNING CONTROLLER
    LIM, CM
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 1990, 37 (03) : 193 - 194
  • [28] Experimental evaluation of code properties for WCET analysis
    Colin, A
    Petters, SM
    RTSS 2003: 24TH IEEE INTERNATIONAL REAL-TIME SYSTEMS SYMPOSIUM, PROCEEDINGS, 2003, : 190 - 199
  • [29] Code tuning in context
    Bentley, L
    DR DOBBS JOURNAL, 1999, 24 (05): : 125 - 128
  • [30] G-Prompt: Graphon-based Prompt Tuning for graph classification
    Duan, Yutai
    Liu, Jie
    Chen, Shaowei
    Chen, Liyi
    Wu, Jianhua
    INFORMATION PROCESSING & MANAGEMENT, 2024, 61 (03)