DolphCoder: Echo-Locating Code Large Language Models with Diverse and Multi-Objective Instruction Tuning

被引:0
|
作者
Wang, Yejie [1 ]
He, Keqing [2 ]
Dong, Guanting [1 ]
Wang, Pei [1 ]
Zeng, Weihao [1 ]
Diao, Muxi [1 ]
Zhang, Mengdi [2 ]
Wang, Jingang [2 ]
Cai, Xunliang [2 ]
Xu, Weiran [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
[2] Meituan, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Code Large Language Models (Code LLMs) have demonstrated outstanding performance in code-related tasks. Several instruction tuning approaches have been proposed to boost the code generation performance of pre-trained Code LLMs. In this paper, we introduce a diverse instruction model (DolphCoder) with self-evaluating for code generation. It learns diverse instruction targets and combines a code evaluation objective to enhance its code generation ability. Our model achieves superior performance on the HumanEval and MBPP benchmarks, demonstrating new insights for future code instruction tuning work. Our key findings are: (1) Augmenting more diverse responses with distinct reasoning paths increases the code capability of LLMs. (2) Improving one's ability to evaluate the correctness of code solutions also enhances their ability to create it. Our code is released at https://github.com/prisnlp/DolphCoder.
引用
收藏
页码:4706 / 4721
页数:16
相关论文
共 50 条
  • [21] Multi-stage guided code generation for Large Language Models
    Han, Yewei
    Lyu, Chen
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 139
  • [22] A Comparative Analysis of Instruction Fine-Tuning Large Language Models for Financial Text Classification
    Fatemi, Sorouralsadat
    Hu, Yuheng
    Mousavi, Maryam
    ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS, 2025, 16 (01)
  • [23] CommonIT: Commonality-Aware Instruction Tuning for Large Language Models via Data Partitions
    Rao, Jun
    Liu, Xuebo
    Lian, Lian
    Cheng, Shengjun
    Liao, Yunjie
    Zhang, Min
    EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference, 2024, : 10064 - 10083
  • [24] JMedLoRA:Medical Domain Adaptation on Japanese Large Language Models using Instruction-tuning
    Sukeda, Issey
    Suzuki, Masahiro
    Kodera, Satoshi
    Sakaji, Hiroki
    arXiv, 2023,
  • [25] An Empirical Study on Fine-tuning Large Language Models of Code for Automated Program Repair
    Huang, Kai
    Meng, Xiangxin
    Zhang, Jian
    Liu, Yang
    Wang, Wenjie
    Li, Shuhao
    Zhang, Yuqing
    2023 38TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, ASE, 2023, : 1162 - 1174
  • [26] Fine-Tuning Large Language Models to Improve Accuracy and Comprehensibility of Automated Code Review
    Yu, Yongda
    Rong, Guoping
    Shen, Haifeng
    Zhang, He
    Shao, Dong
    Wang, Min
    Wei, Zhao
    Xu, Yong
    Wang, Juhong
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2025, 34 (01)
  • [27] Enhancing Visual Information Extraction with Large Language Models Through Layout-Aware Instruction Tuning
    Li, Teng
    Wang, Jiapeng
    Jin, Lianwen
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VII, 2025, 15037 : 276 - 289
  • [28] InstructGraph: Boosting Large Language Models via Graph-centric Instruction Tuning and Preference Alignment
    Wang, Jianing
    Wu, Junda
    Hon, Yupeng
    Liu, Yao
    Gao, Ming
    McAuley, Julian
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 13492 - 13510
  • [29] Multi-objective robust PSSs tuning in large scale power system using particle swarm optimisation
    Safari, Amin
    INTERNATIONAL JOURNAL OF MODELLING IDENTIFICATION AND CONTROL, 2014, 21 (02) : 202 - 210
  • [30] Hybridization of multi-objective evolutionary algorithms and fuzzy control for automated construction, tuning, and analysis of neuronal models
    Parth Patel
    Myles Johnson-Gray
    Emlyne Forren
    Atish Malik
    Tomasz G Smolinski
    BMC Neuroscience, 14 (Suppl 1)