A few-shot learning method based on knowledge graph in large language models

被引:0
|
作者
Wang, Feilong [1 ,2 ]
Shi, Donghui [1 ,2 ]
Aguilar, Jose [3 ,4 ,5 ]
Cui, Xinyi [1 ]
机构
[1] Anhui Jianzhu Univ, Sch Elect & Informat Engn, Dept Comp Engn, Hefei 230601, Peoples R China
[2] Mass Spectrometry Key Technol R&D&Clin Applicat An, Hefei 230601, Peoples R China
[3] Univ EAFIT, Grp Invest IDI T, Medellin, Colombia
[4] Univ Los Andes, Ctr Estudios Microelect & Sistemas Distribuidos, Merida, Venezuela
[5] IMDEA Networks Inst, Madrid, Spain
关键词
Large language model; Few-shot learning; Fine-tuning; Knowledge-driven dialog generation; Knowledge graph;
D O I
10.1007/s41060-024-00699-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The emergence of large language models has significantly transformed natural language processing and text generation. Fine-tuning these models for specific domains enables them to generate answers tailored to the unique requirements of those fields, such as in legal or medical domains. However, these models often perform poorly in few-shot scenarios. Herein, the challenges of data scarcity in fine-tuning large language models in low-sample scenarios were addressed by proposing three different KDGI (Knowledge-Driven Dialog Generation Instances) generation strategies, including entity-based KDGI generation, relation-based KDGI generation, and semantic-based multi-level KDGI generation. These strategies aimed to enhance few-shot datasets to address the issue of low fine-tuning metrics caused by insufficient data. Specifically, knowledge graphs were utilized to define the distinct KDGI generation strategies for enhancing few-shot data. Subsequently, these KDGI data were employed to fine-tune the large language model using the P-tuning v2 approach. Through multiple experiments, the effectiveness of the three KDGI generation strategies was validated using BLEU and ROUGE metrics, and the fine-tuning benefits of few-shot learning on large language models were confirmed. To further evaluate the effectiveness of KDGI, additional experiments were conducted, including LoRA-based fine-tuning in the medical domain and comparative studies leveraging Mask Language Model augmentation, back-translation, and noise injection methods. Consequently, the paper proposes a reference method for leveraging knowledge graphs in prompt data engineering, which shows potential in facilitating few-shot learning for fine-tuning large language models.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] A survey of few-shot knowledge graph completion
    Zhang, Chaoqin
    Li, Ting
    Yin, Yifeng
    Ma, Jiangtao
    Gan, Yong
    Zhang, Yanhua
    Qiao, Yaqiong
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (04) : 6127 - 6143
  • [22] ATLAS: Few-shot Learning with Retrieval Augmented Language Models
    Izacard, Gautier
    Lewis, Patrick
    Lomeli, Maria
    Hosseini, Lucas
    Petroni, Fabio
    Schick, Timo
    Dwivedi-Yu, Jane
    Joulin, Armand
    Riedel, Sebastian
    Grave, Edouard
    JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [23] Few-Shot Knowledge Graph Entity Typing
    Zhu, Guozhen
    Zhang, Zhongbao
    Su, Sen
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2022, PT I, 2022, 13280 : 328 - 340
  • [24] Knowledge Graph Reasoning for Few-Shot Problems
    Zhang, Xiaoli
    Guo, Jinhui
    Liang, Kun
    Xu, Gefei
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT IV, ICIC 2024, 2024, 14878 : 187 - 196
  • [25] Learning Meta Soft Prompt for Few-Shot Language Models
    Chien, Jen-Tzung
    Chen, Ming-Yen
    Xue, Jing-Hao
    2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 57 - 62
  • [26] Gaussian Metric Learning for Few-Shot Uncertain Knowledge Graph Completion
    Zhang, Jiatao
    Wu, Tianxing
    Qi, Guilin
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2021), PT I, 2021, 12681 : 256 - 271
  • [27] Knowledge Graph enhanced Multimodal Learning for Few-shot Visual Recognition
    Han, Mengya
    Zhan, Yibing
    Yu, Baosheng
    Luo, Yong
    Du, Bo
    Tao, Dacheng
    2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2022,
  • [28] Meta-Learning Based Few-Shot Link Prediction for Emerging Knowledge Graph
    Zhang, Yu-Feng
    Chen, Wei
    Zhao, Peng-Peng
    Xu, Jia-Jie
    Fang, Jun-Hua
    Zhao, Lei
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2024, 39 (05) : 1058 - 1077
  • [29] Few-Shot Image Classification Method Based on Visual Language Prompt Learning
    Li B.
    Wang X.
    Teng S.
    Lyu X.
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2024, 47 (02): : 11 - 17
  • [30] Few-Shot Learning on Graph Convolutional Network Based on Meta learning
    Liu X.-L.
    Feng L.
    Liao L.-X.
    Gong X.
    Su H.
    Wang J.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2024, 52 (03): : 885 - 897