KALA: Knowledge-Augmented Language Model Adaptation

被引:0
|
作者
Kang, Minki [1 ,2 ]
Baek, Jinheon [1 ]
Hwang, Sung Ju [1 ,2 ]
机构
[1] Korea Adv Inst Sci & Technol, Daejeon, South Korea
[2] AITRICS, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pre-trained language models (PLMs) have achieved remarkable success on various natural language understanding tasks. Simple fine-tuning of PLMs, on the other hand, might be suboptimal for domain-specific tasks because they cannot possibly cover knowledge from all domains. While adaptive pre-training of PLMs can help them obtain domain-specific knowledge, it requires a large training cost. Moreover, adaptive pre-training can harm the PLM's performance on the downstream task by causing catastrophic forgetting of its general knowledge. To overcome such limitations of adaptive pre-training for PLM adaption, we propose a novel domain adaption framework for PLMs coined as Knowledge-Augmented Language model Adaptation (KALA), which modulates the intermediate hidden representations of PLMs with domain knowledge, consisting of entities and their relational facts. We validate the performance of our KALA on question answering and named entity recognition tasks on multiple datasets across various domains. The results show that, despite being computationally efficient, our KALA largely outperforms adaptive pre-training. Code is available at: https://github.com/Nardien/KALA.
引用
收藏
页码:5144 / 5167
页数:24
相关论文
共 50 条
  • [1] Thai Knowledge-Augmented Language Model Adaptation (ThaiKALA)
    Ruangchutiphophan, Pavaris
    Saetia, Chanatip
    Ayutthaya, Thititorn Seneewong Na
    Chalothorn, Tawunrat
    2023 18TH INTERNATIONAL JOINT SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND NATURAL LANGUAGE PROCESSING, ISAI-NLP, 2023,
  • [2] Knowledge-Augmented Language Model Verification
    Baek, Jinheon
    Jeong, Soyeong
    Kang, Minki
    Park, Jong C.
    Hwang, Sung Ju
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 1720 - 1736
  • [3] A Novel Knowledge-augmented Model Customization Approach for Arabic Offensive Language Detection
    Husain, Fatemah
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (12)
  • [4] Knowledge-Augmented Visual Question Answering With Natural Language Explanation
    Xie, Jiayuan
    Cai, Yi
    Chen, Jiali
    Xu, Ruohang
    Wang, Jiexin
    Li, Qing
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 2652 - 2664
  • [5] A knowledge-augmented neural network model for sarcasm detection
    Ren, Yafeng
    Wang, Zilin
    Peng, Qiong
    Ji, Donghong
    INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (06)
  • [6] Knowledge-Augmented Language Model and Its Application to Unsupervised Named-Entity Recognition
    Liu, Angli
    Du, Jingfei
    Stoyanov, Veselin
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 1142 - 1150
  • [7] Knowledge-Augmented Visual Question Answering With Natural Language Explanation
    Xie, Jiayuan
    Cai, Yi
    Chen, Jiali
    Xu, Ruohang
    Wang, Jiexin
    Li, Qing
    IEEE Transactions on Image Processing, 2024, 33 : 2652 - 2664
  • [8] The Second Workshop on Knowledge-Augmented Methods for Natural Language Processing
    Yu, Wenhao
    Tong, Lingbo
    Shi, Weijia
    Peng, Nanyun
    Jiang, Meng
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 5899 - 5900
  • [9] Knowledge-Augmented Language Models for Cause-Effect Relation Classification
    Hosseini, Pedram
    Broniatowski, David A.
    Diab, Mona
    PROCEEDINGS OF THE FIRST WORKSHOP ON COMMONSENSE REPRESENTATION AND REASONING (CSRR 2022), 2022, : 43 - 48
  • [10] Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-Intensive Tasks
    Kang, Minki
    Lee, Seanie
    Baek, Jinheon
    Kawaguchi, Kenji
    Hwang, Sung Ju
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,