KALA: Knowledge-Augmented Language Model Adaptation

Cited by: 0
Authors
Kang, Minki [1, 2]
Baek, Jinheon [1]
Hwang, Sung Ju [1, 2]
Affiliations
[1] Korea Adv Inst Sci & Technol, Daejeon, South Korea
[2] AITRICS, Seoul, South Korea
Source
NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES | 2022
Funding
National Research Foundation, Singapore;
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Pre-trained language models (PLMs) have achieved remarkable success on various natural language understanding tasks. Simple fine-tuning of PLMs, on the other hand, might be suboptimal for domain-specific tasks because they cannot possibly cover knowledge from all domains. While adaptive pre-training of PLMs can help them obtain domain-specific knowledge, it requires a large training cost. Moreover, adaptive pre-training can harm the PLM's performance on the downstream task by causing catastrophic forgetting of its general knowledge. To overcome such limitations of adaptive pre-training for PLM adaptation, we propose a novel domain adaptation framework for PLMs coined as Knowledge-Augmented Language model Adaptation (KALA), which modulates the intermediate hidden representations of PLMs with domain knowledge, consisting of entities and their relational facts. We validate the performance of our KALA on question answering and named entity recognition tasks on multiple datasets across various domains. The results show that, despite being computationally efficient, our KALA largely outperforms adaptive pre-training. Code is available at: https://github.com/Nardien/KALA.
Pages: 5144 - 5167
Number of pages: 24
Related papers
50 in total
  • [31] Prior Knowledge-Augmented Broad Reinforcement Learning Framework for Fault Diagnosis of Heterogeneous Multiagent Systems
    Guo, Li
    Ren, Yiran
    Li, Runze
    Jiang, Bin
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (01) : 115 - 123
  • [32] Knowledge-Augmented Deep Learning for Segmenting and Detecting Cerebral Aneurysms With CT Angiography: A Multicenter Study
    Wei, Jianyong
    Song, Xinyu
    Wei, Xiaoer
    Yang, Zhiwen
    Dai, Lisong
    Wang, Mengfei
    Sun, Zheng
    Jin, Yidong
    Ma, Chune
    Hu, Chunhong
    Xie, Xueqian
    Yang, Zhenghan
    Zhang, Yonggao
    Lv, Fajin
    Lu, Jie
    Zhu, Yueqi
    Li, Yuehua
    RADIOLOGY, 2024, 312 (02)
  • [33] Prior knowledge-augmented unsupervised shapelet learning for unknown abnormal working condition discovery in industrial process
    Wan, Xiaoxue
    Cen, Lihui
    Chen, Xiaofang
    Xie, Yongfang
    Gui, Weihua
    ADVANCED ENGINEERING INFORMATICS, 2024, 60
  • [34] Knowledge-augmented face perception: Prospects for the Bayesian brain-framework to align AI and human vision
    Maier, Martin
    Blume, Florian
    Bideau, Pia
    Hellwich, Olaf
    Rahman, Rasha Abdel
    CONSCIOUSNESS AND COGNITION, 2022, 101
  • [35] EXTERNAL KNOWLEDGE AUGMENTED POLYPHONE DISAMBIGUATION USING LARGE LANGUAGE MODEL
    Li, Chen
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW 2024, 2024 : 520 - 524
  • [36] Augmented language model with deep learning adaptation on sentiment analysis for E-learning recommendation
    Alatrash, Rawaa
    Priyadarshini, Rojalina
    Ezaldeen, Hadi
    Alhinnawi, Akram
    COGNITIVE SYSTEMS RESEARCH, 2022, 75 : 53 - 69
  • [37] Knowledge-Augmented Contrastive Learning for Abnormality Classification and Localization in Chest X-rays with Radiomics using a Feedback Loop
    Han, Yan
    Chen, Chongyan
    Tewfik, Ahmed
    Glicksberg, Benjamin
    Ding, Ying
    Peng, Yifan
    Wang, Zhangyang
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022 : 1789 - 1798
  • [38] KAT: A Knowledge Augmented Transformer for Vision-and-Language
    Gui, Liangke
    Wang, Borui
    Huang, Qiuyuan
    Hauptmann, Alexander
    Bisk, Yonatan
    Gao, Jianfeng
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022 : 956 - 968
  • [39] Knowledge Augmented Inference Network for Natural Language Inference
    Jiang, Shan
    Li, Bohan
    Liu, Chunhua
    Yu, Dong
    KNOWLEDGE GRAPH AND SEMANTIC COMPUTING: KNOWLEDGE COMPUTING AND LANGUAGE UNDERSTANDING (CCKS 2018), 2019, 957 : 129 - 135
  • [40] Prior Knowledge-Augmented Self-Supervised Feature Learning for Few-Shot Intelligent Fault Diagnosis of Machines
    Zhang, Tianci
    Chen, Jinglong
    He, Shuilong
    Zhou, Zitong
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2022, 69 (10) : 10573 - 10584