Thai Knowledge-Augmented Language Model Adaptation (ThaiKALA)

被引:0
|
作者
Ruangchutiphophan, Pavaris [1 ]
Saetia, Chanatip [1 ]
Ayutthaya, Thititorn Seneewong Na [1 ]
Chalothorn, Tawunrat [1 ]
机构
[1] Kasikorn Business Technol Grp, Bangkok, Thailand
关键词
Knowledge-Augmented; Language Model; Question Answering;
D O I
10.1109/iSAI-NLP60301.2023.10355001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large language models have exhibited considerable prowess in diverse NLP tasks, demonstrating promising performance. However, they still have limitations in effectively capturing domain-specific knowledge and contextually-relevant information, resulting in hallucination issues. To address these challenges, This paper presents ThaiKALA, a framework designed for the Thai language to augment domain-specific knowledge into the language model. The framework utilizes three modules to handle Thai language specifically: event extraction, a self-defined ID database, and a multilingual language model. To confirm the performance, the framework is also evaluated with strong generative baselines like GPT-3 and GPT-3.5-turbo-16k. As a result, ThaiKALA, with only Entity Memory, outperforms all baselines including GPT-3 and GPT-3.5 in extractive Question Answering (EQA) tasks, achieving a higher exact match (42.48%) and competitive F1 scores (67.07%). These results demonstrate that ThaiKALA is effective in enhancing the language model's performance on Thai extractive QA by augmenting the extracted knowledge.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] KALA: Knowledge-Augmented Language Model Adaptation
    Kang, Minki
    Baek, Jinheon
    Hwang, Sung Ju
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 5144 - 5167
  • [2] Knowledge-Augmented Language Model Verification
    Baek, Jinheon
    Jeong, Soyeong
    Kang, Minki
    Park, Jong C.
    Hwang, Sung Ju
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 1720 - 1736
  • [3] A Novel Knowledge-augmented Model Customization Approach for Arabic Offensive Language Detection
    Husain, Fatemah
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (12)
  • [4] Knowledge-Augmented Visual Question Answering With Natural Language Explanation
    Xie, Jiayuan
    Cai, Yi
    Chen, Jiali
    Xu, Ruohang
    Wang, Jiexin
    Li, Qing
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 2652 - 2664
  • [5] A knowledge-augmented neural network model for sarcasm detection
    Ren, Yafeng
    Wang, Zilin
    Peng, Qiong
    Ji, Donghong
    INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (06)
  • [6] Knowledge-Augmented Language Model and Its Application to Unsupervised Named-Entity Recognition
    Liu, Angli
    Du, Jingfei
    Stoyanov, Veselin
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 1142 - 1150
  • [7] Knowledge-Augmented Visual Question Answering With Natural Language Explanation
    Xie, Jiayuan
    Cai, Yi
    Chen, Jiali
    Xu, Ruohang
    Wang, Jiexin
    Li, Qing
    IEEE Transactions on Image Processing, 2024, 33 : 2652 - 2664
  • [8] The Second Workshop on Knowledge-Augmented Methods for Natural Language Processing
    Yu, Wenhao
    Tong, Lingbo
    Shi, Weijia
    Peng, Nanyun
    Jiang, Meng
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 5899 - 5900
  • [9] Knowledge-Augmented Language Models for Cause-Effect Relation Classification
    Hosseini, Pedram
    Broniatowski, David A.
    Diab, Mona
    PROCEEDINGS OF THE FIRST WORKSHOP ON COMMONSENSE REPRESENTATION AND REASONING (CSRR 2022), 2022, : 43 - 48
  • [10] Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-Intensive Tasks
    Kang, Minki
    Lee, Seanie
    Baek, Jinheon
    Kawaguchi, Kenji
    Hwang, Sung Ju
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,