Thai Knowledge-Augmented Language Model Adaptation (ThaiKALA)

被引:0
|
作者
Ruangchutiphophan, Pavaris [1 ]
Saetia, Chanatip [1 ]
Ayutthaya, Thititorn Seneewong Na [1 ]
Chalothorn, Tawunrat [1 ]
机构
[1] Kasikorn Business Technol Grp, Bangkok, Thailand
来源
2023 18TH INTERNATIONAL JOINT SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND NATURAL LANGUAGE PROCESSING, ISAI-NLP | 2023年
关键词
Knowledge-Augmented; Language Model; Question Answering;
D O I
10.1109/iSAI-NLP60301.2023.10355001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large language models have exhibited considerable prowess in diverse NLP tasks, demonstrating promising performance. However, they still have limitations in effectively capturing domain-specific knowledge and contextually-relevant information, resulting in hallucination issues. To address these challenges, This paper presents ThaiKALA, a framework designed for the Thai language to augment domain-specific knowledge into the language model. The framework utilizes three modules to handle Thai language specifically: event extraction, a self-defined ID database, and a multilingual language model. To confirm the performance, the framework is also evaluated with strong generative baselines like GPT-3 and GPT-3.5-turbo-16k. As a result, ThaiKALA, with only Entity Memory, outperforms all baselines including GPT-3 and GPT-3.5 in extractive Question Answering (EQA) tasks, achieving a higher exact match (42.48%) and competitive F1 scores (67.07%). These results demonstrate that ThaiKALA is effective in enhancing the language model's performance on Thai extractive QA by augmenting the extracted knowledge.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Knowledge-augmented Graph Machine Learning for Drug Discovery: From Precision to Interpretability
    Zhong, Zhiqiang
    Mottin, Davide
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 5841 - 5842
  • [22] Contrastive knowledge-augmented self-distillation approach for few-shot learning
    Zhang, Lixu
    Shao, Mingwen
    Chen, Sijie
    Liu, Fukang
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (05)
  • [23] Knowledge-Augmented Interpretable Network for Zero-Shot Stance Detection on Social Media
    Zhang, Bowen
    Ding, Daijun
    Huang, Zhichao
    Li, Ang
    Li, Yangyang
    Zhang, Baoquan
    Huang, Hu
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, : 1 - 12
  • [24] Prior Knowledge-Augmented Meta-Learning for Fine-Grained Fault Diagnosis
    Zhou, Yuhang
    Zhang, Qiang
    Huang, Ting
    Cai, Zhengyang
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (06) : 8115 - 8124
  • [25] View-Based Knowledge-Augmented Multimodal Semantic Understanding for Optical Remote Sensing Images
    Zhu, Lilu
    Su, Xiaolu
    Tang, Jiaxuan
    Hu, Yanfeng
    Wang, Yang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [26] Online Temporal Language Model Adaptation for a Thai Broadcast News Transcription System
    Saykham, Kwanchiva
    Chotimongkol, Ananlada
    Wutiwiwatchai, Chai
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 1690 - 1694
  • [27] EIKA: Explicit & Implicit Knowledge-Augmented Network for entity-aware sports video captioning
    Xi, Zeyu
    Shi, Ge
    Sun, Haoying
    Zhang, Bowen
    Li, Shuyi
    Wu, Lifang
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 274
  • [28] A knowledge-augmented heterogeneous graph convolutional network for aspect-level multimodal sentiment analysis
    Yujie, Wan
    Yuzhong, Chen
    Jiali, Lin
    Jiayuan, Zhong
    Chen, Dong
    COMPUTER SPEECH AND LANGUAGE, 2024, 85
  • [29] KaTaGCN: Knowledge-Augmented and Time-Aware Graph Convolutional Network for efficient traffic forecasting
    Wang, Yuyan
    Hu, Jie
    Teng, Fei
    Peng, Lilan
    Du, Shengdong
    Li, Tianrui
    INFORMATION FUSION, 2024, 111
  • [30] Analyzing Surveillance Videos using automatically generated processing sequences with Knowledge-Augmented Genetic Algorithms
    Samarabandu, Jagath
    Ranaweera, Kamal
    2016 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2016,