KinyaBERT: a Morphology-aware Kinyarwanda Language Model

Authors
Nzeyimana, Antoine [1]
Rubungo, Andre Niyongabo [2]
Affiliations
[1] Univ Massachusetts, Amherst, MA 01003 USA
[2] Univ Politecn Cataluna, Barcelona, Spain
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Pre-trained language models such as BERT have been successful at tackling many natural language processing tasks. However, the unsupervised sub-word tokenization methods commonly used in these models (e.g., byte-pair encoding, BPE) are sub-optimal at handling morphologically rich languages. Even given a morphological analyzer, naive sequencing of morphemes into a standard BERT architecture is inefficient at capturing morphological compositionality and expressing word-relative syntactic regularities. We address these challenges by proposing a simple yet effective two-tier BERT architecture that leverages a morphological analyzer and explicitly represents morphological compositionality. Despite the success of BERT, most of its evaluations have been conducted on high-resource languages, obscuring its applicability to low-resource languages. We evaluate our proposed method on the low-resource, morphologically rich Kinyarwanda language, naming the proposed model architecture KinyaBERT. A robust set of experimental results reveals that KinyaBERT outperforms solid baselines by 2% in F1 score on a named entity recognition task and by 4.3% in average score on a machine-translated GLUE benchmark. KinyaBERT fine-tuning has better convergence and achieves more robust results on multiple tasks even in the presence of translation noise.
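The contrast the abstract draws between naive morpheme sequencing and a two-tier design can be illustrated with a toy sketch. This is not the authors' implementation: the segmentations in `TOY_ANALYZER`, the hash-based `embed` function, and the mean pooling used for the first tier are all assumptions made for illustration, since the abstract does not specify how morphemes are combined.

```python
import hashlib

# Hypothetical toy segmentations standing in for a real morphological analyzer.
TOY_ANALYZER = {
    "abantu": ["a", "ba", "ntu"],
    "tuzabona": ["tu", "za", "bon", "a"],
}

DIM = 4  # toy embedding width (assumption)


def embed(morpheme):
    """Deterministic stand-in for a learned embedding lookup."""
    digest = hashlib.sha256(morpheme.encode("utf-8")).digest()
    return [b / 255.0 for b in digest[:DIM]]


def segment(word):
    """Return the word's morphemes, or the whole word if it is unknown."""
    return TOY_ANALYZER.get(word, [word])


def naive_sequence(words):
    """Flat approach: every morpheme occupies its own sequence position."""
    return [embed(m) for w in words for m in segment(w)]


def two_tier_sequence(words):
    """Tier 1 pools each word's morphemes into one vector (mean pooling here);
    tier 2 would then encode one position per word."""
    word_vectors = []
    for w in words:
        vecs = [embed(m) for m in segment(w)]
        word_vectors.append([sum(col) / len(vecs) for col in zip(*vecs)])
    return word_vectors


words = ["abantu", "tuzabona"]
flat = naive_sequence(words)
tiered = two_tier_sequence(words)
print(len(flat), len(tiered))  # 7 2
```

In this sketch the flat sequence has seven positions while the two-tier sequence has two, one per word, so the sentence-level encoder sees word-aligned inputs while morpheme composition is handled separately.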
Pages: 5347-5363
Number of pages: 17