Cross-Lingual Consistency of Factual Knowledge in Multilingual Language Models

被引:0
|
作者
Qi, Jirui [1 ]
Fernandez, Raquel [2 ]
Bisazza, Arianna [1 ]
机构
[1] Univ Groningen, Ctr Language & Cognit, Groningen, Netherlands
[2] Univ Amsterdam, Inst Log Language & Computat, Amsterdam, Netherlands
基金
欧洲研究理事会; 荷兰研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multilingual large-scale Pretrained Language Models (PLMs) have been shown to store considerable amounts of factual knowledge, but large variations are observed across languages. With the ultimate goal of ensuring that users with different language backgrounds obtain consistent feedback from the same model, we study the cross-lingual consistency (CLC) of factual knowledge in various multilingual PLMs. To this end, we propose a Ranking-based Consistency (RankC) metric to evaluate knowledge consistency across languages independently from accuracy. Using this metric, we conduct an in-depth analysis of the determining factors for CLC, both at model level and at language-pair level. Among other results, we find that increasing model size leads to higher factual probing accuracy in most languages, but does not improve cross-lingual consistency. Finally, we conduct a case study on CLC when new factual associations are inserted in the PLMs via model editing. Results on a small sample of facts inserted in English reveal a clear pattern whereby the new piece of knowledge transfers only to languages with which English has a high RankC score.
引用
收藏
页码:10650 / 10666
页数:17
相关论文
共 50 条
  • [21] Probing Cross-Lingual Lexical Knowledge from Multilingual Sentence Encoders
    Vulic, Ivan
    Glavas, Goran
    Liu, Fangyu
    Collier, Nigel
    Ponti, Edoardo Maria
    Korhonen, Anna
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 2089 - 2105
  • [22] Can Watermarks Survive Translation? On the Cross-lingual Consistency of Text Watermark for Large Language Models
    He, Zhiwei
    Zhou, Binglin
    Hao, Hongkun
    Liu, Aiwei
    Wang, Xing
    Tu, Zhaopeng
    Zhang, Zhuosheng
    Wang, Rui
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 4115 - 4129
  • [23] XLM-K: Improving Cross-Lingual Language Model Pre-training with Multilingual Knowledge
    Jiang, Xiaoze
    Liang, Yaobo
    Chen, Weizhu
    Duan, Nan
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 10840 - 10848
  • [24] On the cross-lingual transferability of multilingual prototypical models across NLU tasks
    Cattan, Oralie
    Servan, Christophe
    Rosset, Sophie
    1ST WORKSHOP ON META LEARNING AND ITS APPLICATIONS TO NATURAL LANGUAGE PROCESSING (METANLP 2021), 2021, : 36 - 43
  • [25] A Benchmark Evaluation of Multilingual Large Language Models for Arabic Cross-Lingual Named-Entity Recognition
    Al-Duwais, Mashael
    Al-Khalifa, Hend
    Al-Salman, Abdulmalik
    ELECTRONICS, 2024, 13 (17)
  • [26] Code-switching finetuning: Bridging multilingual pretrained language models for enhanced cross-lingual performance
    Zan, Changtong
    Ding, Liang
    Shen, Li
    Cao, Yu
    Liu, Weifeng
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 139
  • [27] Isotropic Representation Can Improve Zero-Shot Cross-Lingual Transfer on Multilingual Language Models
    Ji, Yixin
    Wang, Jikai
    Li, Juntao
    Yee, Hai
    Zhang, Min
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 8104 - 8118
  • [28] cViL: Cross-Lingual Training of Vision-Language Models using Knowledge Distillation
    Gupta, Kshitij
    Gautam, Devansh
    Mamidi, Radhika
    Proceedings - International Conference on Pattern Recognition, 2022, 2022-August : 1734 - 1741
  • [29] cViL: Cross-Lingual Training of Vision-Language Models using Knowledge Distillation
    Gupta, Kshitij
    Gautam, Devansh
    Mamidi, Radhika
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1734 - 1741
  • [30] On cross-lingual retrieval with multilingual text encoders
    Litschko, Robert
    Vulic, Ivan
    Ponzetto, Simone Paolo
    Glavas, Goran
    INFORMATION RETRIEVAL JOURNAL, 2022, 25 (02): : 149 - 183