On the Cross-lingual Transferability of Monolingual Representations

被引:0
|
作者
Artetxe, Mikel [1 ,2 ]
Ruder, Sebastian [2 ]
Yogatama, Dani [2 ]
机构
[1] Univ Basque Country UPV EHU, HiTZ Ctr, Leioa, Spain
[2] DeepMind, London, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
State-of-the-art unsupervised multilingual models (e.g., multilingual BERT) have been shown to generalize in a zero-shot cross-lingual setting. This generalization ability has been attributed to the use of a shared subword vocabulary and joint training across multiple languages giving rise to deep multilingual abstractions. We evaluate this hypothesis by designing an alternative approach that transfers a monolingual model to new languages at the lexical level. More concretely, we first train a transformer-based masked language model on one language, and transfer it to a new language by learning a new embedding matrix with the same masked language modeling objective-freezing parameters of all other layers. This approach does not rely on a shared vocabulary or joint training. However, we show that it is competitive with multilingual BERT on standard cross-lingual classification benchmarks and on a new Cross-lingual Question Answering Dataset (XQuAD). Our results contradict common beliefs of the basis of the generalization ability of multilingual models and suggest that deep monolingual models learn some abstractions that generalize across languages. We also release XQuAD as a more comprehensive cross-lingual benchmark, which comprises 240 paragraphs and 1190 question-answer pairs from SQuAD v1.1 translated into ten languages by professional translators.
引用
收藏
页码:4623 / 4637
页数:15
相关论文
共 50 条
  • [11] A Framework for the Construction of Monolingual and Cross-lingual Word Similarity Datasets
    Camacho-Collados, Jose
    Pilehvar, Mohammad Taher
    Navigli, Roberto
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, 2015, : 1 - 7
  • [12] XNLI: Evaluating Cross-lingual Sentence Representations
    Conneau, Alexis
    Rinott, Ruty
    Lample, Guillaume
    Schwenk, Holger
    Stoyanov, Ves
    Williams, Adina
    Bowman, Samuel R.
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 2475 - 2485
  • [13] Linguistic Resources for Entity Linking Evaluation: from Monolingual to Cross-lingual
    Li, Xuansong
    Strassel, Stephanie M.
    Ji, Heng
    Griffitt, Kira
    Ellis, Joe
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 3098 - 3105
  • [14] Unsupervised Cross-Lingual Information Retrieval Using Monolingual Data Only
    Litschko, Robert
    Glavas, Goran
    Ponzetto, Simone Paolo
    Vulic, Ivan
    ACM/SIGIR PROCEEDINGS 2018, 2018, : 1253 - 1256
  • [15] Multilingual, Cross-lingual, and Monolingual Speech Emotion Recognition on EmoFilm Dataset
    Atmaja, Bagus Tris
    Sasou, Akira
    2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 1019 - 1025
  • [16] TeacherSim: Cross-lingual Machine Translation Evaluation with Monolingual Embedding as Teacher
    Yang, Hao
    Zhang, Min
    Tao, Shimin
    Ma, Miaomiao
    Qin, Ying
    Wei, Daimeng
    2023 25TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY, ICACT, 2023, : 283 - 287
  • [17] On the cross-lingual transferability of multilingual prototypical models across NLU tasks
    Cattan, Oralie
    Servan, Christophe
    Rosset, Sophie
    1ST WORKSHOP ON META LEARNING AND ITS APPLICATIONS TO NATURAL LANGUAGE PROCESSING (METANLP 2021), 2021, : 36 - 43
  • [18] Cross-lingual Dependency Parsing Based on Distributed Representations
    Guo, Jiang
    Che, Wanxiang
    Yarowsky, David
    Wang, Haifeng
    Liu, Ting
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1, 2015, : 1234 - 1244
  • [19] Cross-Lingual Universal Dependency Parsing Only From One Monolingual Treebank
    Sun, Kailai
    Li, Zuchao
    Zhao, Hai
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) : 13393 - 13407
  • [20] Generalized Tuning of Distributional Word Vectors for Monolingual and Cross-Lingual Lexical Entailment
    Glavas, Goran
    Vulic, Ivan
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 4824 - 4830