Improving Massively Multilingual Neural Machine Translation and Zero-Shot Translation

Cited by: 0
Authors
Zhang, Biao [1]
Williams, Philip [1]
Titov, Ivan [1, 2]
Sennrich, Rico [1, 3]
Affiliations
[1] Univ Edinburgh, Sch Informat, Edinburgh, Midlothian, Scotland
[2] Univ Amsterdam, ILLC, Amsterdam, Netherlands
[3] Univ Zurich, Dept Computat Linguist, Zurich, Switzerland
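Funding
EU Horizon 2020; Swiss National Science Foundation
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract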
Massively multilingual models for neural machine translation (NMT) are theoretically attractive, but often underperform bilingual models and deliver poor zero-shot translations. In this paper, we explore ways to improve them. We argue that multilingual NMT requires stronger modeling capacity to support language pairs with varying typological characteristics, and overcome this bottleneck via language-specific components and deepening NMT architectures. We identify the off-target translation issue (i.e., translating into a wrong target language) as the major source of the inferior zero-shot performance, and propose random online backtranslation to enforce the translation of unseen training language pairs. Experiments on OPUS-100 (a novel multilingual dataset with 100 languages) show that our approach substantially narrows the performance gap with bilingual models in both one-to-many and many-to-many settings, and improves zero-shot performance by approximately 10 BLEU, approaching conventional pivot-based methods.
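The off-target issue described in the abstract (output produced in a language other than the requested one) can be quantified as a simple rate. The sketch below is illustrative only and is not the authors' evaluation code: the function name `off_target_rate` is hypothetical, and it assumes that the language of each system output has already been detected by some external language-identification tool.

```python
from typing import Sequence


def off_target_rate(detected_langs: Sequence[str], requested_langs: Sequence[str]) -> float:
    """Return the fraction of outputs whose detected language differs from the requested one.

    Assumption: `detected_langs` comes from an external language-identification
    step that is not shown here.
    """
    if len(detected_langs) != len(requested_langs):
        raise ValueError("inputs must have the same length")
    if not detected_langs:
        return 0.0
    # Count outputs that landed in the wrong target language.
    wrong = sum(d != r for d, r in zip(detected_langs, requested_langs))
    return wrong / len(detected_langs)


# Made-up example: two of four outputs are in the wrong language.
detected = ["de", "en", "fr", "en"]
requested = ["de", "fr", "fr", "de"]
print(off_target_rate(detected, requested))
```

A lower rate means the model follows the requested target language more reliably. The rate says nothing about translation quality itself, which the abstract reports in BLEU.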
Pages: 1628-1639
Page count: 12
Related papers
50 in total
  • [1] Pruning Residual Networks in Multilingual Neural Machine Translation to Improve Zero-Shot Translation
    Lu, Kaiwen
    Yang, Yating
    Dong, Rui
    Ma, Bo
    Wang, Lei
    Zhou, Xi
    Ahmat, Ahtamjan
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT III, NLPCC 2024, 2025, 15361 : 280 - 292
  • [2] Learn and Consolidate: Continual Adaptation for Zero-Shot and Multilingual Neural Machine Translation
    Huang, Kaiyu
    Li, Peng
    Liu, Junpeng
    Sun, Maosong
    Liu, Yang
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 13938 - 13951
  • [3] On the Off-Target Problem of Zero-Shot Multilingual Neural Machine Translation
    Chen, Liang
    Ma, Shuming
    Zhang, Dongdong
    Wei, Furu
    Chang, Baobao
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 9542 - 9558
  • [4] Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency Regularization
    Gao, Pengzhi
    Zhang, Liwen
    He, Zhongjun
    Wu, Hua
    Wang, Haifeng
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 12103 - 12119
  • [5] An Empirical Investigation of Word Alignment Supervision for Zero-Shot Multilingual Neural Machine Translation
    Raganato, Alessandro
    Vazquez, Raul
    Creutz, Mathias
    Tiedemann, Jorg
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 8449 - 8456
  • [6] Enhancing Zero-Shot Translation in Multilingual Neural Machine Translation: Focusing on Obtaining Location-Agnostic Representations
    Zhang, Jiarui
    Huang, Heyan
    Hu, Yue
    Guo, Ping
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT VII, 2024, 15022 : 194 - 208
  • [7] Consistency by Agreement in Zero-shot Neural Machine Translation
    Al-Shedivat, Maruan
    Parikh, Ankur P.
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 1184 - 1197
  • [8] Monolingual Adapters for Zero-Shot Neural Machine Translation
    Philip, Jerin
    Berard, Alexandre
    Galle, Matthias
    Besacier, Laurent
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 4465 - 4470
  • [9] Massively Multilingual Neural Machine Translation
    Aharoni, Roee
    Johnson, Melvin
    Firat, Orhan
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 3874 - 3884
  • [10] TACKLING DATA SCARCITY IN SPEECH TRANSLATION USING ZERO-SHOT MULTILINGUAL MACHINE TRANSLATION TECHNIQUES
    Tu Anh Dinh
    Liu, Danni
    Niehues, Jan
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6222 - 6226