Language-Aware Multilingual Machine Translation with Self-Supervised Learning

Cited by: 0
Authors
Xu, Haoran [1 ]
Maillard, Jean [2 ]
Goswami, Vedanuj [2 ]
Affiliations
[1] Johns Hopkins Univ, Baltimore, MD 21218 USA
[2] Meta AI, New York, NY USA
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Multilingual machine translation (MMT) benefits from cross-lingual transfer but is a challenging multitask optimization problem. This is partly because there is no clear framework to systematically learn language-specific parameters. Self-supervised learning (SSL) approaches that leverage large quantities of monolingual data (where parallel data is unavailable) have shown promise by improving translation performance as complementary tasks to the MMT task. However, jointly optimizing SSL and MMT tasks is even more challenging. In this work, we first investigate how to utilize intra-distillation to learn more language-specific parameters and then show the importance of these language-specific parameters. Next, we propose a novel but simple SSL task, concurrent denoising, that co-trains with the MMT task by concurrently denoising monolingual data on both the encoder and decoder. Finally, we apply intra-distillation to this co-training approach. Combining these two approaches significantly improves MMT performance, outperforming three state-of-the-art SSL methods by a large margin, e.g., 11.3% and 3.7% improvement on an 8-language and a 15-language benchmark compared with MASS, respectively.
Pages: 526-539
Page count: 14