Language-Aware Multilingual Machine Translation with Self-Supervised Learning

被引:0
|
作者
Xu, Haoran [1 ]
Maillard, Jean [2 ]
Goswami, Vedanuj [2 ]
机构
[1] Johns Hopkins Univ, Baltimore, MD 21218 USA
[2] Meta AI, New York, NY USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multilingual machine translation (MMT) benefits from cross-lingual transfer but is a challenging multitask optimization problem. This is partly because there is no clear framework to systematically learn language-specific parameters. Self-supervised learning (SSL) approaches that leverage large quantities of monolingual data (where parallel data is unavailable) have shown promise by improving translation performance as complementary tasks to the MMT task. However, jointly optimizing SSL and MMT tasks is even more challenging. In this work, we first investigate how to utilize intra-distillation to learn more language-specific parameters and then show the importance of these language-specific parameters. Next, we propose a novel but simple SSL task, concurrent denoising, that co-trains with the MMT task by concurrently denoising monolingual data on both the encoder and decoder. Finally, we apply intra-distillation to this co-training approach. Combining these two approaches significantly improves MMT performance, outperforming three state-of-the-art SSL methods by a large margin, e.g., 11.3% and 3.7% improvement on an 8-language and a 15-language benchmark compared with MASS, respectively(1).
引用
收藏
页码:526 / 539
页数:14
相关论文
共 50 条
  • [31] Data compression and inference in cosmology with self-supervised machine learning
    Akhmetzhanova, Aizhan
    Mishra-Sharma, Siddharth
    Dvorkin, Cora
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2023, 527 (03) : 7459 - 7481
  • [32] Seismic Blind Deconvolution Based on Self-Supervised Machine Learning
    Yin, Xia
    Xu, Wenhao
    Yang, Zhifang
    Wu, Bangyu
    APPLIED SCIENCES-BASEL, 2024, 14 (12):
  • [33] OBJECT-AWARE SELF-SUPERVISED MULTI-LABEL LEARNING
    Xu Kaixin
    Liu Liyang
    Zhao Ziyuan
    Zeng, Zeng
    Veeravalli, Bharadwaj
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 361 - 365
  • [34] Time Interval Aware Collaborative Sequential Recommendation with Self-supervised Learning
    Ma, Chenrui
    Li, Li
    Chen, Rui
    Li, Xi
    Wang, Yichen
    WEB AND BIG DATA, PT III, APWEB-WAIM 2022, 2023, 13423 : 87 - 101
  • [35] Frequency-Aware Self-Supervised Long-Tailed Learning
    Lin, Ci-Siang
    Chen, Min-Hung
    Wang, Yu-Chiang Frank
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 963 - 972
  • [36] Uncertainty-Aware Self-Supervised Learning of Spatial Perception Tasks
    Nava, Mirko
    Paolillo, Antonio
    Guzzi, Jerome
    Gambardella, Luca Maria
    Giusti, Alessandro
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (04) : 6693 - 6700
  • [37] Context-Aware Self-Supervised Learning of Whole Slide Images
    Aryal M.
    Yahya Soltani N.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (08): : 1 - 10
  • [38] Enhancing Hyperedge Prediction With Context-Aware Self-Supervised Learning
    Ko, Yunyong
    Tong, Hanghang
    Kim, Sang-Wook
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2025, 37 (04) : 1772 - 1784
  • [39] Knowledge-Aware Self-supervised Graph Representation Learning for Recommendation
    Sun, Yeheng
    Zhu, Jinghua
    Xi, Heran
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT IV, 2022, 13532 : 420 - 432
  • [40] Capturing the Physics of MaNGA Galaxies with Self-supervised Machine Learning
    Sarmiento, Regina
    Huertas-Company, Marc
    Knapen, Johan H.
    Sanchez, Sebastian F.
    Dominguez Sanchez, Helena
    Drory, Niv
    Falcon-Barroso, Jesus
    ASTROPHYSICAL JOURNAL, 2021, 921 (02):