Understanding and Mitigating the Uncertainty in Zero-Shot Translation

被引:0
|
作者
Wang, Wenxuan [1 ]
Jiao, Wenxiang [2 ]
Wang, Shuo [3 ]
Tu, Zhaopeng [2 ]
Lyu, Michael R. [1 ]
机构
[1] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong 999077, Peoples R China
[2] Tencent AI Lab, Shenzhen 518057, Peoples R China
[3] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
关键词
Uncertainty; Data models; Training; Training data; Predictive models; Computational modeling; Transformers; Vocabulary; Speech processing; Neural machine translation; zero-shot translation; uncertainty;
D O I
10.1109/TASLP.2024.3485555
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Zero-shottranslation is a promising direction for building a comprehensive multilingual neural machine translation (MNMT) system. However, its quality is still not satisfactory due to off-target issues. In this paper, we aim to understand and alleviate the off-target issues from the perspective of uncertainty in zero-shot translation. By carefully examining the translation output and model confidence, we identify two uncertainties that are responsible for the off-target issues, namely, extrinsic data uncertainty and intrinsic model uncertainty. Based on the observations, we propose two lightweight and complementary approaches to denoise the training data for model training and explicitly penalize the off-target translations by unlikelihood training during model training. Extensive experiments on both balanced and imbalanced datasets show that our approaches significantly improve the performance of zero-shot translation over strong MNMT baselines.
引用
收藏
页码:4894 / 4904
页数:11
相关论文
共 50 条
  • [21] Uncertainty-Aware Learning for Zero-Shot Semantic Segmentation
    Hu, Ping
    Sclaroff, Stan
    Saenko, Kate
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [22] A Unified Approach for Conventional Zero-Shot, Generalized Zero-Shot, and Few-Shot Learning
    Rahman, Shafin
    Khan, Salman
    Porikli, Fatih
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (11) : 5652 - 5667
  • [23] Pruning Residual Networks in Multilingual Neural Machine Translation to Improve Zero-Shot Translation
    Lu, Kaiwen
    Yang, Yating
    Dong, Rui
    Ma, Bo
    Wang, Lei
    Zhou, Xi
    Ahmat, Ahtamjan
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT III, NLPCC 2024, 2025, 15361 : 280 - 292
  • [24] Zero-Shot Translation of Attention Patterns in VQA Models to Natural Language
    Salewski, Leonard
    Koepke, A. Sophia
    Lensch, Hendrik P. A.
    Akata, Zeynep
    PATTERN RECOGNITION, DAGM GCPR 2023, 2024, 14264 : 378 - 393
  • [25] Zero-Shot Commonsense Question Answering with Cloze Translation and Consistency Optimization
    Dou, Zi-Yi
    Peng, Nanyun
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 10572 - 10580
  • [26] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
    Yang, Shuai
    Zhou, Yifan
    Liu, Ziwei
    Loy, Chen Change
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 8703 - 8712
  • [27] Entropy-Based Uncertainty Calibration for Generalized Zero-Shot Learning
    Chen, Zhi
    Huang, Zi
    Li, Jingjing
    Zhang, Zheng
    DATABASES THEORY AND APPLICATIONS (ADC 2021), 2021, 12610 : 139 - 151
  • [28] Zero-Shot Reinforcement Learning on Graphs for Autonomous Exploration Under Uncertainty
    Chen, Fanfei
    Szenher, Paul
    Huang, Yewei
    Wang, Jinkun
    Shan, Tixiao
    Bai, Shi
    Englot, Brendan
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 5193 - 5199
  • [29] Zero-Shot Information Extraction as a Unified Text-to-Triple Translation
    Wang, Chenguang
    Liu, Xiao
    Chen, Zui
    Hong, Haoyun
    Tang, Jie
    Song, Dawn
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 1225 - 1238
  • [30] Exploring the Impact of Layer Normalization for Zero-shot Neural Machine Translation
    Mao, Zhuoyuan
    Dabre, Raj
    Liu, Qianying
    Song, Haiyue
    Chu, Chenhui
    Kurohashi, Sadao
    61ST CONFERENCE OF THE THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 2, 2023, : 1300 - 1316