The Unreasonable Volatility of Neural Machine Translation Models

被引:0
|
作者
Fadaee, Marzieh [1 ,2 ]
Monz, Christof [1 ]
机构
[1] Univ Amsterdam, Informat Inst, Amsterdam, Netherlands
[2] Zeta Alpha Vector, Amsterdam, Netherlands
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent works have shown that while Neural Machine Translation (NMT) models achieve impressive performance, questions about understanding the behaviour of these models remain unanswered. We investigate the unexpected volatility of NMT models where the input is semantically and syntactically correct. We discover that with trivial modifications of source sentences, we can identify cases where unexpected changes happen in the translation and in the worst case lead to mistranslations. This volatile behaviour of translating extremely similar sentences in surprisingly different ways highlights the underlying generalization problem of current NMT models. We find that both RNN and Transformer models display volatile behaviour in 26% and 19% of sentence variations, respectively.
引用
收藏
页码:88 / 96
页数:9
相关论文
共 50 条
  • [21] Neural Name Translation Improves Neural Machine Translation
    Li, Xiaoqing
    Yan, Jinghui
    Zhang, Jiajun
    Zong, Chengqing
    MACHINE TRANSLATION, CWMT 2018, 2019, 954 : 93 - 100
  • [22] The Event/Machine of Neural Machine Translation?
    Regnauld, Arnaud
    JOURNAL OF AESTHETICS AND PHENOMENOLOGY, 2022, 9 (02) : 141 - 154
  • [23] Compressing Neural Machine Translation Models with 4-bit Precision
    Aji, Alham Fikri
    Heafield, Kenneth
    NEURAL GENERATION AND TRANSLATION, 2020, : 35 - 42
  • [24] Neural Machine Translation Models with Attention-Based Dropout Layer
    Israr, Huma
    Khan, Safdar Abbas
    Tahir, Muhammad Ali
    Shahzad, Muhammad Khuram
    Ahmad, Muneer
    Zain, Jasni Mohamad
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 75 (02): : 2981 - 3009
  • [25] What do Neural Machine Translation Models Learn about Morphology?
    Belinkov, Yonatan
    Durrani, Nadir
    Dalvi, Fahim
    Sajjad, Hassan
    Glass, James
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 861 - 872
  • [26] Interactive Multi-System Machine Translation with Neural Language Models
    Rikters, Matiss
    DATABASES AND INFORMATION SYSTEMS IX, 2016, 291 : 269 - 280
  • [27] Deep Fusing Pre-trained Models into Neural Machine Translation
    Weng, Rongxiang
    Yu, Heng
    Luo, Weihua
    Zhang, Min
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 11468 - 11476
  • [28] Fast and Robust Neural Network Joint Models for Statistical Machine Translation
    Devlin, Jacob
    Zbib, Rabih
    Huang, Zhongqiang
    Lamar, Thomas
    Schwartz, Richard
    Makhoul, John
    PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2014, : 1370 - 1380
  • [29] Keeping Models Consistent between Pretraining and Translation for Low-Resource Neural Machine Translation
    Zhang, Wenbo
    Li, Xiao
    Yang, Yating
    Dong, Rui
    Luo, Gongxu
    FUTURE INTERNET, 2020, 12 (12): : 1 - 13
  • [30] Prompting Neural Machine Translation with Translation Memories
    Reheman, Abudurexiti
    Zhou, Tao
    Luo, Yingfeng
    Yang, Di
    Xiao, Tong
    Zhu, Jingbo
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 11, 2023, : 13519 - 13527