Moment matching training for neural machine translation: An empirical study

被引:0
|
作者
Nguyen, Long H. B. [1 ,2 ]
Pham, Nghi T. [1 ,2 ]
Duc, Le D. C. [1 ,2 ]
Cong Duy Vu Hoang [3 ]
Dien Dinh [1 ,2 ]
机构
[1] Univ Sci Ho Chi Minh City, Fac Informat Technol, Ho Chi Minh City, Vietnam
[2] Vietnam Natl Univ, Ho Chi Minh City, Vietnam
[3] Oracle Corp, Melbourne, Vic, Australia
关键词
Neural machine translation; moment matching; objective function;
D O I
10.3233/JIFS-213240
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, Neural Machine Translation (NMT), which harnesses the power of neural networks, has achieved astonishing achievements. Despite its promise, NMT models can still not model prior external knowledge. Recent investigations have necessitated the adaptation of past expertise to both training and inference methods, resulting in translation inference issues. This paper proposes an extension of the moment matching framework that incorporates advanced prior knowledge without interfering with the inference process by using a matching mechanism between the model and empirical distributions. Our tests show that the suggested expansion outperforms the baseline and effectively over various language combinations.
引用
收藏
页码:2633 / 2645
页数:13
相关论文
共 50 条
  • [21] An Empirical Study of Generation Order for Machine Translation
    Chan, William
    Stern, Mitchell
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 5764 - 5773
  • [22] An Empirical Study on Learning Bug-Fixing Patches in the Wild via Neural Machine Translation
    Tufano, Michele
    Watson, Cody
    Bavota, Gabriele
    Di Penta, Massimiliano
    White, Martin
    Poshyvanyk, Denys
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2019, 28 (04)
  • [23] Regularized Training Objective for Continued Training for Domain Adaptation in Neural Machine Translation
    Khayrallah, Huda
    Thompson, Brian
    Duh, Kevin
    Koehn, Philipp
    NEURAL MACHINE TRANSLATION AND GENERATION, 2018, : 36 - 44
  • [24] Adversarial Training for Unknown Word Problems in Neural Machine Translation
    Ji, Yatu
    Hou, Hongxu
    Chen, Junjie
    Wu, Nier
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2020, 19 (01)
  • [25] Iterative Training of Unsupervised Neural and Statistical Machine Translation Systems
    Marie, Benjamin
    Fujita, Atsushi
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2020, 19 (05)
  • [26] Joint Training for Pivot-based Neural Machine Translation
    Cheng, Yong
    Yang, Qian
    Liu, Yang
    Sun, Maosong
    Xu, Wei
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3974 - 3980
  • [27] Training Google Neural Machine Translation on an Intel CPU Cluster
    Kalamkar, Dhiraj D.
    Banerjee, Kunal
    Srinivasan, Sudarshan
    Sridharan, Srinivas
    Georganas, Evangelos
    Smorkalov, Mikhail E.
    Xu, Cong
    Heinecke, Alexander
    2019 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2019, : 193 - 202
  • [28] Training Deeper Neural Machine Translation Models with Transparent Attention
    Bapna, Ankur
    Chen, Mia Xu
    Firat, Orhan
    Cao, Yuan
    Wu, Yonghui
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 3028 - 3033
  • [29] From Bilingual to Multilingual Neural Machine Translation by Incremental Training
    Escolano, Carlos
    Costa-Jussa, Marta R.
    Fonollosa, Jose A. R.
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019:): STUDENT RESEARCH WORKSHOP, 2019, : 236 - 242
  • [30] Alternated Training with Synthetic and Authentic Data for Neural Machine Translation
    Jiao, Rui
    Yang, Zonghan
    Sun, Maosong
    Liu, Yang
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 1828 - 1834