Moment matching training for neural machine translation: An empirical study

被引:0
|
作者
Nguyen, Long H. B. [1 ,2 ]
Pham, Nghi T. [1 ,2 ]
Duc, Le D. C. [1 ,2 ]
Cong Duy Vu Hoang [3 ]
Dien Dinh [1 ,2 ]
机构
[1] Univ Sci Ho Chi Minh City, Fac Informat Technol, Ho Chi Minh City, Vietnam
[2] Vietnam Natl Univ, Ho Chi Minh City, Vietnam
[3] Oracle Corp, Melbourne, Vic, Australia
关键词
Neural machine translation; moment matching; objective function;
D O I
10.3233/JIFS-213240
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, Neural Machine Translation (NMT), which harnesses the power of neural networks, has achieved astonishing achievements. Despite its promise, NMT models can still not model prior external knowledge. Recent investigations have necessitated the adaptation of past expertise to both training and inference methods, resulting in translation inference issues. This paper proposes an extension of the moment matching framework that incorporates advanced prior knowledge without interfering with the inference process by using a matching mechanism between the model and empirical distributions. Our tests show that the suggested expansion outperforms the baseline and effectively over various language combinations.
引用
收藏
页码:2633 / 2645
页数:13
相关论文
共 50 条
  • [1] An empirical study of cyclical learning rate on neural machine translation
    Wang, Weixuan
    Lee, Choon Meng
    Liu, Jianfeng
    Colakoglu, Talha
    Peng, Wei
    NATURAL LANGUAGE ENGINEERING, 2023, 29 (02) : 316 - 336
  • [2] An Empirical Study on Automatic Post Editing for Neural Machine Translation
    Moon, Hyeonseok
    Park, Chanjun
    Eo, Sugyeong
    Seo, Jaehyung
    Lim, Heuiseok
    IEEE ACCESS, 2021, 9 : 123754 - 123763
  • [3] An Empirical Study towards Characterizing Neural Machine Translation Testing Methods
    He, Chenxi
    Liu, Wenhong
    Zhao, Shuang
    Liu, Jiawei
    Yang, Yang
    2022 IEEE 22ND INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY, AND SECURITY COMPANION, QRS-C, 2022, : 179 - 182
  • [4] Impact of Sentence Representation Matching in Neural Machine Translation
    Jung, Heeseung
    Kim, Kangil
    Shin, Jong-Hun
    Na, Seung-Hoon
    Jung, Sangkeun
    Woo, Sangmin
    APPLIED SCIENCES-BASEL, 2022, 12 (03):
  • [5] Improving Neural Machine Translation by Bidirectional Training
    Ding, Liang
    Wu, Di
    Tao, Dacheng
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 3278 - 3284
  • [6] Discriminant training of neural networks for machine translation
    Quoc-Khanh Do
    Allauzen, Alexandre
    Yvon, Francois
    TRAITEMENT AUTOMATIQUE DES LANGUES, 2016, 57 (01): : 111 - 135
  • [7] Generative adversarial training for neural machine translation
    Yang, Zhen
    Chen, Wei
    Wang, Feng
    Xu, Bo
    NEUROCOMPUTING, 2018, 321 : 146 - 155
  • [8] Speed Up the Training of Neural Machine Translation
    Xinyue Liu
    Weixuan Wang
    Wenxin Liang
    Yuangang Li
    Neural Processing Letters, 2020, 51 : 231 - 249
  • [9] Speed Up the Training of Neural Machine Translation
    Liu, Xinyue
    Wang, Weixuan
    Liang, Wenxin
    Li, Yuangang
    NEURAL PROCESSING LETTERS, 2020, 51 (01) : 231 - 249
  • [10] Minimum Risk Training for Neural Machine Translation
    Shen, Shiqi
    Cheng, Yong
    He, Zhongjun
    He, Wei
    Wu, Hua
    Sun, Maosong
    Liu, Yang
    PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2016, : 1683 - 1692