Moment matching training for neural machine translation: An empirical study

被引:0
|
作者
Nguyen, Long H. B. [1 ,2 ]
Pham, Nghi T. [1 ,2 ]
Duc, Le D. C. [1 ,2 ]
Cong Duy Vu Hoang [3 ]
Dien Dinh [1 ,2 ]
机构
[1] Univ Sci Ho Chi Minh City, Fac Informat Technol, Ho Chi Minh City, Vietnam
[2] Vietnam Natl Univ, Ho Chi Minh City, Vietnam
[3] Oracle Corp, Melbourne, Vic, Australia
关键词
Neural machine translation; moment matching; objective function;
D O I
10.3233/JIFS-213240
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, Neural Machine Translation (NMT), which harnesses the power of neural networks, has achieved astonishing achievements. Despite its promise, NMT models can still not model prior external knowledge. Recent investigations have necessitated the adaptation of past expertise to both training and inference methods, resulting in translation inference issues. This paper proposes an extension of the moment matching framework that incorporates advanced prior knowledge without interfering with the inference process by using a matching mechanism between the model and empirical distributions. Our tests show that the suggested expansion outperforms the baseline and effectively over various language combinations.
引用
收藏
页码:2633 / 2645
页数:13
相关论文
共 50 条
  • [11] Unsupervised Neural Machine Translation for Similar and Distant Language Pairs: An Empirical Study
    Sun, Haipeng
    Wang, Rui
    Utiyama, Masao
    Marie, Benjamin
    Chen, Kehai
    Sumita, Eiichiro
    Zhao, Tiejun
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (01)
  • [12] Training Neural Machine Translation To Apply Terminology Constraints
    Dinu, Georgiana
    Mathur, Prashant
    Federico, Marcello
    Al-Onaizan, Yaser
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 3063 - 3068
  • [13] Shallow-to-Deep Training for Neural Machine Translation
    Li, Bei
    Wang, Ziyang
    Liu, Hui
    Jiang, Yufan
    Du, Quan
    Xiao, Tong
    Wang, Huizhen
    Zhu, Jingbo
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 995 - 1005
  • [14] Pre-training Methods for Neural Machine Translation
    Wang, Mingxuan
    Li, Lei
    ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING: TUTORIAL ABSTRACTS, 2021, : 21 - 25
  • [15] Restricted or Not: A General Training Framework for Neural Machine Translation
    Li, Zuchao
    Utiyama, Masao
    Sumita, Eiichiro
    Zhao, Hai
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): STUDENT RESEARCH WORKSHOP, 2022, : 245 - 251
  • [16] An empirical study of low-resource neural machine translation of manipuri in multilingual settings
    Salam Michael Singh
    Thoudam Doren Singh
    Neural Computing and Applications, 2022, 34 : 14823 - 14844
  • [17] An empirical study of low-resource neural machine translation of manipuri in multilingual settings
    Singh, Salam Michael
    Singh, Thoudam Doren
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (17): : 14823 - 14844
  • [18] An empirical study of low-resource neural machine translation of manipuri in multilingual settings
    Singh, Salam Michael
    Singh, Thoudam Doren
    Neural Computing and Applications, 2022, 34 (17) : 14823 - 14844
  • [19] Training Recurrent Neural Network through Moment Matching for NLP Applications
    Deng, Yue
    Shen, Yilin
    Chen, KaWai
    Jin, Hongxia
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3353 - 3357
  • [20] An Empirical Comparison of Domain Adaptation Methods for Neural Machine Translation
    Chu, Chenhui
    Dabre, Raj
    Kurohashi, Sadao
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 2, 2017, : 385 - 391