Exploiting Target Language Data for Neural Machine Translation Beyond Back Translation

被引:0
|
作者
Reheman, Abudurexiti [1 ]
Lu, Yingfeng [1 ]
Ruan, Junhao [1 ]
Ma, Anxiang [1 ]
Zhang, Chunliang [1 ,2 ]
Xiao, Tong [1 ,2 ]
Zhu, Jingbo [1 ,2 ]
机构
[1] Northeastern Univ, Sch Comp Sci & Engn, Shenyang, Peoples R China
[2] NiuTrans Res, Shenyang, Peoples R China
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural Machine Translation (NMT) encounters challenges when translating in new domains and low-resource languages. To address these issues, researchers have proposed methods to integrate additional knowledge into NMT, such as translation memories (TMs). However, finding TMs that closely match the input sentence remains challenging, particularly in specific domains. On the other hand, monolingual data is widely accessible in most languages, and backtranslation is seen as a promising approach for utilizing target language data. Nevertheless, it still necessitates additional training. In this paper, we introduce Pseudo-kNN-MT, a variant of k-nearest neighbor machine translation (kNN-MT) that utilizes target language data by constructing a pseudo datastore. Furthermore, we investigate the utility of large language models (LLMs) for the kNN component. Experimental results demonstrate that our approach exhibits strong domain adaptation capability in both high-resource and low-resource machine translation. Notably, LLMs are found to be beneficial for robust NMT systems.
引用
收藏
页码:12216 / 12228
页数:13
相关论文
共 50 条
  • [1] Exploiting Monolingual Data at Scale for Neural Machine Translation
    Wu, Lijun
    Wang, Yiren
    Xia, Yingce
    Qin, Tao
    Lai, Jianhuang
    Liu, Tie-Yan
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 4207 - 4216
  • [2] Generalizing Back-Translation in Neural Machine Translation
    Graca, Miguel
    Kim, Yunsu
    Schamper, Julian
    Khadivi, Shahram
    Ney, Hermann
    FOURTH CONFERENCE ON MACHINE TRANSLATION (WMT 2019), VOL 1: RESEARCH PAPERS, 2019, : 45 - 52
  • [3] Iterative Back-Translation for Neural Machine Translation
    Vu Cong Duy Hoang
    Koehn, Philipp
    Haffari, Gholamreza
    Cohn, Trevor
    NEURAL MACHINE TRANSLATION AND GENERATION, 2018, : 18 - 24
  • [4] Exploiting Deep Representations for Neural Machine Translation
    Dou, Zi-Yi
    Tu, Zhaopeng
    Wang, Xing
    Shi, Shuming
    Zhang, Tong
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 4253 - 4262
  • [5] Data Rejuvenation: Exploiting Inactive Training Examples for Neural Machine Translation
    Jiao, Wenxiang
    Wang, Xing
    He, Shilin
    King, Irwin
    Lyu, Michael R.
    Tu, Zhaopeng
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 2255 - 2266
  • [6] Exploiting Sentential Context for Neural Machine Translation
    Wang, Xing
    Tu, Zhaopeng
    Wang, Longyue
    Shi, Shuming
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 6197 - 6203
  • [7] Exploiting Knowledge Graph in Neural Machine Translation
    Lu, Yu
    Zhang, Jiajun
    Zong, Chengqing
    MACHINE TRANSLATION, CWMT 2018, 2019, 954 : 27 - 38
  • [8] Non-Fluent Synthetic Target-Language Data Improve Neural Machine Translation
    Sanchez-Cartagena, Victor M.
    Espla-Gomis, Miquel
    Perez-Ortiz, Juan Antonio
    Sanchez-Martinez, Felipe
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (02) : 837 - 850
  • [9] Improving Neural Machine Translation by Retrieving Target Translation Template
    Li, Fuxue
    Chi, Chuncheng
    Yan, Hong
    Zhang, Zhen
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT IV, 2023, 14089 : 658 - 669
  • [10] Using Neural Machine Translation Methods for Sign Language Translation
    Angelova, Galina
    Avramidis, Eleftherios
    Moeller, Sebastian
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): STUDENT RESEARCH WORKSHOP, 2022, : 273 - 284