Iterative Nearest Neighbour Machine Translation for Unsupervised Domain Adaptation

被引:0
|
作者
Huang, Hui [1 ,5 ]
Wu, Shuangzhi [2 ]
Liang, Xinnian [3 ,5 ]
Zhou, Zefan [4 ,5 ]
Yang, Muyun [1 ]
Zhao, Tiejun [1 ]
机构
[1] Harbin Inst Technol, Fac Comp, Harbin, Peoples R China
[2] ByteDance AI Lab, Beijing, Peoples R China
[3] Beihang Univ, State Key Lab Software Dev Environm, Beijing, Peoples R China
[4] Northeastern Univ, Sch Comp Sci & Engn, Shenyang, Peoples R China
[5] ByteDance Inc, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Unsupervised domain adaptation of machine translation, which adapts a pre-trained translation model to a specific domain without indomain parallel data, has drawn extensive attention in recent years. However, most existing methods focus on the fine-tuning based techniques, which is non-extensible. In this paper, we propose a new method to perform unsupervised domain adaptation in a non-parametric manner. Employing only indomain monolingual data, this method jointly perform nearest neighbour inference on both forward and backward translation directions. The forward translation model creates nearest neighbour datastore for the backward direction, and vice versa, strengthening each other in an iterative style. Experiments on multidomain datasets demonstrate that our method significantly improves the in-domain translation performance and achieves state-of-the-art results among non-parametric methods.
引用
收藏
页码:13294 / 13301
页数:8
相关论文
共 50 条
  • [1] Unsupervised Domain Adaptation for Neural Machine Translation
    Yang, Zhen
    Chen, Wei
    Wang, Feng
    Xu, Bo
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 338 - 343
  • [2] Iterative Dual Domain Adaptation for Neural Machine Translation
    Zeng, Jiali
    Liu, Yang
    Su, Jinsong
    Ge, Yubin
    Lu, Yaojie
    Yin, Yongjing
    Luo, Jiebo
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 845 - 855
  • [3] Exploring iterative dual domain adaptation for neural machine translation
    Liu, Xin
    Zeng, Jieli
    Wang, Xiaoyue
    Wang, Zhihao
    Su, Jinsong
    KNOWLEDGE-BASED SYSTEMS, 2024, 283
  • [4] Neural Network Machine Translation Method Based on Unsupervised Domain Adaptation
    Wang, Rui
    COMPLEXITY, 2020, 2020 (2020)
  • [5] Non-Parametric Unsupervised Domain Adaptation for Neural Machine Translation
    Zheng, Xin
    Zhang, Zhirui
    Huang, Shujian
    Chen, Boxing
    Xie, Jun
    Luo, Weihua
    Chen, Jiajun
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 4234 - 4241
  • [6] Unsupervised Domain Adaptation for Neural Machine Translation with Domain-Aware Feature Embeddings
    Dou, Zi-Yi
    Hu, Junjie
    Anastasopoulos, Antonios
    Neubig, Graham
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 1417 - 1422
  • [7] Domain Adaptation for Statistical Machine Translation
    Wang, Xiaoxue
    Zhu, Conghui
    Li, Sheng
    Zhao, Tiejun
    Zheng, Dequan
    2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 1652 - 1658
  • [8] Efficient Machine Translation Domain Adaptation
    Martins, Pedro Henrique
    Marinhe, Zita
    Martins, Andre F. T.
    PROCEEDINGS OF THE 1ST WORKSHOP ON SEMIPARAMETRIC METHODS IN NLP: DECOUPLING LOGIC FROM KNOWLEDGE (SPA-NLP 2022), 2022, : 23 - 29
  • [9] Generalised Unsupervised Domain Adaptation of Neural Machine Translation with Cross-Lingual Data Selection
    Thuy-Trang Vu
    He, Xuanli
    Dinh Phung
    Haffari, Gholamreza
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 3335 - 3346
  • [10] Vocabulary Adaptation for Domain Adaptation in Neural Machine Translation
    Sato, Shoetsu
    Sakuma, Jin
    Yoshinaga, Naoki
    Toyoda, Masashi
    Kitsuregawa, Masaru
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 4269 - 4279