DaLC: Domain Adaptation Learning Curve Prediction for Neural Machine Translation

被引:0
|
作者
Park, Cheonbok [1 ]
Kim, Hantae [1 ]
Calapodescu, Ioan [2 ]
Cho, Hyunchang [1 ]
Nikoulina, Vassilina [2 ]
机构
[1] NAVER Corp, Papago, Seongnam Si, South Korea
[2] NAVER LABS Europe, Meylan, France
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Domain Adaptation (DA) of Neural Machine Translation (NMT) model often relies on a pretrained general NMT model which is adapted to the new domain on a sample of in-domain parallel data. Without parallel data, there is no way to estimate the potential benefit of DA, nor the amount of parallel samples it would require. It is however a desirable functionality that could help MT practitioners to make an informed decision before investing resources in dataset creation. We propose a Domain adaptation Learning Curve prediction (DaLC) model that predicts prospective DA performance based on in-domain monolingual samples in the source language. Our model relies on the NMT encoder representations combined with various instance and corpus-level features. We demonstrate that instance-level is better able to distinguish between different domains compared to corpus-level frameworks proposed in previous studies (Xia et al., 2020; Kolachina et al., 2012). Finally, we perform indepth analyses of the results highlighting the limitations of our approach, and provide directions for future research.
引用
收藏
页码:1789 / 1807
页数:19
相关论文
共 50 条
  • [1] Curriculum Learning for Domain Adaptation in Neural Machine Translation
    Zhang, Xuan
    Shapiro, Pamela
    Kumar, Gaurav
    McNamee, Paul
    Carpuat, Marine
    Duh, Kevin
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 1903 - 1915
  • [2] Vocabulary Adaptation for Domain Adaptation in Neural Machine Translation
    Sato, Shoetsu
    Sakuma, Jin
    Yoshinaga, Naoki
    Toyoda, Masashi
    Kitsuregawa, Masaru
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 4269 - 4279
  • [3] Meta-Curriculum Learning for Domain Adaptation in Neural Machine Translation
    Zhan, Runzhe
    Liu, Xuebo
    Wong, Derek F.
    Chao, Lidia S.
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 14310 - 14318
  • [4] A Domain Adaptation Method for Neural Machine Translation
    Tian, Xiaohu
    Liu, Jin
    Pu, Jiachen
    Wang, Jin
    ADVANCED MULTIMEDIA AND UBIQUITOUS ENGINEERING, MUE/FUTURETECH 2018, 2019, 518 : 321 - 326
  • [5] Unsupervised Domain Adaptation for Neural Machine Translation
    Yang, Zhen
    Chen, Wei
    Wang, Feng
    Xu, Bo
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 338 - 343
  • [6] Domain Adaptation and Multi-Domain Adaptation for Neural Machine Translation: A Survey
    Saunders D.
    Journal of Artificial Intelligence Research, 2022, 75 : 351 - 424
  • [7] Domain Adaptation and Multi-Domain Adaptation for Neural Machine Translation: A Survey
    Saunders, Danielle
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2022, 75 : 351 - 424
  • [8] Sentence Embedding for Neural Machine Translation Domain Adaptation
    Wang, Rui
    Finch, Andrew
    Utiyama, Masao
    Sumita, Eiichiro
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 2, 2017, : 560 - 566
  • [9] Domain Adaptation of Neural Machine Translation by Lexicon Induction
    Hu, Junjie
    Xia, Mengzhou
    Neubig, Graham
    Carbonell, Jaime
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 2989 - 3001
  • [10] Iterative Dual Domain Adaptation for Neural Machine Translation
    Zeng, Jiali
    Liu, Yang
    Su, Jinsong
    Ge, Yubin
    Lu, Yaojie
    Yin, Yongjing
    Luo, Jiebo
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 845 - 855