DaLC: Domain Adaptation Learning Curve Prediction for Neural Machine Translation

被引:0
|
作者
Park, Cheonbok [1 ]
Kim, Hantae [1 ]
Calapodescu, Ioan [2 ]
Cho, Hyunchang [1 ]
Nikoulina, Vassilina [2 ]
机构
[1] NAVER Corp, Papago, Seongnam Si, South Korea
[2] NAVER LABS Europe, Meylan, France
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Domain Adaptation (DA) of Neural Machine Translation (NMT) model often relies on a pretrained general NMT model which is adapted to the new domain on a sample of in-domain parallel data. Without parallel data, there is no way to estimate the potential benefit of DA, nor the amount of parallel samples it would require. It is however a desirable functionality that could help MT practitioners to make an informed decision before investing resources in dataset creation. We propose a Domain adaptation Learning Curve prediction (DaLC) model that predicts prospective DA performance based on in-domain monolingual samples in the source language. Our model relies on the NMT encoder representations combined with various instance and corpus-level features. We demonstrate that instance-level is better able to distinguish between different domains compared to corpus-level frameworks proposed in previous studies (Xia et al., 2020; Kolachina et al., 2012). Finally, we perform indepth analyses of the results highlighting the limitations of our approach, and provide directions for future research.
引用
收藏
页码:1789 / 1807
页数:19
相关论文
共 50 条
  • [21] Learning Domain Specific Sub-layer Latent Variable for Multi-domain Adaptation Neural Machine Translation
    Huang, Shuanghong
    Feng, Chong
    Shi, Ge
    Li, Zhengjun
    Zhao, Xuan
    Li, Xinyan
    Wang, Xiaomei
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (06)
  • [22] Unsupervised Domain Adaptation for Neural Machine Translation with Domain-Aware Feature Embeddings
    Dou, Zi-Yi
    Hu, Junjie
    Anastasopoulos, Antonios
    Neubig, Graham
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 1417 - 1422
  • [23] Observing the Learning Curve of Neural Machine Translation with regard to Linguistic Phenomena
    Stadler, Patrick
    Macketanz, Vivien
    Avramidis, Eleftherios
    ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, 2021, : 186 - 196
  • [24] Domain Adaptation for Statistical Machine Translation
    Wang, Xiaoxue
    Zhu, Conghui
    Li, Sheng
    Zhao, Tiejun
    Zheng, Dequan
    2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 1652 - 1658
  • [25] Efficient Machine Translation Domain Adaptation
    Martins, Pedro Henrique
    Marinhe, Zita
    Martins, Andre F. T.
    PROCEEDINGS OF THE 1ST WORKSHOP ON SEMIPARAMETRIC METHODS IN NLP: DECOUPLING LOGIC FROM KNOWLEDGE (SPA-NLP 2022), 2022, : 23 - 29
  • [26] Regularized Training Objective for Continued Training for Domain Adaptation in Neural Machine Translation
    Khayrallah, Huda
    Thompson, Brian
    Duh, Kevin
    Koehn, Philipp
    NEURAL MACHINE TRANSLATION AND GENERATION, 2018, : 36 - 44
  • [27] Incremental Domain Adaptation for Neural Machine Translation in Low-Resource Settings
    Kalimuthu, Marimuthu
    Barz, Michael
    Sonntag, Daniel
    FOURTH ARABIC NATURAL LANGUAGE PROCESSING WORKSHOP (WANLP 2019), 2019, : 1 - 10
  • [28] Domain Adaptation in Neural Machine Translation using a Qualia-Enriched FrameNet
    Costa, Alexandre Diniz
    Marim, Mateus Coutinho
    da Silva Matos, Ely Edison
    Torrent, Tiago Timponi
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 1 - 12
  • [29] Addressing domain shift in neural machine translation via reinforcement learning
    Kumar, Amit
    Pratap, Ajay
    Singh, Anil Kumar
    Saha, Sriparna
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 201
  • [30] Simple, Scalable Adaptation for Neural Machine Translation
    Bapna, Ankur
    Firat, Orhan
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 1538 - 1548