DaLC: Domain Adaptation Learning Curve Prediction for Neural Machine Translation

被引:0
|
作者
Park, Cheonbok [1 ]
Kim, Hantae [1 ]
Calapodescu, Ioan [2 ]
Cho, Hyunchang [1 ]
Nikoulina, Vassilina [2 ]
机构
[1] NAVER Corp, Papago, Seongnam Si, South Korea
[2] NAVER LABS Europe, Meylan, France
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Domain Adaptation (DA) of Neural Machine Translation (NMT) model often relies on a pretrained general NMT model which is adapted to the new domain on a sample of in-domain parallel data. Without parallel data, there is no way to estimate the potential benefit of DA, nor the amount of parallel samples it would require. It is however a desirable functionality that could help MT practitioners to make an informed decision before investing resources in dataset creation. We propose a Domain adaptation Learning Curve prediction (DaLC) model that predicts prospective DA performance based on in-domain monolingual samples in the source language. Our model relies on the NMT encoder representations combined with various instance and corpus-level features. We demonstrate that instance-level is better able to distinguish between different domains compared to corpus-level frameworks proposed in previous studies (Xia et al., 2020; Kolachina et al., 2012). Finally, we perform indepth analyses of the results highlighting the limitations of our approach, and provide directions for future research.
引用
收藏
页码:1789 / 1807
页数:19
相关论文
共 50 条
  • [31] Extreme Adaptation for Personalized Neural Machine Translation
    Michel, Paul
    Neubig, Graham
    PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, 2018, : 312 - 318
  • [32] A General Framework for Adaptation of Neural Machine Translation to Simultaneous Translation
    Chen, Yun
    Li, Liangyou
    Jiang, Xin
    Chen, Xiao
    Liu, Qun
    1ST CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 10TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (AACL-IJCNLP 2020), 2020, : 191 - 200
  • [33] Effective domain awareness and adaptation approach via mask substructure for multi-domain neural machine translation
    Huang, Shuanghong
    Guo, Junjun
    Yu, Zhengtao
    Wen, Yonghua
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (19): : 14047 - 14060
  • [34] Effective domain awareness and adaptation approach via mask substructure for multi-domain neural machine translation
    Shuanghong Huang
    Junjun Guo
    Zhengtao Yu
    Yonghua Wen
    Neural Computing and Applications, 2023, 35 : 14047 - 14060
  • [35] Realistic material property prediction using domain adaptation based machine learning
    Hu, Jeffrey
    Liu, David
    Fu, Nihang
    Dong, Rongzhi
    DIGITAL DISCOVERY, 2024, 3 (02): : 300 - 312
  • [36] Domain Adaptive Inference for Neural Machine Translation
    Saunders, Danielle
    Stahlberg, Felix
    de Gispert, Adria
    Byrne, Bill
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 222 - 228
  • [37] Domain adaptation problem in statistical machine translation systems
    Chinea-Rios, Mara
    Sanchis-Trilles, German
    Casacuberta, Francisco
    ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT, 2015, 277 : 205 - 213
  • [38] Evaluating Domain Adaptation for Machine Translation Across Scenarios
    Etchegoyhen, Thierry
    Fernandez Torne, Anna
    Azpeitia, Andoni
    Martinez Garcia, Eva
    Matamala, Anna
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 6 - 15
  • [39] Improving the Quality Trade-Off for Neural Machine Translation Multi-Domain Adaptation
    Hasler, Eva
    Domhan, Tobias
    Trenous, Jonay
    Tran, Ke
    Byrne, Bill
    Hieber, Felix
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 8470 - 8477
  • [40] Continual Learning for Neural Machine Translation
    Cao, Yue
    Wei, Hao-Ran
    Chen, Boxing
    Wan, Xiaojun
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 3964 - 3974