An Ensemble Strategy with Gradient Conflict for Multi-Domain Neural Machine Translation

Cited by: 0
Authors
Man, Zhibo [1]
Zhang, Yujie [1]
Li, Yu [1]
Chen, Yuanmeng [1]
Chen, Yufeng [1]
Xu, Jinan [1]
Affiliations
[1] Beijing Jiaotong Univ, Sch Comp & Informat Technol, 3 Shangyuan Village, Beijing 100080, Peoples R China
Keywords
domain-specific; gradient conflict; multi-domain neural machine translation
DOI
10.1145/3638248
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Multi-domain neural machine translation aims to construct a unified neural machine translation model that translates sentences across various domains. Previous studies, however, share one limitation: they cannot acquire domain-general and domain-specific representations at the same time. To address this, we propose an ensemble strategy with gradient conflict for multi-domain neural machine translation that automatically learns model parameters by identifying both domain-shared and domain-specific features. Specifically, our approach consists of (1) a parameter-sharing framework, in which the parameters of all layers are initially shared and treated identically for every domain, and (2) an ensemble strategy, in which we design an Extra Ensemble strategy that uses a piecewise condition function to learn direction-based and distance-based gradient conflict. In addition, we give a detailed theoretical analysis of the gradient conflict to further validate the effectiveness of our approach. Experimental results on two multi-domain datasets show that our proposed model outperforms previous work.
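The abstract does not specify how direction-based and distance-based gradient conflict are measured, so the following is only a minimal illustrative sketch under common assumptions, not the authors' method. It assumes that direction conflict between two domains can be read off the cosine similarity of their gradients (negative meaning the gradients point in opposing directions) and that distance conflict can be read off their Euclidean distance. The function names and the toy gradient vectors are hypothetical.

```python
import math

def dot(u, v):
    """Inner product of two equal-length gradient vectors."""
    return sum(a * b for a, b in zip(u, v))

def cosine_similarity(u, v):
    """Direction agreement in [-1, 1]; returns 0.0 if either vector is zero."""
    norm = math.sqrt(dot(u, u)) * math.sqrt(dot(v, v))
    return dot(u, v) / norm if norm > 0 else 0.0

def euclidean_distance(u, v):
    """Distance between two gradient vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

def conflict_report(grad_a, grad_b):
    """Summarize direction and distance disagreement between two domains.

    Assumption (not from the paper): a negative cosine similarity is
    treated as a direction conflict.
    """
    cos = cosine_similarity(grad_a, grad_b)
    return {
        "cosine": cos,
        "distance": euclidean_distance(grad_a, grad_b),
        "conflicting": cos < 0,
    }

# Hypothetical flattened gradients of one shared layer for three domains.
grad_news = [1.0, 0.0]
grad_medical = [-1.0, 1.0]
grad_legal = [1.0, 1.0]

report_conflict = conflict_report(grad_news, grad_medical)
report_agree = conflict_report(grad_news, grad_legal)
```

In a sketch like this, a layer whose domain gradients conflict would be a candidate for domain-specific parameters, while a layer whose gradients agree could stay shared; how the paper's piecewise condition function actually makes that decision is not stated in the abstract.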
Pages: 22