An Ensemble Strategy with Gradient Conflict for Multi-Domain Neural Machine Translation

被引:0
|
作者
Man, Zhibo [1 ]
Zhang, Yujie [1 ]
Li, Yu [1 ]
Chen, Yuanmeng [1 ]
Chen, Yufeng [1 ]
Xu, Jinan [1 ]
机构
[1] Beijing Jiaotong Univ, Sch Comp & Informat Technol, 3 Shangyuan Village, Beijing 100080, Peoples R China
关键词
domain-specific; gradient conflict; Multi-domain neural machine translation;
D O I
10.1145/3638248
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-domain neural machine translation aims to construct a unified neural machine translation model to translate sentences across various domains. Nevertheless, previous studies have one limitation is the incapacity to acquire both domain-general and domain-specific representations concurrently. To this end, we propose an ensemble strategy with gradient conflict for multi-domain neural machine translation that automatically learns model parameters by identifying both domain-shared and domain-specific features. Specifically, our approach consists of (1) a parameter-sharing framework, where the parameters of all the layers are originally shared and equivalent to each domain, and (2) ensemble strategy, in which we design an Extra Ensemble strategy via a piecewise condition function to learn direction and distance-based gradient conflict. In addition, we give a detailed theoretical analysis of the gradient conflict to further validate the effectiveness of our approach. Experimental results on two multi-domain datasets show the superior performance of our proposed model compared to previous work.
引用
收藏
页数:22
相关论文
共 50 条
  • [41] A Domain Adaptation Method for Neural Machine Translation
    Tian, Xiaohu
    Liu, Jin
    Pu, Jiachen
    Wang, Jin
    ADVANCED MULTIMEDIA AND UBIQUITOUS ENGINEERING, MUE/FUTURETECH 2018, 2019, 518 : 321 - 326
  • [42] Unsupervised Domain Adaptation for Neural Machine Translation
    Yang, Zhen
    Chen, Wei
    Wang, Feng
    Xu, Bo
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 338 - 343
  • [43] Domain Adaptive Inference for Neural Machine Translation
    Saunders, Danielle
    Stahlberg, Felix
    de Gispert, Adria
    Byrne, Bill
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 222 - 228
  • [44] Efficient parametrization of multi-domain deep neural networks
    Rebuffi, Sylvestre-Alvise
    Bilen, Hakan
    Vedaldi, Andrea
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8119 - 8127
  • [45] Fault diagnosis of rotating machinery with ensemble kernel extreme learning machine based on fused multi-domain features
    Pang, Shan
    Yang, Xinyi
    Zhang, Xiaofeng
    Lin, Xuesen
    ISA TRANSACTIONS, 2020, 98 : 320 - 337
  • [46] low-resource neural Machine translation with Multi-strategy prototype generation
    Yu Z.-Q.
    Yu Z.-T.
    Huang Y.-X.
    Guo J.-J.
    Xian Y.-T.
    Ruan Jian Xue Bao/Journal of Software, 2023, 34 (11): : 5113 - 5125
  • [47] A novel multi-domain machine reading comprehension model with domain interference mitigation
    Zhou, Chulun
    Wang, Zhihao
    He, Shaojie
    Zhang, Haiying
    Su, Jinsong
    NEUROCOMPUTING, 2022, 500 : 791 - 798
  • [48] Combinatorial coverage framework for machine learning in multi-domain operations
    Cody, Tyler
    Kauffman, Justin
    Krometis, Justin
    Sobien, Dan
    Freeman, Laura
    ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS IV, 2022, 12113
  • [49] A fuzzy-based strategy for multi-domain sentiment analysis
    Dragoni, Mauro
    Petrucci, Giulio
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2018, 93 : 59 - 73
  • [50] Multi-Domain Image-to-Image Translation with Adaptive Inference Graph
    The-Phuc Nguyen
    Lathuiliere, Stephane
    Ricci, Elisa
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 5368 - 5375