An Ensemble Strategy with Gradient Conflict for Multi-Domain Neural Machine Translation

Authors
Man, Zhibo [1]
Zhang, Yujie [1]
Li, Yu [1]
Chen, Yuanmeng [1]
Chen, Yufeng [1]
Xu, Jinan [1]
Affiliations
[1] Beijing Jiaotong Univ, Sch Comp & Informat Technol, 3 Shangyuan Village, Beijing 100080, Peoples R China
Keywords
domain-specific; gradient conflict; multi-domain neural machine translation
DOI
10.1145/3638248
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Multi-domain neural machine translation aims to construct a unified neural machine translation model that translates sentences across various domains. However, previous studies share one limitation: they cannot acquire domain-general and domain-specific representations concurrently. To this end, we propose an ensemble strategy with gradient conflict for multi-domain neural machine translation that automatically learns model parameters by identifying both domain-shared and domain-specific features. Specifically, our approach consists of (1) a parameter-sharing framework, in which the parameters of all layers are initially shared and equivalent across domains, and (2) an ensemble strategy, in which we design an Extra Ensemble strategy that uses a piecewise condition function to learn direction-based and distance-based gradient conflict. In addition, we give a detailed theoretical analysis of gradient conflict to further validate the effectiveness of our approach. Experimental results on two multi-domain datasets show that our proposed model outperforms previous work.
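The abstract does not give the form of the piecewise condition function, so the following is only an illustrative sketch of how direction-based and distance-based gradient conflict between two domains might be distinguished. It assumes, without confirmation from the paper, that a direction conflict means a negative cosine similarity between the two domain gradients and that a distance conflict means a Euclidean distance above a threshold. The names `conflict_kind` and `distance_threshold` are hypothetical and are not taken from the paper.

```python
import math


def cosine_similarity(g1, g2):
    """Cosine of the angle between two gradient vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(g1, g2))
    norm1 = math.sqrt(sum(a * a for a in g1))
    norm2 = math.sqrt(sum(b * b for b in g2))
    if norm1 == 0.0 or norm2 == 0.0:
        return 0.0
    return dot / (norm1 * norm2)


def euclidean_distance(g1, g2):
    """Euclidean distance between two gradient vectors of equal length."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(g1, g2)))


def conflict_kind(g1, g2, distance_threshold=1.0):
    """Hypothetical piecewise condition (an assumption, not the paper's definition).

    Returns "direction" when the gradients point in opposing directions,
    "distance" when they agree in direction but lie far apart,
    and "none" otherwise.
    """
    if cosine_similarity(g1, g2) < 0.0:
        return "direction"
    if euclidean_distance(g1, g2) > distance_threshold:
        return "distance"
    return "none"
```

In a sketch like this, parameters whose per-domain gradients show no conflict could remain shared, while conflicting ones would be candidates for domain-specific treatment. How the paper actually makes that decision is not stated in the abstract.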
Pages: 22