An Ensemble Strategy with Gradient Conflict for Multi-Domain Neural Machine Translation

被引:0
|
作者
Man, Zhibo [1 ]
Zhang, Yujie [1 ]
Li, Yu [1 ]
Chen, Yuanmeng [1 ]
Chen, Yufeng [1 ]
Xu, Jinan [1 ]
机构
[1] Beijing Jiaotong Univ, Sch Comp & Informat Technol, 3 Shangyuan Village, Beijing 100080, Peoples R China
关键词
domain-specific; gradient conflict; Multi-domain neural machine translation;
D O I
10.1145/3638248
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-domain neural machine translation aims to construct a unified neural machine translation model to translate sentences across various domains. Nevertheless, previous studies have one limitation is the incapacity to acquire both domain-general and domain-specific representations concurrently. To this end, we propose an ensemble strategy with gradient conflict for multi-domain neural machine translation that automatically learns model parameters by identifying both domain-shared and domain-specific features. Specifically, our approach consists of (1) a parameter-sharing framework, where the parameters of all the layers are originally shared and equivalent to each domain, and (2) ensemble strategy, in which we design an Extra Ensemble strategy via a piecewise condition function to learn direction and distance-based gradient conflict. In addition, we give a detailed theoretical analysis of the gradient conflict to further validate the effectiveness of our approach. Experimental results on two multi-domain datasets show the superior performance of our proposed model compared to previous work.
引用
收藏
页数:22
相关论文
共 50 条
  • [21] Specific emitter identification based on ensemble domain adversarial neural network in multi-domain environments
    Li, Dingshan
    Yao, Bin
    Sun, Pu
    Li, Peitong
    Yan, Jianfeng
    Wang, Juzhen
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2024, 2024 (01)
  • [22] Multi-domain CT translation by a routable translation network
    Kim, Hyunjong
    Oh, Gyutaek
    Seo, Joon Beom
    Hwang, Hye Jeon
    Lee, Sang Min
    Yun, Jihye
    Ye, Jong Chul
    PHYSICS IN MEDICINE AND BIOLOGY, 2022, 67 (21):
  • [23] Management mechanism for multi-domain strategy
    Duan, Li-Juan
    Liu, Yan
    Yang, Zhen
    Lai, Ying-Xu
    Beijing Gongye Daxue Xuebao/Journal of Beijing University of Technology, 2010, 36 (SUPPL. 2): : 49 - 53
  • [24] Adversarial Feature Translation for Multi-domain Recommendation
    Hao, Xiaobo
    Liu, Yudan
    Xie, Ruobing
    Ge, Kaikai
    Tang, Linyao
    Zhang, Xu
    Lin, Leyu
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 2964 - 2973
  • [25] SWITCHGAN FOR MULTI-DOMAIN FAICAL IMAGE TRANSLATION
    Zhu, Yuanlue
    Bai, Mengchao
    Shen, Linlin
    Wen, Zhiwei
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1198 - 1203
  • [26] Multi-Domain Neural Network Recommender
    Yi, Baolin
    Zhao, Shuting
    Shen, Xiaoxuan
    Zhang, Li
    2018 IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS AND COMMUNICATION ENGINEERING (ICECE 2018), 2018, : 41 - 45
  • [27] Unsupervised multi-domain image translation with domain representation learning
    Liu, Huajun
    Chen, Lei
    Sui, Haigang
    Zhu, Qing
    Lei, Dian
    Liu, Shubo
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2021, 99
  • [28] An ensemble approach to stabilize the features for multi-domain sentiment analysis using supervised machine learning
    Ghosh M.
    Sanyal G.
    Ghosh, Monalisa (monalisa_05mca@yahoo.com), 2018, SpringerOpen (05)
  • [29] Transductive Ensemble Learning for Neural Machine Translation
    Wang, Yiren
    Wu, Lijun
    Xia, Yingce
    Qin, Tao
    Zhai, ChengXiang
    Liu, Tie-Yan
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 6291 - 6298
  • [30] Gated SwitchGAN for Multi-Domain Facial Image Translation
    Zhang, Xiaokang
    Zhu, Yuanlue
    Chen, Wenting
    Liu, Wenshuang
    Shen, Linlin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1990 - 2003