Improving Adversarial Neural Machine Translation for Morphologically Rich Language

被引:10
|
作者
Mi, Chenggang [1 ]
Xie, Lei [1 ]
Zhang, Yanning [1 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Natl Engn Lab Integrated Aerosp Ground Ocean Big, Xian 710072, Peoples R China
基金
中国国家自然科学基金;
关键词
Neural machine translation (NMT); morphologically rich language; adversarial training; morphological word embedding; multiple references;
D O I
10.1109/TETCI.2019.2960546
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Generative adversarial networks (GAN) have great successes on natural language processing (NLP) and neural machine translation (NMT). However, the existing discriminator in GAN for NMT only combines two words as one query to train the translation models, which restrict the discriminator to be more meaningful and fail to apply rich monolingual information. Recent studies only consider one single reference translation during model training, this limit the GAN model to learn sufficient information about the representation of source sentence. These situations are even worse when languages are morphologically rich. In this article, an extended version of GAN model for neural machine translation is proposed to optimize the performance of morphologically rich language translation. In particular, we use the morphological word embedding instead of word embedding as input in GAN model to enrich the representation of words and overcome the data sparsity problem during model training. Moreover, multiple references are integrated into discriminator to make the model consider more context information and adapt to the diversity of different languages. Experimental results on German <-> English, French <-> English, Czech <-> English, Finnish <-> English, Turkish <-> English, Chinese <-> English, Finnish <-> Turkish and Turkish <-> Czech translation tasks demonstrate that our method achieves significant improvements over baseline systems.
引用
收藏
页码:417 / 426
页数:10
相关论文
共 50 条
  • [21] Improving Neural Machine Translation with Neural Sentence Rewriting
    Wu, Tian
    He, Zhongjun
    Chen, Enhong
    Wang, Haifeng
    2018 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2018, : 147 - 152
  • [22] Improving Neural Machine Translation with Neural Syntactic Distance
    Ma, Chunpeng
    Tamura, Akihiro
    Utiyama, Masao
    Zhao, Tiejun
    Sumita, Eiichiro
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 2032 - 2037
  • [23] End-to-End Lexically Constrained Machine Translation for Morphologically Rich Languages
    Jon, Josef
    Aires, Joao Paulo
    Varis, Dusan
    Bojar, Ondrej
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 4019 - 4033
  • [24] Neural correlates of morphological decomposition in a morphologically rich language: An fMRI study
    Lehtonen, Minna
    Vorobyev, Victor A.
    Hugdahl, Kenneth
    Tuokkola, Terhi
    Laine, Matti
    BRAIN AND LANGUAGE, 2006, 98 (02) : 182 - 193
  • [25] Improving Neural Language Modeling via Adversarial Training
    Wang, Dilin
    Gong, Chengyue
    Liu, Qiang
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [26] Improving Neural Machine Translation Using Rule-Based Machine Translation
    Singh, Muskaan
    Kumar, Ravinder
    Chana, Inderveer
    2019 7TH INTERNATIONAL CONFERENCE ON SMART COMPUTING & COMMUNICATIONS (ICSCC), 2019, : 8 - 12
  • [27] Improving Neural Machine Translation by Bidirectional Training
    Ding, Liang
    Wu, Di
    Tao, Dacheng
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 3278 - 3284
  • [28] Corpus Augmentation for Improving Neural Machine Translation
    Li, Zijian
    Chi, Chengying
    Zhan, Yunyun
    CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 64 (01): : 637 - 650
  • [29] Improving Neural Machine Translation by Retrieving Target Translation Template
    Li, Fuxue
    Chi, Chuncheng
    Yan, Hong
    Zhang, Zhen
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT IV, 2023, 14089 : 658 - 669
  • [30] Extract and Attend: Improving Entity Translation in Neural Machine Translation
    Zeng, Zixin
    Wang, Rui
    Leng, Yichong
    Guo, Junliang
    Tan, Xu
    Qin, Tao
    Liu, Tie-yan
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 1697 - 1710