Reduction of Neural Machine Translation Failures by Incorporating Statistical Machine Translation

被引:1
|
作者
Dugonik, Jani [1 ]
Maucec, Mirjam Sepesy [1 ]
Verber, Domen [1 ]
Brest, Janez [1 ]
机构
[1] Univ Maribor, Fac Elect Engn & Comp Sci, SI-2000 Maribor, Slovenia
关键词
neural machine translation; statistical machine translation; sentence embedding; similarity; classification; hybrid machine translation;
D O I
10.3390/math11112484
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
This paper proposes a hybrid machine translation (HMT) system that improves the quality of neural machine translation (NMT) by incorporating statistical machine translation (SMT). Therefore, two NMT systems and two SMT systems were built for the Slovenian-English language pair, each for translation in one direction. We used a multilingual language model to embed the source sentence and translations into the same vector space. From each vector, we extracted features based on the distances and similarities calculated between the source sentence and the NMT translation, and between the source sentence and the SMT translation. To select the best possible translation, we used several well-known classifiers to predict which translation system generated a better translation of the source sentence. The proposed method of combining SMT and NMT in the hybrid system is novel. Our framework is language-independent and can be applied to other languages supported by the multilingual language model. Our experiment involved empirical applications. We compared the performance of the classifiers, and the results demonstrate that our proposed HMT system achieved notable improvements in the BLEU score, with an increase of 1.5 points and 10.9 points for both translation directions, respectively.
引用
收藏
页数:22
相关论文
共 50 条
  • [21] Detecting Failures of Neural Machine Translation in the Absence of Reference Translations
    Wang, Wenyu
    Zheng, Wujie
    Liu, Dian
    Zhang, Changrong
    Zeng, Qinsong
    Deng, Yuetang
    Yang, Wei
    He, Pinjia
    Xie, Tao
    49TH ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS (DSN 2019): INDUSTRY TRACK, 2019, : 1 - 4
  • [22] Statistical machine translation based on translation rules
    Yulian, H.
    Journal of Chemical and Pharmaceutical Research, 2014, 6 (07) : 1628 - 1635
  • [23] Entity Highlight Generation as Statistical and Neural Machine Translation
    Huang, Jizhou
    Sun, Yaming
    Zhang, Wei
    Wang, Haifeng
    Liu, Ting
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (10) : 1860 - 1872
  • [24] A Recursive Recurrent Neural Network for Statistical Machine Translation
    Liu, Shujie
    Yang, Nan
    Li, Mu
    Zhou, Ming
    PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2014, : 1491 - 1500
  • [25] An Investigation on Statistical Machine Translation with Neural Language Models
    Zhao, Yinggong
    Huang, Shujian
    Chen, Huadong
    Chen, Jiajun
    CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA, CCL 2014, 2014, 8801 : 175 - 186
  • [26] English-Basque Statistical and Neural Machine Translation
    Unanue, Inigo Jauregi
    Garmendia Arratibel, Lierni
    Borzeshi, Ehsan Zare
    Piccardi, Massimo
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 880 - 885
  • [27] The Event/Machine of Neural Machine Translation?
    Regnauld, Arnaud
    JOURNAL OF AESTHETICS AND PHENOMENOLOGY, 2022, 9 (02) : 141 - 154
  • [28] Neural Name Translation Improves Neural Machine Translation
    Li, Xiaoqing
    Yan, Jinghui
    Zhang, Jiajun
    Zong, Chengqing
    MACHINE TRANSLATION, CWMT 2018, 2019, 954 : 93 - 100
  • [29] Improving Neural Machine Translation by Efficiently Incorporating Syntactic Templates
    Phuong Nguyen
    Tung Le
    Thanh-Le Ha
    Thai Dang
    Khanh Tran
    Kim Anh Nguyen
    Nguyen Le Minh
    ADVANCES AND TRENDS IN ARTIFICIAL INTELLIGENCE: THEORY AND PRACTICES IN ARTIFICIAL INTELLIGENCE, 2022, 13343 : 303 - 314
  • [30] Incorporating Syntactic Knowledge in Neural Quality Estimation for Machine Translation
    Ye, Na
    Wang, Yuanyuan
    Cai, Dongfeng
    MACHINE TRANSLATION, CCMT 2019, 2019, 1104 : 23 - 34