Pivot language approach for phrase-based statistical machine translation

被引:49
|
作者
Wu, Hua [1 ]
Wang, Haifeng [1 ]
机构
[1] Toshiba China Res & Dev Ctr, 501,Tower W2,Oriental Plaza,1,East Chang An Ave, Beijing 100738, Peoples R China
关键词
Pivot language; Phrase-based statistical machine translation; Scarce bilingual resources;
D O I
10.1007/s10590-008-9041-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a novel method for phrase-based statistical machine translation based on the use of a pivot language. To translate between languages L-s and L-t with limited bilingual resources, we bring in a third language, L-p, called the pivot language. For the language pairs L-s - L-p and L-p - L-t, there exist large bilingual corpora. Using only L-s - L-p and L-p- L-t bilingual corpora, we can build a translation model for L-s - L-t. The advantage of this method lies in the fact that we can perform translation between L-s and L-t even if there is no bilingual corpus available for this language pair. Using BLEU as a metric, our pivot language approach significantly outperforms the standard model trained on a small bilingual corpus. Moreover, with a small L-s - L-t bilingual corpus available, our method can further improve translation quality by using the additional L-s - L-p and L-p - L-t bilingual corpora.
引用
收藏
页码:165 / 181
页数:17
相关论文
共 50 条
  • [21] Modality-Preserving Phrase-Based Statistical Machine Translation
    Ideue, Masamichi
    Yamamoto, Kazuhide
    Utiyama, Masao
    Sumita, Eiichiro
    2012 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2012), 2012, : 129 - 132
  • [22] Improving Phrase-Based Statistical Machine Translation with Preprocessing Techniques
    Yashothara, S.
    Uthayasanker, R. T.
    Jayasena, S.
    2018 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2018, : 322 - 327
  • [23] Improving phrase-based statistical machine translation with morphosyntactic transformation
    Thai Phuong Nguyen
    Shimazu, Akira
    MACHINE TRANSLATION, 2006, 20 (03) : 147 - 166
  • [24] A phrase-based, joint probability model for statistical machine translation
    Marcu, D
    Wong, W
    PROCEEDINGS OF THE 2002 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, 2002, : 133 - 139
  • [25] Phrase-Based Tibetan-Chinese Statistical Machine Translation
    Yong Cuo
    Shi, Xiaodong
    Nyima, Tashi
    Chen, Yidong
    PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2019, : 424 - 427
  • [26] Statistical phrase-based speech translation
    Mathias, Lambert
    Byrne, William
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 561 - 564
  • [27] Empirical Analysis of Phrase-Based Statistical Machine Translation System for English to Hindi Language
    Babhulgaonkar, Arun
    Sonavane, Shefali
    VIETNAM JOURNAL OF COMPUTER SCIENCE, 2022, 09 (02) : 135 - 162
  • [28] Improving Phrase-Based Statistical Machine Translation Models by Incorporating Syntax-Based Language Models
    陈毅东
    史晓东
    Journal of Donghua University(English Edition), 2010, 27 (02) : 185 - 188
  • [29] Folsom: A fast and memory-efficient phrase-based approach to statistical machine translation
    Zhou, Bowen
    Chen, Stanley E.
    Gao, Yuqing
    2006 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, 2006, : 226 - +
  • [30] Improving Phrase-based Korean-English Statistical Machine Translation
    Lee, Jonghoon
    Lee, Donghyeon
    Lee, Gary Geunbae
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 753 - 756