Phrase-based translation of speech recognizer word lattices using loglinear model combination

被引:0
|
作者
Matusov, E [1 ]
Ney, H [1 ]
Schlüter, R [1 ]
机构
[1] Rhein Westfal TH Aachen, Dept Comp Sci, Lehrstuhl Informat 6, D-52056 Aachen, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a phrase-based speech translation system that combines phrasal lexicon, language, and acoustic model features in a loglinear model. Automatic speech recognition and machine translation are coupled by using large word lattices as the input for translation. For the first time, all features are directly integrated into the decoding process. The feature weights are iteratively optimized for an objective error measure. We prove that acoustic recognition scores of the recognized words in the lattices together with a source language model score positively and significantly affect the translation quality. We show the advantage of using loglinear model combination for a robust optimization of scaling factors. We report consistent improvements compared with translations of single best recognition output on an Italian-to-English translation task. First encouraging results were also obtained on a large vocabulary task of translating European parliamentary speeches.
引用
收藏
页码:110 / 115
页数:6
相关论文
共 50 条
  • [21] Phrase-Based & Neural Unsupervised Machine Translation
    Lample, Guillaume
    Ott, Myle
    Conneau, Alexis
    Denoyer, Ludovic
    Ranzato, Marc'Aurelio
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 5039 - 5049
  • [22] Improvements in phrase-based statistical machine translation
    Zens, R
    Ney, H
    HLT-NAACL 2004: HUMAN LANGUAGE TECHNOLOGY CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE MAIN CONFERENCE, 2004, : 257 - 264
  • [23] Cardinality pruning and language model heuristics for hierarchical phrase-based translation
    Vilar, David
    Ney, Hermann
    MACHINE TRANSLATION, 2012, 26 (03) : 217 - 254
  • [24] Introducing a translation dictionary into phrase-based SMT
    Okuma, Hideo
    Yamamoto, Hirofumi
    Sumita, Eiichiro
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (07): : 2051 - 2057
  • [25] Syntactic phrase-based statistical machine translation
    Hassan, Hany
    Heame, Mary
    Way, Andy
    Sima'an, Khalil
    2006 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, 2006, : 238 - +
  • [26] FACTORED PHRASE-BASED STATISTICAL MACHINE TRANSLATION
    Tufis, Dan
    Ceausu, Alexandru
    FROM SPEECH PROCESSING TO SPOKEN LANGUAGE TECHNOLOGY, 2009, : 115 - 124
  • [27] Learning Better Classification-based Reordering Model for Phrase-based Translation
    Li Fuxue
    Zhu Jingbo
    Xiao Tong
    2017 INTERNATIONAL CONFERENCE ON COMPUTER NETWORK, ELECTRONIC AND AUTOMATION (ICCNEA), 2017, : 190 - 197
  • [28] Phrase-Based Machine Translation based on Simulated Annealing
    Lavecchia, Caroline
    Langlois, David
    Smaili, Kamel
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 3123 - 3129
  • [29] Divergence-based fine pruning of phrase-based statistical translation model
    Kim, Kangil
    Park, Eun-Jin
    Shin, Jong-Hun
    Kwon, Oh-Woog
    Kim, Young-Kil
    COMPUTER SPEECH AND LANGUAGE, 2017, 41 : 146 - 160
  • [30] Using TectoMT as a Preprocessing Tool for Phrase-Based Statistical Machine Translation
    Zeman, Daniel
    TEXT, SPEECH AND DIALOGUE, 2010, 6231 : 216 - 223