An overview of decoding techniques for large vocabulary continuous speech recognition

被引:56
|
作者
Aubert, XL [1 ]
机构
[1] Philips Res Labs, D-52066 Aachen, Germany
来源
COMPUTER SPEECH AND LANGUAGE | 2002年 / 16卷 / 01期
关键词
D O I
10.1006/csla.2001.0185
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A number of decoding strategies for large vocabulary continuous speech recognition (LVCSR) are examined from the viewpoint of their search space representation. Different design solutions are compared with respect to the integration of linguistic and acoustic constraints, as implied by m-gram language models (LM) and cross-word (CW) phonetic contexts. This study is structured along two main axes: the network expansion and the search algorithm itself. The network can be expanded statically or dynamically while the search can proceed either time-synchronously or asynchronously which leads to distinct architectures. Three broad classes of decoding methods are briefly reviewed: the use of weighted finite state transducers (WFST) for static network expansion, the time-synchronous dynamic-expansion search and the asynchronous stack decoding. Heuristic methods for further reducing the search space are also considered. The main approaches are compared and some prospective views are formulated regarding possible future avenues. (C) 2002 Academic Press.
引用
收藏
页码:89 / 114
页数:26
相关论文
共 50 条
  • [1] Integrating induced probability into decoding for large vocabulary continuous speech recognition
    Yang, Zhanlei
    Liu, Wenju
    Chao, Hao
    Shengxue Xuebao/Acta Acustica, 2012, 37 (02): : 209 - 217
  • [2] Discriminative training of decoding graphs for large vocabulary continuous speech recognition
    Kuo, Hong-Kwang Jeff
    Kingsbury, Brian
    Zweig, Geoffrey
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 45 - +
  • [3] Integrating induced probability into decoding for large vocabulary continuous speech recognition
    YANG Zhanlei LIU Wenju CHAO Hao (National Laboratory of Pattern Recognition
    Chinese Journal of Acoustics, 2012, 31 (03) : 338 - 352
  • [4] A Detailed Survey on Large Vocabulary Continuous Speech Recognition Techniques
    Vanajakshi, P.
    Mathivanan, M.
    2017 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2017,
  • [5] Minimum Bayes risk estimation and decoding in large vocabulary continuous speech recognition
    Byrne, W
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2006, E89D (03): : 900 - 907
  • [6] Response Probability Based Decoding Algorithm for Large Vocabulary Continuous Speech Recognition
    Yang, Zhanlei
    Chao, Hao
    Liu, Wenju
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1940 - 1943
  • [7] Improved discriminative training techniques for large vocabulary continuous speech recognition
    Povey, D
    Woodland, PC
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 45 - 48
  • [8] Vietnamese Large Vocabulary Continuous Speech Recognition
    Ngoc Thang Vu
    Schultz, Tanja
    2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 333 - 338
  • [9] Advances in large vocabulary continuous speech recognition
    Zweig, G
    Picheny, M
    ADVANCES IN COMPUTERS, VOL. 60: INFORMATION SECURITY, 2004, 60 : 249 - 291
  • [10] Advances in Missing Feature Techniques for Robust Large-Vocabulary Continuous Speech Recognition
    Van Segbroeck, Maarten
    Van Hamme, Hugo
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (01): : 123 - 137