An overview of decoding techniques for large vocabulary continuous speech recognition

被引：56

作者：

Aubert, XL ^{[1
]}

机构：

[1] Philips Res Labs, D-52066 Aachen, Germany

来源：

COMPUTER SPEECH AND LANGUAGE | 2002年 / 16卷 / 01期

关键词：

D O I：

10.1006/csla.2001.0185

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A number of decoding strategies for large vocabulary continuous speech recognition (LVCSR) are examined from the viewpoint of their search space representation. Different design solutions are compared with respect to the integration of linguistic and acoustic constraints, as implied by m-gram language models (LM) and cross-word (CW) phonetic contexts. This study is structured along two main axes: the network expansion and the search algorithm itself. The network can be expanded statically or dynamically while the search can proceed either time-synchronously or asynchronously which leads to distinct architectures. Three broad classes of decoding methods are briefly reviewed: the use of weighted finite state transducers (WFST) for static network expansion, the time-synchronous dynamic-expansion search and the asynchronous stack decoding. Heuristic methods for further reducing the search space are also considered. The main approaches are compared and some prospective views are formulated regarding possible future avenues. (C) 2002 Academic Press.

引用

页码：89 / 114

页数：26

共 50 条

[1] Integrating induced probability into decoding for large vocabulary continuous speech recognition
Yang, Zhanlei
Liu, Wenju
Chao, Hao
Shengxue Xuebao/Acta Acustica, 2012, 37 (02): : 209 - 217
[2] Discriminative training of decoding graphs for large vocabulary continuous speech recognition
Kuo, Hong-Kwang Jeff
Kingsbury, Brian
Zweig, Geoffrey
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 45 - +
[3] Integrating induced probability into decoding for large vocabulary continuous speech recognition
YANG Zhanlei LIU Wenju CHAO Hao (National Laboratory of Pattern Recognition
Chinese Journal of Acoustics, 2012, 31 (03) : 338 - 352
[4] A Detailed Survey on Large Vocabulary Continuous Speech Recognition Techniques
Vanajakshi, P.
Mathivanan, M.
2017 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2017,
[5] Minimum Bayes risk estimation and decoding in large vocabulary continuous speech recognition
Byrne, W
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2006, E89D (03): : 900 - 907
[6] Response Probability Based Decoding Algorithm for Large Vocabulary Continuous Speech Recognition
Yang, Zhanlei
Chao, Hao
Liu, Wenju
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1940 - 1943
[7] Improved discriminative training techniques for large vocabulary continuous speech recognition
Povey, D
Woodland, PC
2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 45 - 48
[8] Vietnamese Large Vocabulary Continuous Speech Recognition
Ngoc Thang Vu
Schultz, Tanja
2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 333 - 338
[9] Advances in large vocabulary continuous speech recognition
Zweig, G
Picheny, M
ADVANCES IN COMPUTERS, VOL. 60: INFORMATION SECURITY, 2004, 60 : 249 - 291
[10] Advances in Missing Feature Techniques for Robust Large-Vocabulary Continuous Speech Recognition
Van Segbroeck, Maarten
Van Hamme, Hugo
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (01): : 123 - 137

← 1 2 3 4 5 →