An overview of decoding techniques for large vocabulary continuous speech recognition

被引：56

作者：

Aubert, XL ^{[1
]}

机构：

[1] Philips Res Labs, D-52066 Aachen, Germany

来源：

COMPUTER SPEECH AND LANGUAGE | 2002年 / 16卷 / 01期

关键词：

D O I：

10.1006/csla.2001.0185

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A number of decoding strategies for large vocabulary continuous speech recognition (LVCSR) are examined from the viewpoint of their search space representation. Different design solutions are compared with respect to the integration of linguistic and acoustic constraints, as implied by m-gram language models (LM) and cross-word (CW) phonetic contexts. This study is structured along two main axes: the network expansion and the search algorithm itself. The network can be expanded statically or dynamically while the search can proceed either time-synchronously or asynchronously which leads to distinct architectures. Three broad classes of decoding methods are briefly reviewed: the use of weighted finite state transducers (WFST) for static network expansion, the time-synchronous dynamic-expansion search and the asynchronous stack decoding. Heuristic methods for further reducing the search space are also considered. The main approaches are compared and some prospective views are formulated regarding possible future avenues. (C) 2002 Academic Press.

引用

页码：89 / 114

页数：26

共 50 条

[21] Experimenting with lipreading for large vocabulary continuous speech recognition
Karel Paleček
Journal on Multimodal User Interfaces, 2018, 12 : 309 - 318
[22] Recent Developments in Large Vocabulary Continuous Speech Recognition
Saon, George
Chien, Jen-Tzung
2012 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2012,
[23] Development of Large Vocabulary Continuous Speech Recognition for Polish
Demenko, G.
Szymanski, M.
Cecko, R.
Kusmierek, E.
Lange, M.
Wegner, K.
Klessa, K.
Owsianny, M.
ACTA PHYSICA POLONICA A, 2012, 121 (1A) : A86 - A91
[24] A Myanmar Large Vocabulary Continuous Speech Recognition System
Naing, Hay Mar Soe
Hlaing, Aye Mya
Pa, Win Pa
Hu, Xinhui
Thu, Ye Kyaw
Hori, Chiori
Kawai, Hisashi
2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 320 - 327
[25] Investigation on large vocabulary continuous Kannada speech recognition
Vanajakshi, Puttaswamy Gowda
Mathivanan, M.
Kumaran, T. Senthil
INTERNATIONAL JOURNAL OF BIOMEDICAL ENGINEERING AND TECHNOLOGY, 2021, 36 (01) : 1 - 24
[26] Towards speech rate independence in large vocabulary continuous speech recognition
Martinez, F
Tapias, D
Alvarez, J
PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 725 - 728
[27] Parallel Scalability in Speech Recognition Inference engines in large vocabulary continuous speech recognition
You, Kisun
Chong, Jike
Yi, Youngmin
Gonina, Ekaterina
Hughes, Christopher J.
Chen, Yen-Kuang
Sung, Wonyong
Keutzer, Kurt
IEEE SIGNAL PROCESSING MAGAZINE, 2009, 26 (06) : 124 - 135
[28] A Segmental CRF Approach to Large Vocabulary Continuous Speech Recognition
Zweig, Geoffrey
Nguyen, Patrick
2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 152 - 157
[29] A large vocabulary continuous speech recognition system for Persian language
Sameti, Hossein
Veisi, Hadi
Bahrani, Mohammad
Babaali, Bagher
Hosseinzadeh, Khosro
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2011, : 1 - 12
[30] A review of large-vocabulary continuous-speech recognition
Young, S
IEEE SIGNAL PROCESSING MAGAZINE, 1996, 13 (05) : 45 - 57

← 1 2 3 4 5 →