Turbo Processing for Speech Recognition

被引：2

作者：

Moon, Todd K. ^{[1
,2
]}

Gunther, Jacob H. ^{[1
,2
]}

Broadus, Cortnie ^{[3
]}

Hou, Wendy ^{[4
]}

Nelson, Nils ^{[3
]}

机构：

[1] Utah State Univ, Informat Dynam Lab, Logan, UT 84322 USA

[2] Utah State Univ, Dept Elect & Comp Engn, Logan, UT 84322 USA

[3] Utah State Univ, Dept Math, Logan, UT 84322 USA

[4] Yale Univ, Dept Math, New Haven, CT 06511 USA

来源：

IEEE TRANSACTIONS ON CYBERNETICS | 2014年 / 44卷 / 01期

关键词：

Human-machine interface; speech processing; turbo processing; HIDDEN MARKOV-MODELS; MAXIMIZATION;

D O I：

10.1109/TCYB.2013.2247593

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Speech recognition is a classic example of a human/machine interface, typifying many of the difficulties and opportunities of human/machine interaction. In this paper, speech recognition is used as an example of applying turbo processing principles to the general problem of human/machine interface. Speech recognizers frequently involve a model representing phonemic information at a local level, followed by a language model representing information at a nonlocal level. This structure is analogous to the local (e. g., equalizer) and nonlocal (e. g., error correction decoding) elements common in digital communications. Drawing from the analogy of turbo processing for digital communications, turbo speech processing iteratively feeds back the output of the language model to be used as prior probabilities for the phonemic model. This analogy is developed here, and the performance of this turbo model is characterized by using an artificial language model. Using turbo processing, the relative error rate improves significantly, especially in high-noise settings.

引用

页码：83 / 91

页数：9

共 50 条

[1] Turbo Automatic Speech Recognition
Receveur, Simon
Weiss, Robin
Fingscheidt, Tim
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (05) : 846 - 862
[2] THE AUDITORY PROCESSING AND RECOGNITION OF SPEECH
BYRNE, W
ROBINSON, J
SHAMMA, S
SPEECH AND NATURAL LANGUAGE, 1989, : 325 - 331
[3] A COMPACT FORMULATION OF TURBO AUDIO-VISUAL SPEECH RECOGNITION
Receveur, Simon
Meyer, Patrick
Fingscheidt, Tim
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[4] Turbo Decoders for Audio-visual Continuous Speech Recognition
Abdelaziz, Ahmed Hussen
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3667 - 3671
[5] Various Speech Processing Techniques For Speech Compression And Recognition
Karam, Jalal
PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 26, PARTS 1 AND 2, DECEMBER 2007, 2007, 26 : 704 - 708
[6] LANGUAGE PROCESSING BEYOND SPEECH RECOGNITION
LINGGARD, R
BRITISH TELECOM TECHNOLOGY JOURNAL, 1988, 6 (02): : 124 - 130
[7] High performance processing for speech recognition
Ramljak, Milan
Stella, Maja
Šarić, Matko
1600, North Atlantic University Union NAUN (08): : 166 - 172
[8] Makhraj Recognition Using Speech Processing
Wahidah, A. N.
Suriazalmi, M. S.
Niza, M. L.
Rosyati, H.
Faradila, N.
Hasan, A.
Rohana, A. K.
Farizan, Z. N.
2012 7TH INTERNATIONAL CONFERENCE ON COMPUTING AND CONVERGENCE TECHNOLOGY (ICCCT2012), 2012, : 689 - 693
[9] A SYSTOLIC PROCESSING ELEMENT FOR SPEECH RECOGNITION
WESTE, NHE
BURR, DJ
ACKLAND, BD
ISSCC DIGEST OF TECHNICAL PAPERS, 1982, 25 : 274 - 275
[10] Speech Processing for Hindi Dialect Recognition
Sinha, Shweta
Jain, Aruna
Agrawal, Shyam S.
ADVANCES IN SIGNAL PROCESSING AND INTELLIGENT RECOGNITION SYSTEMS, 2014, 264 : 161 - 169

← 1 2 3 4 5 →