Speech production knowledge in automatic speech recognition

被引：130

作者：

King, Simon

Frankel, Joe

Livescu, Karen

McDermott, Erik

Richmond, Korin

Wester, Mirjam

机构：

[1] Univ Edinburgh, Ctr Speech Technol Res, Edinburgh EH8 9LW, Midlothian, Scotland

[2] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA

[3] NTT Corp, Commun Sci Labs, Kyoto 6190237, Japan

来源：

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA | 2007年 / 121卷 / 02期

基金：

英国工程与自然科学研究理事会;

关键词：

D O I：

10.1121/1.2404622

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Although much is known about. how speech is produced, and research into speech production has resulted in measured articulatory data, feature systems of different kinds, and numerous models, speech production knowledge is almost totally ignored in current mainstream approaches to automatic speech recognition. Representations of speech production allow simple explanations for many phenomena observed in speech which cannot be easily analyzed from either acoustic signal or phonetic transcription alone. In this article, a survey of a growing body of work in which such representations are used to improve automatic speech recognition is provided. (c) 2007 Acoustical Society of America.

引用

页码：723 / 742

页数：20

共 50 条

[31] Topological invariants as speech features for automatic speech recognition
Kacur, Juraj
Chudy, Vladimir
INTERNATIONAL JOURNAL OF SIGNAL AND IMAGING SYSTEMS ENGINEERING, 2014, 7 (04) : 235 - 244
[32] Creation of Marathi Speech Corpus for Automatic Speech Recognition
Gaikwad, Santosh
Gawali, Bharti
Mehrotra, Suresh
2013 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2013 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE), 2013,
[33] DUAL APPLICATION OF SPEECH ENHANCEMENT FOR AUTOMATIC SPEECH RECOGNITION
Pandey, Ashutosh
Liu, Chunxi
Wang, Yun
Saraf, Yatharth
2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 223 - 228
[34] SPEECH DISFLUENCIES MODELING IN AUTOMATIC SPEECH RECOGNITION SYSTEMS
Vasilisa, Verkhodanova O.
Alexey, Karpov A.
TOMSK STATE UNIVERSITY JOURNAL, 2012, (363): : 10 - +
[35] Evaluation of an Automatic Speech Recognition Platform for Dysarthric Speech
Calvo, Irene
Tropea, Peppino
Vigano, Mauro
Scialla, Maria
Cavalcante, Agnieszka B.
Grajzer, Monika
Gilardone, Marco
Corbo, Massimo
FOLIA PHONIATRICA ET LOGOPAEDICA, 2021, 73 (05) : 432 - 441
[36] Automatic Speech Recognition Performance for Training on Noised Speech
Prodeus, Arkadiy
Kukharicheva, Kateryna
2017 2ND IEEE INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION AND COMMUNICATION TECHNOLOGIES-2017 (AICT 2017), 2017, : 71 - 74
[37] AN APPROACH TO THE AUTOMATIC RECOGNITION OF SPEECH
PAY, BE
EVANS, CR
INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1981, 14 (01): : 13 - 27
[38] PROSPECTS FOR AUTOMATIC RECOGNITION OF SPEECH
HOUDE, R
AMERICAN ANNALS OF THE DEAF, 1979, 124 (05) : 568 - 572
[39] Automatic speech recognition systems
Catariov, A
Information Technologies 2004, 2004, 5822 : 83 - 93
[40] Automatic speech recognition: A review
Haton, JP
ENTERPRISE INFORMATION SYSTEMS V, 2004, : 6 - 11

← 1 2 3 4 5 →