Linguistic knowledge and empirical methods in speech recognition

Cited by: 0

Authors:

Stolcke, A [1]

Affiliation:

[1] SRI Int, Speech Res & Technol Lab, Menlo Park, CA 94025 USA

Source:

AI MAGAZINE | 1997 / Vol. 18 / No. 4

Keywords:

DOI:

Not available

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Automatic speech recognition is one of the fastest-growing and commercially most promising applications of natural language technology. The technology has reached a point where carefully designed systems for suitably constrained applications are a reality. Commercial systems are available today for such tasks as large-vocabulary dictation and voice control of medical equipment. This article reviews how state-of-the-art speech-recognition systems combine statistical modeling, linguistic knowledge, and machine learning to achieve their performance and points out some of the research issues in the field.


Pages: 25-31

Page count: 7
