Linguistic knowledge and empirical methods in speech recognition

Cited by: 0
Authors
Stolcke, A [1]
Affiliations
[1] SRI Int, Speech Res & Technol Lab, Menlo Park, CA 94025 USA
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Automatic speech recognition is one of the fastest-growing and commercially most promising applications of natural language technology. The technology has achieved a point where carefully designed systems for suitably constrained applications are a reality. Commercial systems are available today for such tasks as large-vocabulary dictation and voice control of medical equipment. This article reviews how state-of-the-art speech-recognition systems combine statistical modeling, linguistic knowledge, and machine learning to achieve their performance and points out some of the research issues in the field.
Pages: 25-31
Page count: 7
Related papers
50 in total
  • [41] Combining stochastic and linguistic language models for recognition of spontaneous speech
    Eckert, W
    Gallwitz, F
    Niemann, H
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 423 - 426
  • [42] The role of linguistic and indexical information in improved recognition of dysarthric speech
    Borrie, Stephanie A.
    McAuliffe, Megan J.
    Liss, Julie M.
    O'Beirne, Greg A.
    Anderson, Tim J.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2013, 133 (01): : 474 - 482
  • [43] Linguistic-Enhanced Transformer with CTC Embedding for Speech Recognition
    Zhang, Xulong
    Wang, Jianzong
    Cheng, Ning
    Zhao, Mengyuan
    Zhang, Zhiyong
    Xiao, Jing
    2022 18TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING, MSN, 2022, : 915 - 920
  • [44] Emotion Recognition from Speech using Prosodic and Linguistic Features
    Pervaiz, Mahwish
    Khan, Tamim Ahmed
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (08) : 84 - 90
  • [45] Effect of acoustic and linguistic contexts on human and machine speech recognition
    Kitaoka, Norihide
    Enami, Daisuke
    Nakagawa, Seiichi
    COMPUTER SPEECH AND LANGUAGE, 2014, 28 (03): : 769 - 787
  • [46] Personalised Emotion Recognition Utilising Speech Signal and Linguistic Cues
    Ramya, H. R.
    Bhatt, Mahabaleswara Ram
    2019 11TH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS & NETWORKS (COMSNETS), 2019, : 856 - 860
  • [47] EXPLOITING SPEECH KNOWLEDGE IN NEURAL NETS FOR RECOGNITION
    HUCKVALE, M
    SPEECH COMMUNICATION, 1990, 9 (01) : 1 - 13
  • [48] Improving part of speech disambiguation rules by adding linguistic knowledge
    Lindberg, N
    Eineborg, M
    INDUCTIVE LOGIC PROGRAMMING, 1999, 1634 : 186 - 197
  • [49] Knowledge Distillation for Throat Microphone Speech Recognition
    Suzuki, Takahito
    Ogata, Jun
    Tsunakawa, Takashi
    Nishida, Masafumi
    Nishimura, Masafumi
    INTERSPEECH 2019, 2019, : 461 - 465
  • [50] Incorporating knowledge sources into statistical speech recognition
    NICT/ATR Spoken Language, Communication Research Laboratories, Keihanna Science City Kyoto, Japan
    Unknown
    Lect. Notes Electr. Eng., 2009, (1-218):