Linguistic knowledge and empirical methods in speech recognition

被引:0
|
作者
Stolcke, A [1 ]
机构
[1] SRI Int, Speech Res & Technol Lab, Menlo Park, CA 94025 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic speech recognition is one of the fastest growing and commercially mast promising applications of natural language technology. The technology has achieved a point where carefully designed systems for suitably constrained applications are a reality. Commercial systems are available today for such tasks as large-vocabulary dictation and voice control of medical equipment. This article reviews how state-of-the-art speech-recognition systems combine statistical modeling, linguistic knowledge, and machine learning to achieve their performance and points out some of the research issues in the field.
引用
收藏
页码:25 / 31
页数:7
相关论文
共 50 条
  • [31] STRUCTURAL METHODS IN AUTOMATIC SPEECH RECOGNITION
    LEVINSON, SE
    PROCEEDINGS OF THE IEEE, 1985, 73 (11) : 1625 - 1650
  • [32] SPEECH RECOGNITION USING CONNECTIONIST METHODS
    WELLEKENS, CJ
    CONNECTIONISM IN PERSPECTIVE, 1989, : 103 - 111
  • [33] The development of analysis methods for speech recognition
    Hoffmann, R
    Westendorf, CM
    BEHAVIOURAL PROCESSES, 1997, 39 (02) : 113 - 125
  • [34] Kernel Approximation Methods for Speech Recognition
    May, Avner
    Garakani, Alireza Bagheri
    Lu, Zhiyun
    Guo, Dong
    Liu, Kuan
    Bellet, Aurelien
    Fan, Linxi
    Collins, Michael
    Hsu, Daniel
    Kingsbury, Brian
    Picheny, Michael
    Sha, Fei
    JOURNAL OF MACHINE LEARNING RESEARCH, 2019, 20
  • [35] TRAINING AND SEARCH METHODS FOR SPEECH RECOGNITION
    JELINEK, F
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1995, 92 (22) : 9964 - 9969
  • [36] Kernel approximation methods for speech recognition
    May, Avner
    Garakani, Alireza Bagheri
    Lu, Zhiyun
    Guo, Dong
    Liu, Kuan
    Bellet, Aurélien
    Fan, Linxi
    Collins, Michael
    Hsu, Daniel
    Kingsbury, Brian
    Picheny, Michael
    Sha, Fei
    Journal of Machine Learning Research, 2019, 20
  • [37] Methods of recognition in the space of knowledge
    Vikentiev, A. A.
    Ivanov, V. V.
    BULLETIN OF THE KARAGANDA UNIVERSITY-MATHEMATICS, 2016, 81 (01): : 26 - 34
  • [38] Multistage linguistic conditioning of convolutional layers for speech emotion recognition
    Triantafyllopoulos, Andreas
    Reichel, Uwe
    Liu, Shuo
    Huber, Stephan
    Eyben, Florian
    Schuller, Bjoern W.
    FRONTIERS IN COMPUTER SCIENCE, 2023, 5
  • [39] The use of subword linguistic modeling for multiple tasks in speech recognition
    Seneff, S
    SPEECH COMMUNICATION, 2004, 42 (3-4) : 373 - 390
  • [40] The role of linguistic and indexical information in improved recognition of dysarthric speech
    Borrie, S.A. (steph.borrie@gmail.com), 1600, Acoustical Society of America (133):