Large-vocabulary recognition

被引:1
|
作者
Dugast, C
机构
[1] Philips GmbH Forschungslaboratorien Aachen, D-52021 Aachen
关键词
continuous-speech recognition; free syntax; dictation system; vocabulary selection; on-line adaptation; domain;
D O I
10.1016/0165-5817(96)81585-3
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Large-vocabulary continuous-speech recognition (CSR) technology is at work. As an application of the technology, we will describe a dictation system (DS). Input to the system is unrestricted spontaneous speech. No adaptation, no special skills are required to use the system. The DS transforms continuous speech into written text. It is essential in this application that the user is free to speak as he or she usually does and should be free to use his or her own wording and formulation. This implies speech recognition for large and open vocabularies, free syntax, continuous speech. The aim of the paper is an attempt to determine what is feasible with today's technology and what will be feasible in the near future. The problems addressed are: what are the limits of today's technology, what is needed to make the next step, i.e. going towards real industrialization of CSR technology.
引用
收藏
页码:353 / 366
页数:14
相关论文
共 50 条
  • [21] Pre-Initialized Composition For Large-Vocabulary Speech Recognition
    Allauzen, Cyril
    Riley, Michael
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 666 - 670
  • [22] Compound words in large-vocabulary German speech recognition systems
    Berton, A
    Fetter, P
    RegelBrietzmann, P
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1165 - 1168
  • [23] Large-vocabulary spontaneous speech recognition using a corpus of lectures
    Nishimura, M
    Itoh, N
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 2003, 86 (08): : 52 - 60
  • [24] Acoustic models of the elderly for large-vocabulary continuous speech recognition
    Baba, A
    Yoshizawa, S
    Yamada, M
    Lee, A
    Shikano, K
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART II-ELECTRONICS, 2004, 87 (07): : 49 - 57
  • [25] Combining spectral representations for large-vocabulary continuous speech recognition
    Garau, Giulia
    Renals, Steve
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (03): : 508 - 518
  • [26] A COMMERCIAL LARGE-VOCABULARY DISCRETE SPEECH RECOGNITION SYSTEM - DRAGONDICTATE
    MANDEL, MA
    LANGUAGE AND SPEECH, 1992, 35 : 237 - 246
  • [28] A HIERARCHICAL DECISION APPROACH TO LARGE-VOCABULARY DISCRETE UTTERANCE RECOGNITION
    KANEKO, T
    DIXON, NR
    IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1983, 31 (05): : 1061 - 1066
  • [29] L-Sign: Large-Vocabulary Sign Gestures Recognition System
    Zheng, Zhiwen
    Wang, Qingshan
    Yang, Dejun
    Wang, Qi
    Huang, Wei
    Xu, Yinlong
    IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2022, 52 (02) : 290 - 301
  • [30] Rapid Nonlinear Speaker Adaptation for Large-Vocabulary Continuous Speech Recognition
    Roupakia, Zoi
    Ragni, Anton
    Gales, Mark
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1782 - 1785