Large-vocabulary recognition

Cited by: 1
Author
Dugast, C
Affiliation
[1] Philips GmbH Forschungslaboratorien Aachen, D-52021 Aachen
Keywords
continuous-speech recognition; free syntax; dictation system; vocabulary selection; on-line adaptation; domain
DOI
10.1016/0165-5817(96)81585-3
Chinese Library Classification
T [Industrial Technology]
Discipline classification code
08
Abstract
Large-vocabulary continuous-speech recognition (CSR) technology is at work. As an application of the technology, we will describe a dictation system (DS). Input to the system is unrestricted spontaneous speech. No adaptation, no special skills are required to use the system. The DS transforms continuous speech into written text. It is essential in this application that the user is free to speak as he or she usually does and should be free to use his or her own wording and formulation. This implies speech recognition for large and open vocabularies, free syntax, continuous speech. The aim of the paper is an attempt to determine what is feasible with today's technology and what will be feasible in the near future. The problems addressed are: what are the limits of today's technology, what is needed to make the next step, i.e. going towards real industrialization of CSR technology.
Pages: 353 - 366
Page count: 14
Related papers
50 in total
  • [1] Large-vocabulary recognition
    Dugast, C.
    Philips Journal of Research, 49 (04): 353 - 366
  • [2] Large-vocabulary speech recognition algorithms
    Padmanabhan, M
    Picheny, M
    COMPUTER, 2002, 35 (04): 42 - +
  • [3] SPEECH RECOGNITION FOR LARGE-VOCABULARY SYSTEMS
    JACOB, B
    ANDREOBRECHT, R
    JOURNAL DE PHYSIQUE IV, 1994, 4 (C5): 489 - 492
  • [4] Large-Vocabulary Continuous Speech Recognition Systems
    Saon, George
    Chien, Jen-Tzung
    IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06): 18 - 33
  • [5] Results on large-vocabulary isolated word recognition
    Codogno, Maurizio
    Fissore, Luciano
    CSELT Technical Reports, 1988, 16 (07): 611 - 614
  • [6] ARTICULATORY TRAJECTORIES FOR LARGE-VOCABULARY SPEECH RECOGNITION
    Mitra, Vikramjit
    Wang, Wen
    Stolcke, Andreas
    Nam, Hosung
    Richey, Colleen
    Yuan, Jiahong
    Liberman, Mark
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013: 7145 - 7149
  • [7] Tandem acoustic modeling in large-vocabulary recognition
    Ellis, DPW
    Singh, R
    Sivadas, S
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001: 517 - 520
  • [8] Network optimizations for large-vocabulary speech recognition
    Mohri, M
    Riley, M
    SPEECH COMMUNICATION, 1999, 28 (01): 1 - 12
  • [9] SUBWORD-BASED LARGE-VOCABULARY SPEECH RECOGNITION
    LEE, CH
    GAUVAIN, JL
    PIERACCINI, R
    RABINER, LR
    AT&T TECHNICAL JOURNAL, 1993, 72 (05): 25 - 36
  • [10] Recognition time reduction algorithm for large-vocabulary speech recognition
    Koo, J.M.
    Un, C.K.
    Speech Communication, 1992, 10 (01): 45 - 50