Large-vocabulary recognition

Cited by: 1
Author
Dugast, C
Affiliation
[1] Philips GmbH Forschungslaboratorien Aachen, D-52021 Aachen
Keywords
continuous-speech recognition; free syntax; dictation system; vocabulary selection; on-line adaptation; domain
DOI
10.1016/0165-5817(96)81585-3
Chinese Library Classification number
T [Industrial Technology]
Subject classification code
08
Abstract
Large-vocabulary continuous-speech recognition (CSR) technology is at work. As an application of the technology, we will describe a dictation system (DS). Input to the system is unrestricted spontaneous speech. No adaptation, no special skills are required to use the system. The DS transforms continuous speech into written text. It is essential in this application that the user is free to speak as he or she usually does and should be free to use his or her own wording and formulation. This implies speech recognition for large and open vocabularies, free syntax, continuous speech. The aim of the paper is an attempt to determine what is feasible with today's technology and what will be feasible in the near future. The problems addressed are: what are the limits of today's technology, what is needed to make the next step, i.e. going towards real industrialization of CSR technology.
Pages: 353-366
Page count: 14