Large-vocabulary recognition

被引:1
|
作者
Dugast, C
机构
[1] Philips GmbH Forschungslaboratorien Aachen, D-52021 Aachen
关键词
continuous-speech recognition; free syntax; dictation system; vocabulary selection; on-line adaptation; domain;
D O I
10.1016/0165-5817(96)81585-3
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Large-vocabulary continuous-speech recognition (CSR) technology is at work. As an application of the technology, we will describe a dictation system (DS). Input to the system is unrestricted spontaneous speech. No adaptation, no special skills are required to use the system. The DS transforms continuous speech into written text. It is essential in this application that the user is free to speak as he or she usually does and should be free to use his or her own wording and formulation. This implies speech recognition for large and open vocabularies, free syntax, continuous speech. The aim of the paper is an attempt to determine what is feasible with today's technology and what will be feasible in the near future. The problems addressed are: what are the limits of today's technology, what is needed to make the next step, i.e. going towards real industrialization of CSR technology.
引用
收藏
页码:353 / 366
页数:14
相关论文
共 50 条
  • [31] An efficient search method for large-vocabulary continuous-speech recognition
    Hanazawa, K
    Minami, Y
    Furui, S
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1787 - 1790
  • [32] Large-vocabulary Audio-visual Speech Recognition in Noisy Environments
    Yu, Wentao
    Zeiler, Steffen
    Kolossa, Dorothea
    IEEE MMSP 2021: 2021 IEEE 23RD INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2021,
  • [33] A segmental framework for fully-unsupervised large-vocabulary speech recognition
    Kamper, Herman
    Jansen, Aren
    Goldwater, Sharon
    COMPUTER SPEECH AND LANGUAGE, 2017, 46 : 154 - 174
  • [34] Multimodal Integration for Large-Vocabulary Audio-Visual Speech Recognition
    Yu, Wentao
    Zeiler, Steffen
    Kolossa, Dorothea
    28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 341 - 345
  • [35] Large-vocabulary speech recognition system for Taiwanese (Min-nan)
    Lyu, Ren-yuan
    Chiang, Yuang-chin
    Hsieh, Wen-ping
    Fang, Ren-zhou
    Chen, Chih-yu
    Journal of the Chinese Institute of Electrical Engineering, Transactions of the Chinese Institute of Engineers, Series E/Chung KuoTien Chi Kung Chieng Hsueh K'an, 2000, 7 (02): : 123 - 136
  • [36] ADAPTABLE PHONEME-BASED MODELS FOR LARGE-VOCABULARY SPEECH RECOGNITION
    BAMBERG, PG
    MANDEL, MA
    SPEECH COMMUNICATION, 1991, 10 (5-6) : 437 - 451
  • [37] Chinese large-vocabulary name recognition system using character description and syllable spelling recognition
    Wang, NJC
    Tsai, CH
    Huang, P
    Shen, JL
    2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 17 - 20
  • [38] Creating word-level language models for large-vocabulary handwriting recognition
    John F. Pitrelli
    Amit Roy
    International Journal on Document Analysis and Recognition, 2003, 5 (2) : 126 - 137
  • [39] Speaker clustering and transformation for speaker adaptation in large-vocabulary speech recognition systems
    Padmanabhan, M
    Bahl, LR
    Nahamoo, D
    Picheny, MA
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 701 - 704
  • [40] Comparative Experiments to Evaluate the Use of Syllables for Large-Vocabulary Automatic Speech Recognition
    Tolba, Hesham
    Azmi, Mohamed
    2009 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, VOL 1, 2009, : 250 - +