Large vocabulary speech recognition in French

Cited by: 1
Authors
Adda-Decker, M [1]
Adda, G [1]
Gauvain, JL [1]
Lamel, L [1]
Affiliation
[1] LIMSI, CNRS, Spoken Language Proc Grp, F-91403 Orsay, France
DOI
10.1109/ICASSP.1999.758058
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
In this contribution we present design considerations for our large vocabulary continuous speech recognition system for French. The impact of the epoch of the text training material on lexical coverage, language model perplexity, and recognition performance on newspaper texts is demonstrated. The effectiveness of larger vocabulary sizes and larger text training corpora for language modeling is investigated. French is a highly inflected language, which produces large lexical variety and a high homophone rate. About 30% of recognition errors are shown to be due to substitutions between inflected forms of a given root form. When word error rates are analysed as a function of word frequency, a significant increase in the error rate can be measured for frequency ranks above 5000.
Pages: 45-48
Number of pages: 4