Large vocabulary speech recognition in French

被引:1
|
作者
Adda-Decker, M [1 ]
Adda, G [1 ]
Gauvain, JL [1 ]
Lamel, L [1 ]
机构
[1] LIMSI, CNRS, Spoken Language Proc Grp, F-91403 Orsay, France
关键词
D O I
10.1109/ICASSP.1999.758058
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this contribution we present some design considerations concerning our large vocabulary continuous speech recognition system in French.(1) The impact of the epoch of the text training material on lexical coverage, language model perplexity and recognition performance on newspaper texts is demonstrated. The effectiveness of larger vocabulary sizes and larger text training corpora for language modeling is investigated. French is a highly inflected language producing large lexical variety and a high homophone rate. About 30% of recognition errors are shown to be due to substitutions between inflected forms of a given root form. When word error rates are analysed as a function of word frequency, a significant increase in the error rate can be measured for frequency ranks above 5000.
引用
收藏
页码:45 / 48
页数:4
相关论文
共 50 条
  • [31] Recent experiments in Large Vocabulary Conversational Speech Recognition
    Billa, J
    Colhurst, T
    El-Jaroudi, A
    Iyer, R
    Ma, K
    Matsoukas, S
    Quillen, C
    Richardson, F
    Siu, M
    Zavaliagkos, G
    Gish, H
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 41 - 44
  • [32] Confidence measures for large vocabulary continuous speech recognition
    Wessel, F
    Schlüter, R
    Macherey, K
    Ney, H
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (03): : 288 - 298
  • [33] Boosting acoustic models in large vocabulary speech recognition
    Meyer, C
    Schramm, H
    PROCEEDINGS OF THE SIXTH IASTED INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING, 2004, : 255 - 260
  • [34] CONNECTIONIST APPROACHES TO LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION
    SAWAI, H
    MINAMI, Y
    MIYATAKE, M
    WAIBEL, A
    SHIKANO, K
    IEICE TRANSACTIONS ON COMMUNICATIONS ELECTRONICS INFORMATION AND SYSTEMS, 1991, 74 (07): : 1834 - 1844
  • [35] Towards speech rate independence in large vocabulary continuous speech recognition
    Martinez, F
    Tapias, D
    Alvarez, J
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 725 - 728
  • [36] ARTICULATORY TRAJECTORIES FOR LARGE-VOCABULARY SPEECH RECOGNITION
    Mitra, Vikramjit
    Wang, Wen
    Stolcke, Andreas
    Nam, Hosung
    Richey, Colleen
    Yuan, Jiahong
    Liberman, Mark
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7145 - 7149
  • [37] Recent Developments in Large Vocabulary Continuous Speech Recognition
    Saon, George
    Chien, Jen-Tzung
    2012 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2012,
  • [38] Development of Large Vocabulary Continuous Speech Recognition for Polish
    Demenko, G.
    Szymanski, M.
    Cecko, R.
    Kusmierek, E.
    Lange, M.
    Wegner, K.
    Klessa, K.
    Owsianny, M.
    ACTA PHYSICA POLONICA A, 2012, 121 (1A) : A86 - A91
  • [39] A Myanmar Large Vocabulary Continuous Speech Recognition System
    Naing, Hay Mar Soe
    Hlaing, Aye Mya
    Pa, Win Pa
    Hu, Xinhui
    Thu, Ye Kyaw
    Hori, Chiori
    Kawai, Hisashi
    2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 320 - 327
  • [40] Investigation on large vocabulary continuous Kannada speech recognition
    Vanajakshi, Puttaswamy Gowda
    Mathivanan, M.
    Kumaran, T. Senthil
    INTERNATIONAL JOURNAL OF BIOMEDICAL ENGINEERING AND TECHNOLOGY, 2021, 36 (01) : 1 - 24