Using multilingual units for improved modeling of pronunciation variants

Authors
Bartkova, K. [1 ]
Jouvet, D. [1 ]
Affiliation
[1] France Telecom, Div R&D, TECH, SSTP, F-22307 Lannion, France
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
Standard speech modeling generally combines models of the phonemes of the current language with a description of the possible pronunciation variants of the vocabulary words. For foreign-accented speech, this standard native speech modeling is not adequate: many sources of variability have to be taken into account, because the acoustic realization of sounds by non-native speakers does not always match the native models, and some phonemes may be replaced by others. By introducing models of phonemes estimated from speech data of other languages, and by adding extra pronunciation variants through phonological rules, speech recognition performance improvements were achieved on non-native speech. In this study, a selection of the most frequently used variants is proposed, which relies on how often each of the models associated with a phoneme is used on a development set. Although this selection process is rather simple, it provides a significant performance improvement.
Pages: 5895-5898
Number of pages: 4
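The abstract describes selecting variants by how often each model associated with a phoneme is used on a development set. The record gives no implementation details, so the following is only a minimal sketch of one way such a frequency-based selection could look. The input format (pairs of phoneme and model label), the function name `select_variants`, the relative-frequency threshold `min_rel_freq`, and the example labels such as `fr_r` are all assumptions made for illustration, not the authors' method.

```python
from collections import Counter, defaultdict


def select_variants(alignments, min_rel_freq=0.2):
    """Keep, for each phoneme, the models used often enough on a dev set.

    alignments: iterable of (phoneme, model) pairs, e.g. taken from an
        alignment of development data (hypothetical input format).
    min_rel_freq: assumed threshold on the relative usage frequency.
    """
    # Count how often each model was chosen for each phoneme.
    counts = defaultdict(Counter)
    for phoneme, model in alignments:
        counts[phoneme][model] += 1

    selected = {}
    for phoneme, model_counts in counts.items():
        total = sum(model_counts.values())
        ranked = model_counts.most_common()  # most frequent first
        kept = [m for m, c in ranked if c / total >= min_rel_freq]
        # Always keep at least the most frequent model for the phoneme.
        selected[phoneme] = kept or [ranked[0][0]]
    return selected


if __name__ == "__main__":
    # Toy usage counts with made-up model labels.
    dev = (
        [("r", "fr_r")] * 6 + [("r", "en_r")] * 3 + [("r", "de_r")] * 1
        + [("th", "fr_s")] * 5 + [("th", "en_th")] * 4 + [("th", "fr_z")] * 1
    )
    print(select_variants(dev, min_rel_freq=0.2))
```

In this toy data the rarely used models (`de_r`, `fr_z`) fall below the threshold and are dropped, while the fallback guarantees that every phoneme keeps at least one model. A top-N cutoff per phoneme would be an equally plausible reading of "most frequently used variants".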