Using multilingual units for improved modeling of pronunciation variants

被引:0
|
作者
Bartkova, K. [1 ]
Jouvet, D. [1 ]
机构
[1] France Telecom, Div R&D, TECH, SSTP, F-22307 Lannion, France
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Standard speech modeling generally implies the combination of models of the phonemes of the current language with a description of possible pronunciation variants of the vocabulary words. When dealing with foreign accent, this standard native speech modeling is not adequate. In fact many variabilities have to be taken into account as the acoustic realization of the sounds by non-native speakers does not always match with native models and some phonemes may be replaced by others. By introducing models of phonemes estimated from speech data of other languages, and adding extra pronunciation variants through phonological rules, speech recognition performance improvements were achieved on non-native speech. In this study, a selection of the most frequently used variants is proposed, which relies on the frequency of usage of the various models associated to each phoneme on a development set. Although this selection process is rather simple it provides significant performance improvement.
引用
收藏
页码:5895 / 5898
页数:4
相关论文
共 50 条
  • [1] Multilingual pronunciation by analogy
    Information: Signals, Images, Systems Research Group, School of Electronics and Computer Science, University of Southampton, Southampton SO17 1BJ, United Kingdom
    不详
    Nat Lang Eng, 2008, 4 (527-546):
  • [2] Pronunciation modeling for improved spelling correction
    Toutanova, K
    Moore, RC
    40TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2002, : 144 - 151
  • [3] Multilingual recognition of non-native speech using acoustic model transformation and pronunciation modeling
    G. Bouselmi
    D. Fohr
    I. Illina
    International Journal of Speech Technology, 2012, 15 (2) : 203 - 213
  • [4] Multilingual recognition of non-native speech using acoustic model transformation and pronunciation modeling
    Bouselmi, G.
    Fohr, D.
    Illina, I.
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2012, 15 (02) : 203 - 213
  • [5] Multilingual modeling of cross-lingual spelling variants
    Krister Lindén
    Information Retrieval, 2006, 9 : 295 - 310
  • [6] Multilingual modeling of cross-lingual spelling variants
    Linden, Krister
    INFORMATION RETRIEVAL, 2006, 9 (03): : 295 - 310
  • [7] Massively Multilingual Pronunciation Mining with WikiPron
    Lee, Jackson L.
    Ashby, Lucas F. E.
    Garza, M. Elizabeth
    Lee-Sikka, Yeonju
    Miller, Sean
    Wong, Alan
    McCarthy, Arya D.
    Gorman, Kyle
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 4223 - 4228
  • [8] Category Similarity in Multilingual Pronunciation Training
    Koreman, Jacques
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2578 - 2582
  • [9] Decoupled Pronunciation and Prosody Modeling in Meta-Learning-Based Multilingual Speech Synthesis
    Peng, Yukun
    Ling, Zhenhua
    INTERSPEECH 2022, 2022, : 4257 - 4261
  • [10] Improved pronunciation prediction accuracy using morphology
    Sharma, Dravyansh
    Sahai, Saumya Yashmohini
    Chaudhari, Neha
    Bruguier, Antoine
    SIGMORPHON 2021: 18TH SIGMORPHON WORKSHOP ON COMPUTATIONAL RESEARCH IN PHONETICS, PHONOLOGY, AND MORPHOLOGY, 2021, : 222 - 228