Using multilingual units for improved modeling of pronunciation variants

Cited by: 0
Authors
Bartkova, K. [1 ]
Jouvet, D. [1 ]
Affiliation
[1] France Telecom, Div R&D, TECH, SSTP, F-22307 Lannion, France
Keywords
DOI
Not available
Chinese Library Classification
O42 [Acoustics];
Subject classification codes
070206 ; 082403 ;
Abstract
Standard speech modeling generally combines models of the phonemes of the current language with a description of the possible pronunciation variants of the vocabulary words. For foreign-accented speech, this standard native speech modeling is not adequate: many sources of variability have to be taken into account, because the acoustic realization of sounds by non-native speakers does not always match the native models, and some phonemes may be replaced by others. By introducing models of phonemes estimated from speech data of other languages, and by adding extra pronunciation variants through phonological rules, speech recognition performance improvements were achieved on non-native speech. This study proposes a selection of the most frequently used variants, based on how often each of the models associated with a phoneme is used on a development set. Although this selection process is rather simple, it provides a significant performance improvement.
Pages: 5895-5898
Page count: 4
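Illustrative sketch
The frequency-based selection mentioned in the abstract could look roughly like the Python below. This is a minimal sketch under stated assumptions, not the authors' implementation: the abstract does not give the selection criterion, so the relative-frequency threshold `min_share`, the function name `select_frequent_variants`, and the model labels (`fr_a`, `en_ae`, and so on) are all hypothetical. The input is assumed to be a list of (phoneme, model used) pairs obtained by aligning a development set.

```python
from collections import Counter, defaultdict


def select_frequent_variants(alignments, min_share=0.2):
    """Keep, for each phoneme, the models used often enough on the dev set.

    alignments: iterable of (phoneme, model_label) pairs (hypothetical format).
    min_share:  assumed threshold on the relative usage frequency of a model.
    """
    # Count how often each model was chosen for each phoneme.
    counts = defaultdict(Counter)
    for phoneme, model in alignments:
        counts[phoneme][model] += 1

    selected = {}
    for phoneme, model_counts in counts.items():
        total = sum(model_counts.values())
        # most_common() orders models from most to least frequently used.
        selected[phoneme] = [
            model
            for model, count in model_counts.most_common()
            if count / total >= min_share
        ]
    return selected


# Toy development-set alignment (invented counts, for illustration only).
dev_alignments = (
    [("a", "fr_a")] * 7 + [("a", "en_ae")] * 2 + [("a", "de_a")] * 1
    + [("r", "en_r")] * 5 + [("r", "fr_R")] * 4 + [("r", "de_R")] * 1
)

variants = select_frequent_variants(dev_alignments, min_share=0.2)
print(variants)
```

With these invented counts, the rarely used models `de_a` and `de_R` fall below the assumed threshold and are dropped, while the more frequently used native and foreign models are kept for each phoneme.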