Using multilingual units for improved modeling of pronunciation variants

被引:0
|
作者
Bartkova, K. [1 ]
Jouvet, D. [1 ]
机构
[1] France Telecom, Div R&D, TECH, SSTP, F-22307 Lannion, France
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Standard speech modeling generally implies the combination of models of the phonemes of the current language with a description of possible pronunciation variants of the vocabulary words. When dealing with foreign accent, this standard native speech modeling is not adequate. In fact many variabilities have to be taken into account as the acoustic realization of the sounds by non-native speakers does not always match with native models and some phonemes may be replaced by others. By introducing models of phonemes estimated from speech data of other languages, and adding extra pronunciation variants through phonological rules, speech recognition performance improvements were achieved on non-native speech. In this study, a selection of the most frequently used variants is proposed, which relies on the frequency of usage of the various models associated to each phoneme on a development set. Although this selection process is rather simple it provides significant performance improvement.
引用
收藏
页码:5895 / 5898
页数:4
相关论文
共 50 条
  • [21] Multiword units in multilingual speakers
    Hennecke, Inga
    Perevozchikova, Tatiana
    Wiesinger, Evelyn
    INTERNATIONAL JOURNAL OF BILINGUALISM, 2025, 29 (02) : 329 - 346
  • [22] On Pronunciation in a Multilingual Dictionary: The Case of Lukumi, Olukumi and Yoruba Dictionary
    Uguru, Joy O.
    Okeke, Chukwuma O.
    LEXIKOS, 2020, 30 : 519 - 539
  • [23] A Web-Based Tool for Developing Multilingual Pronunciation Lexicons
    Ainsley, Samantha
    Ha, Linne
    Jansehe, Martin
    Kim, Ara
    Nanzawa, Masayuki
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 3338 - +
  • [24] Improved pronunciation modelling by inverse word frequency and pronunciation entropy
    Tsai, MY
    Chou, FC
    Lee, LS
    ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 53 - 56
  • [25] Nigerian English pronunciation preferences: A corpus-based survey of pronunciation variants
    Oladipupo, Rotimi
    Akinola, Aderonke
    COGENT ARTS & HUMANITIES, 2022, 9 (01):
  • [26] Pronunciation modeling using a finite-state transducer representation
    Hazen, TJ
    Hetherington, IL
    Shu, H
    Livescu, K
    SPEECH COMMUNICATION, 2005, 46 (02) : 189 - 203
  • [27] Developing Consistent Pronunciation Models for Phonemic Variants
    Davel, Marelie
    Barnard, Etienne
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1260 - 1263
  • [28] Multilingual context-based pronunciation learning for Text-to-Speech
    Comini, Giulia
    Ribeiro, Manuel Sam
    Yang, Fan
    Shim, Heereen
    Lorenzo-Trueba, Jaime
    INTERSPEECH 2023, 2023, : 631 - 635
  • [29] Pronunciation Modeling for Malaysian English
    Khaw, Yen-Min
    Tan, Tien-Ping
    2012 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2012), 2012, : 153 - 156
  • [30] FSM-Based Pronunciation Modeling using Articulatory Phonological Code
    Hu, Chi
    Zhuang, Xiaodan
    Hasegawa-Johnson, Mark
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2274 - 2277