Using multilingual units for improved modeling of pronunciation variants

Cited by: 0

Authors:

Bartkova, K. [1]

Jouvet, D. [1]

Affiliation:

[1] France Telecom, Div R&D, TECH, SSTP, F-22307 Lannion, France

Source:

2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13 | 2006

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

Standard speech modeling generally implies the combination of models of the phonemes of the current language with a description of possible pronunciation variants of the vocabulary words. When dealing with foreign accent, this standard native speech modeling is not adequate. In fact, many variabilities have to be taken into account, as the acoustic realization of the sounds by non-native speakers does not always match native models and some phonemes may be replaced by others. By introducing models of phonemes estimated from speech data of other languages, and adding extra pronunciation variants through phonological rules, speech recognition performance improvements were achieved on non-native speech. In this study, a selection of the most frequently used variants is proposed, which relies on the frequency of usage of the various models associated with each phoneme on a development set. Although this selection process is rather simple, it provides significant performance improvement.
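
The abstract does not spell out the selection procedure, so the following is only a minimal sketch of frequency-based variant selection under stated assumptions. It assumes the development set has already been aligned so that each phoneme occurrence is paired with the acoustic unit chosen for it. The function name `select_frequent_variants`, the `min_share` threshold, and unit labels such as `r_fr` or `r_en` are hypothetical and do not come from the paper.

```python
from collections import Counter, defaultdict


def select_frequent_variants(alignments, min_share=0.1):
    """Keep, for each phoneme, the units used often enough on a development set.

    alignments: iterable of (phoneme, unit) pairs, one per aligned phoneme
                occurrence (assumed input format, not taken from the paper).
    min_share:  minimum relative frequency a unit needs, among all units
                observed for its phoneme, to be kept (hypothetical parameter).
    """
    # Count how often each unit was used for each phoneme.
    counts = defaultdict(Counter)
    for phoneme, unit in alignments:
        counts[phoneme][unit] += 1

    selected = {}
    for phoneme, unit_counts in counts.items():
        total = sum(unit_counts.values())
        # most_common() yields units from most to least frequently used.
        selected[phoneme] = [
            unit
            for unit, count in unit_counts.most_common()
            if count / total >= min_share
        ]
    return selected


# Toy development-set alignment with made-up unit labels.
dev_alignments = (
    [("r", "r_fr")] * 6
    + [("r", "r_en")] * 3
    + [("r", "r_es")] * 1
    + [("th", "s_fr")] * 3
    + [("th", "th_en")] * 1
)

selected = select_frequent_variants(dev_alignments, min_share=0.2)
print(selected)
```

In this toy data the unit `r_es` accounts for only one of ten occurrences of the phoneme `r`, so it falls below the 0.2 threshold and is dropped, while the more frequently used units are kept. A relative threshold is one possible design choice; keeping a fixed number of top units per phoneme would be another.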


Pages: 5895-5898

Number of pages: 4
