Using multilingual units for improved modeling of pronunciation variants

被引：0

作者：

Bartkova, K. ^{[1
]}

Jouvet, D. ^{[1
]}

机构：

[1] France Telecom, Div R&D, TECH, SSTP, F-22307 Lannion, France

来源：

2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13 | 2006年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Standard speech modeling generally implies the combination of models of the phonemes of the current language with a description of possible pronunciation variants of the vocabulary words. When dealing with foreign accent, this standard native speech modeling is not adequate. In fact many variabilities have to be taken into account as the acoustic realization of the sounds by non-native speakers does not always match with native models and some phonemes may be replaced by others. By introducing models of phonemes estimated from speech data of other languages, and adding extra pronunciation variants through phonological rules, speech recognition performance improvements were achieved on non-native speech. In this study, a selection of the most frequently used variants is proposed, which relies on the frequency of usage of the various models associated to each phoneme on a development set. Although this selection process is rather simple it provides significant performance improvement.

引用

页码：5895 / 5898

页数：4

共 50 条

[1] Multilingual pronunciation by analogy
Information: Signals, Images, Systems Research Group, School of Electronics and Computer Science, University of Southampton, Southampton SO17 1BJ, United Kingdom
不详
Nat Lang Eng, 2008, 4 (527-546):
[2] Pronunciation modeling for improved spelling correction
Toutanova, K
Moore, RC
40TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2002, : 144 - 151
[3] Multilingual recognition of non-native speech using acoustic model transformation and pronunciation modeling
G. Bouselmi
D. Fohr
I. Illina
International Journal of Speech Technology, 2012, 15 (2) : 203 - 213
[4] Multilingual recognition of non-native speech using acoustic model transformation and pronunciation modeling
Bouselmi, G.
Fohr, D.
Illina, I.
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2012, 15 (02) : 203 - 213
[5] Multilingual modeling of cross-lingual spelling variants
Krister Lindén
Information Retrieval, 2006, 9 : 295 - 310
[6] Multilingual modeling of cross-lingual spelling variants
Linden, Krister
INFORMATION RETRIEVAL, 2006, 9 (03): : 295 - 310
[7] Massively Multilingual Pronunciation Mining with WikiPron
Lee, Jackson L.
Ashby, Lucas F. E.
Garza, M. Elizabeth
Lee-Sikka, Yeonju
Miller, Sean
Wong, Alan
McCarthy, Arya D.
Gorman, Kyle
PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 4223 - 4228
[8] Category Similarity in Multilingual Pronunciation Training
Koreman, Jacques
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2578 - 2582
[9] Decoupled Pronunciation and Prosody Modeling in Meta-Learning-Based Multilingual Speech Synthesis
Peng, Yukun
Ling, Zhenhua
INTERSPEECH 2022, 2022, : 4257 - 4261
[10] Improved pronunciation prediction accuracy using morphology
Sharma, Dravyansh
Sahai, Saumya Yashmohini
Chaudhari, Neha
Bruguier, Antoine
SIGMORPHON 2021: 18TH SIGMORPHON WORKSHOP ON COMPUTATIONAL RESEARCH IN PHONETICS, PHONOLOGY, AND MORPHOLOGY, 2021, : 222 - 228

← 1 2 3 4 5 →