Using multilingual units for improved modeling of pronunciation variants

被引：0

作者：

Bartkova, K. ^{[1
]}

Jouvet, D. ^{[1
]}

机构：

[1] France Telecom, Div R&D, TECH, SSTP, F-22307 Lannion, France

来源：

2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13 | 2006年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Standard speech modeling generally implies the combination of models of the phonemes of the current language with a description of possible pronunciation variants of the vocabulary words. When dealing with foreign accent, this standard native speech modeling is not adequate. In fact many variabilities have to be taken into account as the acoustic realization of the sounds by non-native speakers does not always match with native models and some phonemes may be replaced by others. By introducing models of phonemes estimated from speech data of other languages, and adding extra pronunciation variants through phonological rules, speech recognition performance improvements were achieved on non-native speech. In this study, a selection of the most frequently used variants is proposed, which relies on the frequency of usage of the various models associated to each phoneme on a development set. Although this selection process is rather simple it provides significant performance improvement.

引用

页码：5895 / 5898

页数：4

共 50 条

[31] INFORMATIVE DIALECT RECOGNITION USING CONTEXT-DEPENDENT PRONUNCIATION MODELING
Chen, Nancy F.
Shen, Wade
Campbell, Joseph P.
Torres-Carrasquillo, Pedro A.
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4396 - 4399
[32] IMPROVED ESTIMATION OF PROBABILITIES IN PRONUNCIATION BY ANALOGY
Kujala, Janne V.
Nandi, Asoke K.
2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 2797 - 2801
[33] Adaptive conditional pronunciation modeling using articulatory features for speaker verification
Leung, KY
Mak, MW
Siu, MH
Kung, SY
2004 INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2004, : 61 - 64
[34] Semantic context effects in the comprehension of reduced pronunciation variants
van de Ven, Marco
Tucker, Benjamin V.
Ernestus, Mirjam
MEMORY & COGNITION, 2011, 39 (07) : 1301 - 1316
[35] Exploring the role of exposure frequency in recognizing pronunciation variants
Pitt, Mark A.
Dilley, Laura
Tat, Michael
JOURNAL OF PHONETICS, 2011, 39 (03) : 304 - 311
[36] A hybrid statistical model to generate pronunciation variants of words
Vazirnezhad, B
Almasganj, F
Bijankhan, M
PROCEEDINGS OF THE 2005 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (IEEE NLP-KE'05), 2005, : 106 - 110
[37] Comparing SMT Methods for Automatic Generation of Pronunciation Variants
Karanasou, Panagiota
Lamel, Lori
ADVANCES IN NATURAL LANGUAGE PROCESSING, 2010, 6233 : 167 - 178
[38] The Strength and Time Course of Lexical Activation of Pronunciation Variants
Pitt, Mark A.
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 2009, 35 (03) : 896 - 910
[39] Using Pronunciation-Based Morphological Subword Units to Improve OOV Handling in Keyword Search
He, Yanzhang
Baumann, Peter
Fang, Hao
Hutchinson, Brian
Jaech, Aaron
Ostendorf, Mari
Fosler-Lussier, Eric
Pierrehumbert, Janet
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (01) : 79 - 92
[40] Generation and Pruning of Pronunciation Variants to Improve ASR Accuracy
Ge, Zhenhao
Ganapathiraju, Aravind
Iyer, Ananth N.
Randal, Scott A.
Wyss, Felix I.
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3101 - 3105

← 1 2 3 4 5 →