State-based bilingual model modification for nonnative speech recognition

被引：0

作者：

Zhang, Qingqing ^{[1
]}

Li, Ta ^{[1
]}

Pan, Jiefin ^{[1
]}

Yan, Yonghong ^{[1
]}

机构：

[1] Chinese Acad Sci, Inst Acoust, ThinkIT Speech Lab, Beijing 100080, Peoples R China

来源：

2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS | 2008年

关键词：

D O I：

10.1109/ICALIP.2008.4589996

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The speech recognition accuracy has been observed to decrease for nonnative speakers, especially those who are just beginning to learn foreign language or who have heavy accents. This paper presents a novel bilingual model modification approach to improve nonnative speech recog2nition via considering these great variations of accented pronunciations. Each state of the baseline nonnative acoustic models is modified with several candidate states from the auxiliary acoustic models, which are trained by speakers' mother language. State mapping criterion and n-best candidates are investigated based on a grammar-constrained speech recognition system. Using the state-based bilingual model modification approach, compared to the nonnative acoustic models which have already been well trained by adaptation technique MAP, a Relative reduction of 11.7% in Phrase Error Rate (RPhrER) was further achieved.

引用

页码：1300 / 1304

页数：5

共 50 条

[1] Nonnative Speech Recognition Based on Bilingual Model Modification at State Level
Zhang, Qingqing
Pan, Jielin
Chan, Shui-duen
Yan, Yonghong
SIXTH INTERNATIONAL SYMPOSIUM ON NEURAL NETWORKS (ISNN 2009), 2009, 56 : 299 - +
[2] Nonnative Speech Recognition based on Bilingual Model Modification
Zhang, Qingqing
Pan, Jielin
Chan, Shui-duen
Yan, Yonghong
2009 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, 2009, : 110 - +
[3] Nonnative Speech Recognition Based on State-level Bilingual Model Modification
Zhang, Qingqing
Li, Ta
Pan, Jielin
Yan, Yonghong
THIRD 2008 INTERNATIONAL CONFERENCE ON CONVERGENCE AND HYBRID INFORMATION TECHNOLOGY, VOL 2, PROCEEDINGS, 2008, : 1220 - 1225
[4] Nonnative Speech Recognition Based on State-Candidate Bilingual Model Modification
Zhang, Qingqing
Li, Ta
Pan, Jielin
Yan, Yonghong
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2366 - 2369
[5] Speech Recognition With State-based Nearest Neighbour Classifiers
Deselaers, Thomas
Heigold, Georg
Ney, Hermann
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 333 - 336
[6] State-based labelling for a sparse representation of speech and its application to robust speech recognition
Virtanen, Tuomas
Gemmeke, Jort F.
Hurmalainen, Antti
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 893 - +
[7] Prior knowledge guided MEL based model selection and adaptation for nonnative speech recognition
He, XD
Zhao, YX
2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 337 - 340
[8] State-based Gaussian selection in large vocabulary continuous speech recognition using HMM's
Gales, MJF
Knill, KM
Young, SJ
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (02): : 152 - 161
[9] A state-based approach to the representation and recognition of gesture
Bobick, AF
Wilson, AD
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (12) : 1325 - 1337
[10] Multisensory Integration of Native and Nonnative Speech in Bilingual and Monolingual Adults
Mohamed, Riham Hafez
Ansari, Niloufar
Abdeljawad, Bahaa
Valdivia, Celina
Edwards, Abigail
Parks, Kaitlyn M. A.
Rafat, Yassaman
Stevenson, Ryan A.
MULTISENSORY RESEARCH, 2024, 37 (6-8) : 413 - 430

← 1 2 3 4 5 →