State-based bilingual model modification for nonnative speech recognition

被引:0
|
作者
Zhang, Qingqing [1 ]
Li, Ta [1 ]
Pan, Jiefin [1 ]
Yan, Yonghong [1 ]
机构
[1] Chinese Acad Sci, Inst Acoust, ThinkIT Speech Lab, Beijing 100080, Peoples R China
关键词
D O I
10.1109/ICALIP.2008.4589996
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The speech recognition accuracy has been observed to decrease for nonnative speakers, especially those who are just beginning to learn foreign language or who have heavy accents. This paper presents a novel bilingual model modification approach to improve nonnative speech recog2nition via considering these great variations of accented pronunciations. Each state of the baseline nonnative acoustic models is modified with several candidate states from the auxiliary acoustic models, which are trained by speakers' mother language. State mapping criterion and n-best candidates are investigated based on a grammar-constrained speech recognition system. Using the state-based bilingual model modification approach, compared to the nonnative acoustic models which have already been well trained by adaptation technique MAP, a Relative reduction of 11.7% in Phrase Error Rate (RPhrER) was further achieved.
引用
收藏
页码:1300 / 1304
页数:5
相关论文
共 50 条
  • [1] Nonnative Speech Recognition Based on Bilingual Model Modification at State Level
    Zhang, Qingqing
    Pan, Jielin
    Chan, Shui-duen
    Yan, Yonghong
    SIXTH INTERNATIONAL SYMPOSIUM ON NEURAL NETWORKS (ISNN 2009), 2009, 56 : 299 - +
  • [2] Nonnative Speech Recognition based on Bilingual Model Modification
    Zhang, Qingqing
    Pan, Jielin
    Chan, Shui-duen
    Yan, Yonghong
    2009 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, 2009, : 110 - +
  • [3] Nonnative Speech Recognition Based on State-level Bilingual Model Modification
    Zhang, Qingqing
    Li, Ta
    Pan, Jielin
    Yan, Yonghong
    THIRD 2008 INTERNATIONAL CONFERENCE ON CONVERGENCE AND HYBRID INFORMATION TECHNOLOGY, VOL 2, PROCEEDINGS, 2008, : 1220 - 1225
  • [4] Nonnative Speech Recognition Based on State-Candidate Bilingual Model Modification
    Zhang, Qingqing
    Li, Ta
    Pan, Jielin
    Yan, Yonghong
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2366 - 2369
  • [5] Speech Recognition With State-based Nearest Neighbour Classifiers
    Deselaers, Thomas
    Heigold, Georg
    Ney, Hermann
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 333 - 336
  • [6] State-based labelling for a sparse representation of speech and its application to robust speech recognition
    Virtanen, Tuomas
    Gemmeke, Jort F.
    Hurmalainen, Antti
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 893 - +
  • [7] Prior knowledge guided MEL based model selection and adaptation for nonnative speech recognition
    He, XD
    Zhao, YX
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 337 - 340
  • [8] State-based Gaussian selection in large vocabulary continuous speech recognition using HMM's
    Gales, MJF
    Knill, KM
    Young, SJ
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (02): : 152 - 161
  • [9] A state-based approach to the representation and recognition of gesture
    Bobick, AF
    Wilson, AD
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (12) : 1325 - 1337
  • [10] Multisensory Integration of Native and Nonnative Speech in Bilingual and Monolingual Adults
    Mohamed, Riham Hafez
    Ansari, Niloufar
    Abdeljawad, Bahaa
    Valdivia, Celina
    Edwards, Abigail
    Parks, Kaitlyn M. A.
    Rafat, Yassaman
    Stevenson, Ryan A.
    MULTISENSORY RESEARCH, 2024, 37 (6-8) : 413 - 430