A Simulated-Data Adaptation Technique for Robust Speech Recognition

被引:0
|
作者
Thatphithakkul, Nattanun [1 ]
Kruatrachue, Boontee [1 ]
Wutiwiwatchai, Chai [2 ]
Marukatat, Sanparith [2 ]
Boonpiam, Vataya
机构
[1] King Mongkuts Inst Technol Ladkrabang, Dept Comp Engn, Fac Engn, Bangkok 10520, Thailand
[2] Natl Elect & Comp Technol Ctr, Pathum Thani 12120, Thailand
关键词
robust speech recognition; MLLR; online-adaptation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes an efficient acoustic model adaptation method based on the use of simulated-data in maximum likelihood linear regression (MLLR) adaptation for robust speech recognition. Online MLLR adaptation is an unsupervised process which requires an input speech with phone labels transcribed automatically. Instead of using only the input signal in adaptation, our proposed simulated data method increases the size of adaptation data by adding noise portions extracted from the input speech to a set of pre-recorded clean speech, whose correct transcriptions are known. Various configurations of the proposed method are explored. Evaluations are performed with both additive and real noisy speech. The experimental results show that the proposed system achieves higher recognition rate than the system using only the input speech in adaptation and the system using a multi-conditioned acoustic model.
引用
收藏
页码:777 / +
页数:2
相关论文
共 50 条
  • [31] EXPLOITING MULTIMODAL DATA FUSION IN ROBUST SPEECH RECOGNITION
    Heracleous, Panikos
    Badin, Pierre
    Bailly, Gerard
    Hagita, Norihiro
    2010 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2010), 2010, : 568 - 572
  • [32] MAXIMUM LIKELIHOOD ADAPTATION OF HISTOGRAM EQUALIZATION WITH CONSTRAINT FOR ROBUST SPEECH RECOGNITION
    Xiao, Xiong
    Li, Jinyu
    Chng, Eng Siong
    Li, Haizhou
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5480 - 5483
  • [33] Model Adaptation Based on Improved Variance Estimation for Robust Speech Recognition
    Lu, Yong
    Xu, Zongyu
    Yan, Qin
    Zhou, Lin
    2012 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP 2012), 2012,
  • [34] A novel HMM model adaptation and compensation method for robust speech recognition
    Ning, GX
    Wei, G
    INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES 2005, VOLS 1 AND 2, PROCEEDINGS, 2005, : 274 - 277
  • [35] Adaptive model-based technique for robust speech recognition
    Graciarena, M
    CONFERENCE RECORD OF THE THIRTY-FOURTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2000, : 1512 - 1516
  • [36] Maximum likelihood sub-band adaptation for robust speech recognition
    Zhu, DL
    Nakamura, S
    Paliwal, KK
    Wang, RH
    SPEECH COMMUNICATION, 2005, 47 (03) : 243 - 264
  • [37] Joint Adaptation and Adaptive Training of TVWR for Robust Automatic Speech Recognition
    Liu, Shilin
    Sim, Khe Chai
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 636 - 640
  • [38] Unsupervised Language Model Adaptation by Data Selection for Speech Recognition
    Khassanov, Yerbolat
    Chong, Tze Yuang
    Bigot, Benjamin
    Chng, Eng Siong
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2017, PT I, 2017, 10191 : 508 - 517
  • [39] A robust speech analysis in speech recognition
    Miyanaga, Y
    Gozen, S
    Ohtsuki, N
    2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 706 - 709
  • [40] Noise-robust speech recognition by discriminative adaptation in parallel model combination
    Chung, YJ
    ELECTRONICS LETTERS, 2000, 36 (04) : 370 - 371