A Simulated-Data Adaptation Technique for Robust Speech Recognition

被引:0
|
作者
Thatphithakkul, Nattanun [1 ]
Kruatrachue, Boontee [1 ]
Wutiwiwatchai, Chai [2 ]
Marukatat, Sanparith [2 ]
Boonpiam, Vataya
机构
[1] King Mongkuts Inst Technol Ladkrabang, Dept Comp Engn, Fac Engn, Bangkok 10520, Thailand
[2] Natl Elect & Comp Technol Ctr, Pathum Thani 12120, Thailand
关键词
robust speech recognition; MLLR; online-adaptation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes an efficient acoustic model adaptation method based on the use of simulated-data in maximum likelihood linear regression (MLLR) adaptation for robust speech recognition. Online MLLR adaptation is an unsupervised process which requires an input speech with phone labels transcribed automatically. Instead of using only the input signal in adaptation, our proposed simulated data method increases the size of adaptation data by adding noise portions extracted from the input speech to a set of pre-recorded clean speech, whose correct transcriptions are known. Various configurations of the proposed method are explored. Evaluations are performed with both additive and real noisy speech. The experimental results show that the proposed system achieves higher recognition rate than the system using only the input speech in adaptation and the system using a multi-conditioned acoustic model.
引用
收藏
页码:777 / +
页数:2
相关论文
共 50 条
  • [41] Multi-Speaker Adaptation for Robust Speech Recognition under Ubiquitous Environment
    Shih, Po-Yi
    Wang, Jhing-Fa
    Lin, Yuan-Ning
    Fu, Zhong-Hua
    ORIENTAL COCOSDA 2009 - INTERNATIONAL CONFERENCE ON SPEECH DATABASE AND ASSESSMENTS, 2009, : 126 - 131
  • [42] Combining MMSE enhancement with LA model adaptation for robust automatic speech recognition
    Ding, P
    Cao, ZG
    ELECTRONICS LETTERS, 2001, 37 (08) : 539 - 540
  • [43] ACOUSTIC MODEL ADAPTATION VIA LINEAR SPLINE INTERPOLATION FOR ROBUST SPEECH RECOGNITION
    Seltzer, Michael L.
    Acero, Alex
    Kalgaonkar, Kaustubh
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4550 - 4553
  • [44] Joint speaker and environment adaptation using Tensor Voice for robust speech recognition
    Jeong, Yongwon
    SPEECH COMMUNICATION, 2014, 58 : 1 - 10
  • [45] ON COMBINING DNN AND GMM WITH UNSUPERVISED SPEAKER ADAPTATION FOR ROBUST AUTOMATIC SPEECH RECOGNITION
    Liu, Shilin
    Sim, Khe Chai
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [46] HMM Adaptation using Statistical Linear Approximation for Robust Automatic Speech Recognition
    Berkovitch, Michael
    Shallom, Ilan D.
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1301 - 1304
  • [47] Domain Adaptation Using Factorized Hidden Layer for Robust Automatic Speech Recognition
    Sim, Khe Chai
    Narayanan, Arun
    Misra, Ananya
    Tripathi, Anshuman
    Pundak, Golan
    Sainath, Tara N.
    Haghani, Parisa
    Li, Bo
    Bacchiani, Michiel
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 892 - 896
  • [48] Variance compensation within the MLLR framework for robust speech recognition and speaker adaptation
    Gales, MJF
    Pye, D
    Woodland, PC
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1832 - 1835
  • [49] Feature adaptation using deviation vector for robust speech recognition in noisy environment
    Hwang, TH
    Lee, LM
    Wang, HC
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1227 - 1230
  • [50] Model Adaptation Algorithm Based on Central Subband Regression for Robust Speech Recognition
    Lu, Yong
    Zhou, Lin
    2014 SEVENTH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID 2014), VOL 2, 2014,