A Simulated-Data Adaptation Technique for Robust Speech Recognition

被引：0

作者：

Thatphithakkul, Nattanun ^{[1
]}

Kruatrachue, Boontee ^{[1
]}

Wutiwiwatchai, Chai ^{[2
]}

Marukatat, Sanparith ^{[2
]}

Boonpiam, Vataya

机构：

[1] King Mongkuts Inst Technol Ladkrabang, Dept Comp Engn, Fac Engn, Bangkok 10520, Thailand

[2] Natl Elect & Comp Technol Ctr, Pathum Thani 12120, Thailand

来源：

INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 | 2006年

关键词：

robust speech recognition; MLLR; online-adaptation;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper proposes an efficient acoustic model adaptation method based on the use of simulated-data in maximum likelihood linear regression (MLLR) adaptation for robust speech recognition. Online MLLR adaptation is an unsupervised process which requires an input speech with phone labels transcribed automatically. Instead of using only the input signal in adaptation, our proposed simulated data method increases the size of adaptation data by adding noise portions extracted from the input speech to a set of pre-recorded clean speech, whose correct transcriptions are known. Various configurations of the proposed method are explored. Evaluations are performed with both additive and real noisy speech. The experimental results show that the proposed system achieves higher recognition rate than the system using only the input speech in adaptation and the system using a multi-conditioned acoustic model.

引用

页码：777 / +

页数：2

共 50 条

[41] Multi-Speaker Adaptation for Robust Speech Recognition under Ubiquitous Environment
Shih, Po-Yi
Wang, Jhing-Fa
Lin, Yuan-Ning
Fu, Zhong-Hua
ORIENTAL COCOSDA 2009 - INTERNATIONAL CONFERENCE ON SPEECH DATABASE AND ASSESSMENTS, 2009, : 126 - 131
[42] Combining MMSE enhancement with LA model adaptation for robust automatic speech recognition
Ding, P
Cao, ZG
ELECTRONICS LETTERS, 2001, 37 (08) : 539 - 540
[43] ACOUSTIC MODEL ADAPTATION VIA LINEAR SPLINE INTERPOLATION FOR ROBUST SPEECH RECOGNITION
Seltzer, Michael L.
Acero, Alex
Kalgaonkar, Kaustubh
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4550 - 4553
[44] Joint speaker and environment adaptation using Tensor Voice for robust speech recognition
Jeong, Yongwon
SPEECH COMMUNICATION, 2014, 58 : 1 - 10
[45] ON COMBINING DNN AND GMM WITH UNSUPERVISED SPEAKER ADAPTATION FOR ROBUST AUTOMATIC SPEECH RECOGNITION
Liu, Shilin
Sim, Khe Chai
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[46] HMM Adaptation using Statistical Linear Approximation for Robust Automatic Speech Recognition
Berkovitch, Michael
Shallom, Ilan D.
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1301 - 1304
[47] Domain Adaptation Using Factorized Hidden Layer for Robust Automatic Speech Recognition
Sim, Khe Chai
Narayanan, Arun
Misra, Ananya
Tripathi, Anshuman
Pundak, Golan
Sainath, Tara N.
Haghani, Parisa
Li, Bo
Bacchiani, Michiel
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 892 - 896
[48] Variance compensation within the MLLR framework for robust speech recognition and speaker adaptation
Gales, MJF
Pye, D
Woodland, PC
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1832 - 1835
[49] Feature adaptation using deviation vector for robust speech recognition in noisy environment
Hwang, TH
Lee, LM
Wang, HC
1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1227 - 1230
[50] Model Adaptation Algorithm Based on Central Subband Regression for Robust Speech Recognition
Lu, Yong
Zhou, Lin
2014 SEVENTH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID 2014), VOL 2, 2014,

← 1 2 3 4 5 →