A Simulated-Data Adaptation Technique for Robust Speech Recognition

被引：0

作者：

Thatphithakkul, Nattanun ^{[1
]}

Kruatrachue, Boontee ^{[1
]}

Wutiwiwatchai, Chai ^{[2
]}

Marukatat, Sanparith ^{[2
]}

Boonpiam, Vataya

机构：

[1] King Mongkuts Inst Technol Ladkrabang, Dept Comp Engn, Fac Engn, Bangkok 10520, Thailand

[2] Natl Elect & Comp Technol Ctr, Pathum Thani 12120, Thailand

来源：

INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 | 2006年

关键词：

robust speech recognition; MLLR; online-adaptation;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper proposes an efficient acoustic model adaptation method based on the use of simulated-data in maximum likelihood linear regression (MLLR) adaptation for robust speech recognition. Online MLLR adaptation is an unsupervised process which requires an input speech with phone labels transcribed automatically. Instead of using only the input signal in adaptation, our proposed simulated data method increases the size of adaptation data by adding noise portions extracted from the input speech to a set of pre-recorded clean speech, whose correct transcriptions are known. Various configurations of the proposed method are explored. Evaluations are performed with both additive and real noisy speech. The experimental results show that the proposed system achieves higher recognition rate than the system using only the input speech in adaptation and the system using a multi-conditioned acoustic model.

引用

页码：777 / +

页数：2

共 50 条

[31] EXPLOITING MULTIMODAL DATA FUSION IN ROBUST SPEECH RECOGNITION
Heracleous, Panikos
Badin, Pierre
Bailly, Gerard
Hagita, Norihiro
2010 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2010), 2010, : 568 - 572
[32] MAXIMUM LIKELIHOOD ADAPTATION OF HISTOGRAM EQUALIZATION WITH CONSTRAINT FOR ROBUST SPEECH RECOGNITION
Xiao, Xiong
Li, Jinyu
Chng, Eng Siong
Li, Haizhou
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5480 - 5483
[33] Model Adaptation Based on Improved Variance Estimation for Robust Speech Recognition
Lu, Yong
Xu, Zongyu
Yan, Qin
Zhou, Lin
2012 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP 2012), 2012,
[34] A novel HMM model adaptation and compensation method for robust speech recognition
Ning, GX
Wei, G
INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES 2005, VOLS 1 AND 2, PROCEEDINGS, 2005, : 274 - 277
[35] Adaptive model-based technique for robust speech recognition
Graciarena, M
CONFERENCE RECORD OF THE THIRTY-FOURTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2000, : 1512 - 1516
[36] Maximum likelihood sub-band adaptation for robust speech recognition
Zhu, DL
Nakamura, S
Paliwal, KK
Wang, RH
SPEECH COMMUNICATION, 2005, 47 (03) : 243 - 264
[37] Joint Adaptation and Adaptive Training of TVWR for Robust Automatic Speech Recognition
Liu, Shilin
Sim, Khe Chai
15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 636 - 640
[38] Unsupervised Language Model Adaptation by Data Selection for Speech Recognition
Khassanov, Yerbolat
Chong, Tze Yuang
Bigot, Benjamin
Chng, Eng Siong
INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2017, PT I, 2017, 10191 : 508 - 517
[39] A robust speech analysis in speech recognition
Miyanaga, Y
Gozen, S
Ohtsuki, N
2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 706 - 709
[40] Noise-robust speech recognition by discriminative adaptation in parallel model combination
Chung, YJ
ELECTRONICS LETTERS, 2000, 36 (04) : 370 - 371

← 1 2 3 4 5 →