Noise-Adaptive LDA: A New Approach for Speech Recognition Under Observation Uncertainty

被引：17

作者：

Kolossa, Dorothea ^{[1
]}

Zeiler, Steffen ^{[1
]}

Saeidi, Rahim ^{[2
]}

Astudillo, Ramon Fernandez ^{[3
]}

机构：

[1] Ruhr Univ Bochum, Inst Commun Acoust, Bochum, Germany

[2] Radboud Univ Nijmegen, NL-6525 ED Nijmegen, Netherlands

[3] INESC ID, Spoken Language Syst Lab, Lisbon, Portugal

来源：

IEEE SIGNAL PROCESSING LETTERS | 2013年 / 20卷 / 11期

关键词：

ASR; LDA; noise adaptive; observation uncertainty;

D O I：

10.1109/LSP.2013.2278556

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Automatic speech recognition (ASR) performance suffers severely from non-stationary noise, precluding widespread use of ASR in natural environments. Recently, so-termed uncertainty-of-observation techniques have helped to recover good performance. These consider the clean speech features as a hidden variable, of which the observable features are only an imperfect estimate. An estimated error variance of features is therefore used to further guide recognition. Based on the same idea, we introduce a new strategy: Reducing the speech feature dimensionality for optimal discriminance under observation uncertainty can yield significantly improved recognition performance, and is derived easily via Fisher's criterion of discriminant analysis.

引用

页码：1018 / 1021

页数：4

共 50 条

[21] JOINT NOISE ADAPTIVE TRAINING FOR ROBUST AUTOMATIC SPEECH RECOGNITION
Narayanan, Arun
Wang, DeLiang
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[22] REPRESENTATION OF HIDDEN MARKOV MODEL FOR NOISE ADAPTIVE SPEECH RECOGNITION
LEE, LM
WANG, HC
ELECTRONICS LETTERS, 1995, 31 (08) : 616 - 617
[23] Joint Uncertainty Decoding With Predictive Methods for Noise Robust Speech Recognition
Xu, Haitian
Gales, Mark J. F.
Chin, K. K.
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (06): : 1665 - 1676
[24] An Adaptive Spectrum Sensing Algorithm under Noise Uncertainty
Zhang, Shibing
Bao, Zhihua
2011 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2011,
[25] A perceptual masking approach for noise robust speech recognition
Hari Krishna Maganti
Marco Matassoni
EURASIP Journal on Audio, Speech, and Music Processing, 2012
[26] Robust speech recognition using a noise rejection approach
Khan, E
Levinson, R
IEEE INTERNATIONAL JOINT SYMPOSIA ON INTELLIGENCE AND SYSTEMS - PROCEEDINGS, 1998, : 326 - 335
[27] A perceptual masking approach for noise robust speech recognition
Maganti, Hari Krishna
Matassoni, Marco
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2012,
[28] A new approach for Persian speech Recognition
Pour, Meysam Mohamad
Farokhi, Fardad
2009 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE, VOLS 1-3, 2009, : 153 - 158
[29] Adaptive Multimodal Fusion by Uncertainty Compensation With Application to Audiovisual Speech Recognition
Papandreou, George
Katsamanis, Athanassios
Pitsikalis, Vassilis
Maragos, Petros
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (03): : 423 - 435
[30] Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition
Hansen, JHL
SPEECH COMMUNICATION, 1996, 20 (1-2) : 151 - 173

← 1 2 3 4 5 →