Noise-Adaptive LDA: A New Approach for Speech Recognition Under Observation Uncertainty

Citations: 17
Authors
Kolossa, Dorothea [1 ]
Zeiler, Steffen [1 ]
Saeidi, Rahim [2 ]
Astudillo, Ramon Fernandez [3 ]
Affiliations
[1] Ruhr Univ Bochum, Inst Commun Acoust, Bochum, Germany
[2] Radboud Univ Nijmegen, NL-6525 ED Nijmegen, Netherlands
[3] INESC ID, Spoken Language Syst Lab, Lisbon, Portugal
Keywords
ASR; LDA; noise adaptive; observation uncertainty
DOI
10.1109/LSP.2013.2278556
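Chinese Library Classification number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract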
Automatic speech recognition (ASR) performance suffers severely from non-stationary noise, precluding widespread use of ASR in natural environments. Recently, so-termed uncertainty-of-observation techniques have helped to recover good performance. These consider the clean speech features as a hidden variable, of which the observable features are only an imperfect estimate. An estimated error variance of features is therefore used to further guide recognition. Based on the same idea, we introduce a new strategy: Reducing the speech feature dimensionality for optimal discriminance under observation uncertainty can yield significantly improved recognition performance, and is derived easily via Fisher's criterion of discriminant analysis.
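The abstract does not state the exact formulation, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes that observation uncertainty is available as a per-dimension error variance and that it can be accounted for by adding it to the within-class scatter before maximizing Fisher's criterion. The function name `uncertainty_aware_lda` and the parameter `obs_var` are hypothetical.

```python
import numpy as np

def uncertainty_aware_lda(X, y, n_components, obs_var=None):
    """Fisher LDA projection matrix of shape (d, n_components).

    Assumption (not from the paper): obs_var is a length-d vector of
    feature error variances that is added to the diagonal of the
    within-class scatter matrix.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    n, d = X.shape
    mean_all = X.mean(axis=0)
    S_w = np.zeros((d, d))  # within-class scatter
    S_b = np.zeros((d, d))  # between-class scatter
    for c in np.unique(y):
        Xc = X[y == c]
        mu_c = Xc.mean(axis=0)
        diff = Xc - mu_c
        S_w += diff.T @ diff
        m = (mu_c - mean_all)[:, None]
        S_b += len(Xc) * (m @ m.T)
    S_w /= n
    S_b /= n
    if obs_var is not None:
        # Assumed model: uncertain dimensions look "noisier" within each class.
        S_w = S_w + np.diag(np.asarray(obs_var, dtype=float))
    S_w = S_w + 1e-8 * np.eye(d)  # small ridge for numerical stability
    # Solve S_b w = lambda S_w w by whitening with the Cholesky factor of S_w.
    L = np.linalg.cholesky(S_w)
    L_inv = np.linalg.inv(L)
    evals, evecs = np.linalg.eigh(L_inv @ S_b @ L_inv.T)
    order = np.argsort(evals)[::-1][:n_components]
    return L_inv.T @ evecs[:, order]
```

Under this assumed model, a feature dimension with a large error variance receives a smaller weight in the projection, so the reduced features lean on the dimensions that are currently more reliable.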
Pages: 1018 - 1021
Page count: 4
Related papers
50 in total
  • [21] JOINT NOISE ADAPTIVE TRAINING FOR ROBUST AUTOMATIC SPEECH RECOGNITION
    Narayanan, Arun
    Wang, DeLiang
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014
  • [22] REPRESENTATION OF HIDDEN MARKOV MODEL FOR NOISE ADAPTIVE SPEECH RECOGNITION
    LEE, LM
    WANG, HC
    ELECTRONICS LETTERS, 1995, 31 (08) : 616 - 617
  • [23] Joint Uncertainty Decoding With Predictive Methods for Noise Robust Speech Recognition
    Xu, Haitian
    Gales, Mark J. F.
    Chin, K. K.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (06) : 1665 - 1676
  • [24] An Adaptive Spectrum Sensing Algorithm under Noise Uncertainty
    Zhang, Shibing
    Bao, Zhihua
    2011 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2011
  • [25] A perceptual masking approach for noise robust speech recognition
    Hari Krishna Maganti
    Marco Matassoni
    EURASIP Journal on Audio, Speech, and Music Processing, 2012
  • [26] Robust speech recognition using a noise rejection approach
    Khan, E
    Levinson, R
    IEEE INTERNATIONAL JOINT SYMPOSIA ON INTELLIGENCE AND SYSTEMS - PROCEEDINGS, 1998 : 326 - 335
  • [27] A perceptual masking approach for noise robust speech recognition
    Maganti, Hari Krishna
    Matassoni, Marco
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2012
  • [28] A new approach for Persian speech Recognition
    Pour, Meysam Mohamad
    Farokhi, Fardad
    2009 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE, VOLS 1-3, 2009 : 153 - 158
  • [29] Adaptive Multimodal Fusion by Uncertainty Compensation With Application to Audiovisual Speech Recognition
    Papandreou, George
    Katsamanis, Athanassios
    Pitsikalis, Vassilis
    Maragos, Petros
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (03) : 423 - 435
  • [30] Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition
    Hansen, JHL
    SPEECH COMMUNICATION, 1996, 20 (1-2) : 151 - 173