Noise-Adaptive LDA: A New Approach for Speech Recognition Under Observation Uncertainty

被引:17
|
作者
Kolossa, Dorothea [1 ]
Zeiler, Steffen [1 ]
Saeidi, Rahim [2 ]
Astudillo, Ramon Fernandez [3 ]
机构
[1] Ruhr Univ Bochum, Inst Commun Acoust, Bochum, Germany
[2] Radboud Univ Nijmegen, NL-6525 ED Nijmegen, Netherlands
[3] INESC ID, Spoken Language Syst Lab, Lisbon, Portugal
关键词
ASR; LDA; noise adaptive; observation uncertainty;
D O I
10.1109/LSP.2013.2278556
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Automatic speech recognition (ASR) performance suffers severely from non-stationary noise, precluding widespread use of ASR in natural environments. Recently, so-termed uncertainty-of-observation techniques have helped to recover good performance. These consider the clean speech features as a hidden variable, of which the observable features are only an imperfect estimate. An estimated error variance of features is therefore used to further guide recognition. Based on the same idea, we introduce a new strategy: Reducing the speech feature dimensionality for optimal discriminance under observation uncertainty can yield significantly improved recognition performance, and is derived easily via Fisher's criterion of discriminant analysis.
引用
收藏
页码:1018 / 1021
页数:4
相关论文
共 50 条
  • [31] A supervised learning approach to uncertainty decoding for robust speech recognition
    Srinivasan, Soundararajan
    Wang, DeLiang
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 297 - 300
  • [32] INVENTORY MODELS UNDER UNCERTAINTY - AN ADAPTIVE APPROACH
    RUBINSTEIN, YR
    KREIMER, J
    MATHEMATICS AND COMPUTERS IN SIMULATION, 1986, 28 (03) : 169 - 188
  • [33] AN MCMC APPROACH TO JOINT ESTIMATION OF CLEAN SPEECH AND NOISE FOR ROBUST SPEECH RECOGNITION
    Mushtaq, Aleem
    Lee, Chin-Hui
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7107 - 7111
  • [34] JOINT UNCERTAINTY DECODING WITH THE SECOND ORDER APPROXIMATION FOR NOISE ROBUST SPEECH RECOGNITION
    Xu, Haitian
    Chin, K. K.
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3841 - 3844
  • [35] Noise-Separated Adaptive Feature Distillation for Robust Speech Recognition
    Qu, Honglin
    Su, Xiangdong
    Wang, Yonghe
    Hao, Xiang
    Gao, Guanglai
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 763 - 767
  • [36] Experiments with an extended adaptive SVD enhancement scheme for speech recognition in noise
    Uhl, C
    Lieb, M
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 281 - 284
  • [37] Noise Adaptive Stream Weighting in Audio-Visual Speech Recognition
    Martin Heckmann
    Frédéric Berthommier
    Kristian Kroschel
    EURASIP Journal on Advances in Signal Processing, 2002
  • [38] Noise adaptive stream weighting in audio-visual speech recognition
    Heckmann, M
    Berthommier, F
    Kroschel, K
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2002, 2002 (11) : 1260 - 1273
  • [39] Speech Processing for Makhraj Recognition The Design of Adaptive Filter for Noise Canceller
    Arshad, N. W.
    Aziz, S. N. Abdul
    Naim, F.
    Karim, R. Abdul
    Hamid, R.
    Zakaria, N. F.
    2011 7TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY IN ASIA (CITA 11), 2011,
  • [40] Comparison of Estimation Techniques in Joint Uncertainty Decoding for Noise Robust Speech Recognition
    Xu, Haitian
    Chin, K. K.
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2363 - 2366