Noise-Adaptive LDA: A New Approach for Speech Recognition Under Observation Uncertainty

被引:17
|
作者
Kolossa, Dorothea [1 ]
Zeiler, Steffen [1 ]
Saeidi, Rahim [2 ]
Astudillo, Ramon Fernandez [3 ]
机构
[1] Ruhr Univ Bochum, Inst Commun Acoust, Bochum, Germany
[2] Radboud Univ Nijmegen, NL-6525 ED Nijmegen, Netherlands
[3] INESC ID, Spoken Language Syst Lab, Lisbon, Portugal
关键词
ASR; LDA; noise adaptive; observation uncertainty;
D O I
10.1109/LSP.2013.2278556
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Automatic speech recognition (ASR) performance suffers severely from non-stationary noise, precluding widespread use of ASR in natural environments. Recently, so-termed uncertainty-of-observation techniques have helped to recover good performance. These consider the clean speech features as a hidden variable, of which the observable features are only an imperfect estimate. An estimated error variance of features is therefore used to further guide recognition. Based on the same idea, we introduce a new strategy: Reducing the speech feature dimensionality for optimal discriminance under observation uncertainty can yield significantly improved recognition performance, and is derived easily via Fisher's criterion of discriminant analysis.
引用
收藏
页码:1018 / 1021
页数:4
相关论文
共 50 条
  • [1] SPEECH RECOGNITION USING NOISE-ADAPTIVE PROTOTYPES
    NADAS, A
    NAHAMOO, D
    PICHENY, MA
    IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1989, 37 (10): : 1495 - 1503
  • [2] ROBUST AUDIOVISUAL SPEECH RECOGNITION USING NOISE-ADAPTIVE LINEAR DISCRIMINANT ANALYSIS
    Zeiler, Steffen
    Nickel, Robert
    Ma, Ning
    Brown, Guy J.
    Kolossa, Dorothea
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 2797 - 2801
  • [3] Noise-Adaptive Auto-Encoder for Modulation Recognition of RF Signal
    Woo, Jongseok
    Jung, Kuchul
    Mukhopadhyay, Saibal
    2024 IEEE/MTT-S INTERNATIONAL MICROWAVE SYMPOSIUM, IMS 2024, 2024, : 820 - 823
  • [4] NOISE-ADAPTIVE DEEP NEURAL NETWORK FOR SINGLE-CHANNEL SPEECH ENHANCEMENT
    Chung, Hanwook
    Kim, Taesup
    Plourde, Eric
    Champagne, Benoit
    2018 IEEE 28TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2018,
  • [5] A PROGRESSIVE LEARNING APPROACH TO ADAPTIVE NOISE AND SPEECH ESTIMATION FOR SPEECH ENHANCEMENT AND NOISY SPEECH RECOGNITION
    Nian, Zhaoxu
    Tu, Yan-Hui
    Du, Jun
    Lee, Chin-Hui
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6913 - 6917
  • [6] AN UNCERTAINTY DECODING APPROACH TO NOISE- AND REVERBERATION-ROBUST SPEECH RECOGNITION
    Maas, Roland
    Thippur, Akshaya
    Sehr, Armin
    Kellermann, Walter
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7388 - 7392
  • [7] Uncertainty decoding with splice for noise robust speech recognition
    Droppo, J
    Acero, A
    Deng, L
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 57 - 60
  • [8] Issues with Uncertainty Decoding for Noise Robust Speech Recognition
    Liao, H.
    Gales, M. J. F.
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1121 - 1124
  • [9] NOISE ADAPTIVE TRAINING USING A VECTOR TAYLOR SERIES APPROACH FOR NOISE ROBUST AUTOMATIC SPEECH RECOGNITION
    Kalinli, Ozlem
    Seltzer, Michael L.
    Acero, Alex
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3825 - 3828
  • [10] A noise-adaptive discriminant function and its application to blurred machine-printed Kanji recognition
    Omachi, S
    Sun, F
    Aso, H
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (03) : 314 - 319