Reverberation and Noise Robust Feature Compensation Based on IMM

被引:9
|
作者
Han, Chang Woo [1 ,2 ]
Kang, Shin Jae [1 ,2 ]
Kim, Nam Soo [1 ,2 ]
机构
[1] Seoul Natl Univ, Sch Elect Engn, Seoul 151, South Korea
[2] Seoul Natl Univ, INMC, Seoul 151, South Korea
基金
新加坡国家研究基金会;
关键词
Dereverberation; feature compensation; interacting multiple model (IMM); MAXIMUM-LIKELIHOOD; SPEECH; ADAPTATION; ALGORITHM;
D O I
10.1109/TASL.2013.2256893
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we propose a novel feature compensation approach based on the interacting multiple model (IMM) algorithm specially designed for joint processing of background noise and acoustic reverberation. Our approach to cope with the time-varying environmental parameters is to establish a switching linear dynamic model for the additive and convolutive distortions, such as the background noise and acoustic reverberation, in the log-spectral domain. We construct multiple state space models with the speech corruption process in which the log spectra of clean speech and log frequency response of acoustic reverberation are jointly handled as the state of our interest. The proposed approach shows significant improvements in the Aurora-5 automatic speech recognition (ASR) task which was developed to investigate the influence on the performance of ASR for a hands-free speech input in noisy room environments.
引用
收藏
页码:1598 / 1611
页数:14
相关论文
共 50 条
  • [41] Noise Robust Feature Extraction Based on Extended Weighted Linear Prediction in LVCSR
    Keronen, Sami
    Pohjalainen, Jouni
    Alku, Paavo
    Kurimo, Mikko
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1272 - +
  • [42] Two-stage model-based feature compensation for robust speech recognition
    Shen, Haifeng
    Liu, Gang
    Guo, Jun
    COMPUTING, 2012, 94 (01) : 1 - 20
  • [43] Two-stage model-based feature compensation for robust speech recognition
    Haifeng Shen
    Gang Liu
    Jun Guo
    Computing, 2012, 94 : 1 - 20
  • [44] Model-based feature enhancement with uncertainty decoding for noise robust ASR
    Stouten, Veronique
    Van hamme, Hugo
    Warnbacq, Patrick
    SPEECH COMMUNICATION, 2006, 48 (11) : 1502 - 1514
  • [45] Teager energy based feature parameters for robust speech recognition in car noise
    Jabloun, Firas
    Cetin, A.Enis
    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 1 : 273 - 276
  • [46] Variational Bayesian Based IMM Robust GPS Navigation Filter
    Jwo, Dah-Jing
    Chang, Wei-Yeh
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (01): : 755 - 773
  • [47] AN UNCERTAINTY DECODING APPROACH TO NOISE- AND REVERBERATION-ROBUST SPEECH RECOGNITION
    Maas, Roland
    Thippur, Akshaya
    Sehr, Armin
    Kellermann, Walter
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7388 - 7392
  • [48] Adaptive mechanisms facilitate robust performance in noise and in reverberation in an auditory categorization model
    Parida, Satyabrata
    Liu, Shi Tong
    Sadagopan, Srivatsun
    COMMUNICATIONS BIOLOGY, 2023, 6 (01)
  • [49] Robust voiceprint recognition with adaptive anti-noise ability based on fitting and compensation
    Chen, Zhuang
    Yu, Yibiao
    Shengxue Xuebao/Acta Acustica, 2022, 47 (01): : 151 - 160
  • [50] NOISE ROBUST INTEGRATION FOR BLIND AND NON-BLIND REVERBERATION TIME ESTIMATION
    Schuldt, Christian
    Handel, Peter
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 56 - 60