Reverberation and Noise Robust Feature Compensation Based on IMM

被引:9
|
作者
Han, Chang Woo [1 ,2 ]
Kang, Shin Jae [1 ,2 ]
Kim, Nam Soo [1 ,2 ]
机构
[1] Seoul Natl Univ, Sch Elect Engn, Seoul 151, South Korea
[2] Seoul Natl Univ, INMC, Seoul 151, South Korea
基金
新加坡国家研究基金会;
关键词
Dereverberation; feature compensation; interacting multiple model (IMM); MAXIMUM-LIKELIHOOD; SPEECH; ADAPTATION; ALGORITHM;
D O I
10.1109/TASL.2013.2256893
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we propose a novel feature compensation approach based on the interacting multiple model (IMM) algorithm specially designed for joint processing of background noise and acoustic reverberation. Our approach to cope with the time-varying environmental parameters is to establish a switching linear dynamic model for the additive and convolutive distortions, such as the background noise and acoustic reverberation, in the log-spectral domain. We construct multiple state space models with the speech corruption process in which the log spectra of clean speech and log frequency response of acoustic reverberation are jointly handled as the state of our interest. The proposed approach shows significant improvements in the Aurora-5 automatic speech recognition (ASR) task which was developed to investigate the influence on the performance of ASR for a hands-free speech input in noisy room environments.
引用
收藏
页码:1598 / 1611
页数:14
相关论文
共 50 条
  • [31] ROBUST ACOUSTIC FEATURE EXTRACTION FOR SOUND CLASSIFICATION BASED ON NOISE REDUCTION
    Ye, Jiaxing
    Kobayashi, Takumi
    Murakawa, Masahiro
    Higuchi, Tetsuya
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [32] On Noise Robust Feature for Speech Recognition Based on Power Function Family
    Pardede, Hilman F.
    2015 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ISPACS), 2015, : 386 - 390
  • [33] Residual noise compensation for robust speech recognition in nonstationary noise
    Yao, KS
    Shi, BE
    Fung, P
    Cao, ZG
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1125 - 1128
  • [34] Noise-robust feature based on sparse representation for speaker recognition
    Qi, Hongzhuo
    Metallurgical and Mining Industry, 2015, 7 (04): : 64 - 69
  • [35] Robust control with compensation of bounded perturbations and noise
    Tsykunov, A. M.
    JOURNAL OF COMPUTER AND SYSTEMS SCIENCES INTERNATIONAL, 2014, 53 (03) : 320 - 326
  • [36] Robust control with compensation of bounded perturbations and noise
    A. M. Tsykunov
    Journal of Computer and Systems Sciences International, 2014, 53 : 320 - 326
  • [37] Enhanced Speech Features by Single-Channel Joint Compensation of Noise and Reverberation
    Woefel, Matthias
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (02): : 312 - 323
  • [38] Variational Bayesian Based IMM Robust GPS Navigation Filter
    Jwo, Dah-Jing
    Chang, Wei-Yeh
    Computers, Materials and Continua, 2022, 72 (01): : 755 - 773
  • [39] Pitch synchronous based feature extraction for noise-robust speaker verification
    Gong Wei-Guo
    Yang Li-Ping
    Chen Di
    CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 5, PROCEEDINGS, 2008, : 295 - 298
  • [40] The Teager Energy based feature parameters for robust speech recognition in car noise
    Jabloun, F
    Çetin, AE
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 273 - 276