Reverberation and Noise Robust Feature Compensation Based on IMM

Cited by: 9

Authors:

Han, Chang Woo [1,2]

Kang, Shin Jae [1,2]

Kim, Nam Soo [1,2]

Affiliations:

[1] Seoul Natl Univ, Sch Elect Engn, Seoul 151, South Korea

[2] Seoul Natl Univ, INMC, Seoul 151, South Korea

Source:

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2013, Vol. 21, No. 8

Funding:

National Research Foundation of Singapore;

Keywords:

Dereverberation; feature compensation; interacting multiple model (IMM); MAXIMUM-LIKELIHOOD; SPEECH; ADAPTATION; ALGORITHM;

DOI:

10.1109/TASL.2013.2256893

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

In this paper, we propose a novel feature compensation approach based on the interacting multiple model (IMM) algorithm specially designed for joint processing of background noise and acoustic reverberation. Our approach to cope with the time-varying environmental parameters is to establish a switching linear dynamic model for the additive and convolutive distortions, such as the background noise and acoustic reverberation, in the log-spectral domain. We construct multiple state space models with the speech corruption process in which the log spectra of clean speech and log frequency response of acoustic reverberation are jointly handled as the state of our interest. The proposed approach shows significant improvements in the Aurora-5 automatic speech recognition (ASR) task which was developed to investigate the influence on the performance of ASR for a hands-free speech input in noisy room environments.
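
For readers unfamiliar with the IMM algorithm named in the abstract, the following is a minimal sketch of one generic IMM cycle (mixing, per-model Kalman filtering, model-probability update, combination) in Python with NumPy. It is not the authors' implementation: it assumes every model is linear Gaussian, and the function name `imm_step`, the dictionary keys `F`, `Q`, `H`, `R`, and all matrix values are hypothetical placeholders. In the paper's setting the state would hold the log spectra of clean speech and the log frequency response of the reverberation, and the models would come from the speech corruption process, neither of which is reproduced here.

```python
import numpy as np

def imm_step(x, P, mu, Pi, models, z):
    """One cycle of a generic IMM filter (illustrative sketch only).

    x:  (M, n) per-model state means      P:  (M, n, n) per-model covariances
    mu: (M,) model probabilities          Pi: (M, M), Pi[i, j] = P(model j now | model i before)
    models: list of M dicts with hypothetical keys 'F', 'Q', 'H', 'R'
    z:  (m,) current observation
    """
    M = len(models)
    # 1) Mixing: predicted model probabilities and mixing weights.
    c = Pi.T @ mu                            # c[j] = sum_i Pi[i, j] * mu[i]
    w = (Pi * mu[:, None]) / c[None, :]      # w[i, j] = P(model i before | model j now)

    x_new = np.empty_like(x)
    P_new = np.empty_like(P)
    lik = np.empty(M)
    for j, m in enumerate(models):
        # Mixed initial mean and covariance for model j.
        x0 = w[:, j] @ x
        d = x - x0
        P0 = np.einsum('i,ijk->jk', w[:, j], P + d[:, :, None] * d[:, None, :])
        # 2) Kalman prediction and update under model j.
        xp = m['F'] @ x0
        Pp = m['F'] @ P0 @ m['F'].T + m['Q']
        r = z - m['H'] @ xp                  # innovation
        S = m['H'] @ Pp @ m['H'].T + m['R']  # innovation covariance
        K = Pp @ m['H'].T @ np.linalg.inv(S)
        x_new[j] = xp + K @ r
        P_new[j] = (np.eye(len(xp)) - K @ m['H']) @ Pp
        # Gaussian likelihood of the observation under model j.
        lik[j] = np.exp(-0.5 * r @ np.linalg.solve(S, r)) / np.sqrt(np.linalg.det(2 * np.pi * S))

    # 3) Model-probability update.
    mu_new = c * lik
    mu_new /= mu_new.sum()
    # 4) Combination: moment-matched overall estimate.
    x_hat = mu_new @ x_new
    d = x_new - x_hat
    P_hat = np.einsum('i,ijk->jk', mu_new, P_new + d[:, :, None] * d[:, None, :])
    return x_new, P_new, mu_new, x_hat, P_hat
```

Calling `imm_step` once per frame, feeding back the returned per-model means, covariances, and model probabilities, gives a time-recursive estimate in which the model probabilities track which environment model currently fits the observations best.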

Pages: 1598-1611

Page count: 14
