Weight-Space Viterbi Decoding Based Spectral Subtraction for Reverberant Speech Recognition

被引：8

作者：

Ban, Sung Min ^{[1
]}

Kim, Hyung Soon ^{[1
]}

机构：

[1] Pusan Natl Univ, Dept Elect Engn, Pusan 609735, South Korea

来源：

IEEE SIGNAL PROCESSING LETTERS | 2015年 / 22卷 / 09期

关键词：

Dereverberation; spectral subtraction; speech recognition; viterbi decoding; MODEL ADAPTATION; DEREVERBERATION; SUPPRESSION;

D O I：

10.1109/LSP.2015.2408371

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

A single-channel blind dereverberation algorithm is proposed in this letter for distant-talking speech recognition. The proposed method is based on spectral subtraction (SS) method, in which the spectrum of a late reverberant signal is estimated using a delayed and attenuated version of the reverberant signal. Through some assumptions, the conventional SS method regards the attenuation weight as a constant that is a function of reverberation time. However, these assumptions are not valid in real situations, and the ideal weight varies with the frame. Therefore, in the proposed method, the variable weight sequence is estimated using Viterbi decoding scheme based on the reverberation model. This weight sequence is then substituted for the fixed weight in the conventional SS method without explicitly estimating the reverberation time. The proposed method performs better than the conventional SS method in both isolated word recognition and connected digit recognition experiments in reverberant environments.

引用

页码：1424 / 1428

页数：5

共 50 条

[1] Speech Recognition by Denoising and Dereverberation Based on Spectral Subtraction in a Real Noisy Reverberant Environment
Odani, Kyohei
Wang, Longbiao
Kai, Atsuhiko
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1250 - 1253
[2] OPTIMIZING SPECTRAL SUBTRACTION AND WIENER FILTERING FOR ROBUST SPEECH RECOGNITION IN REVERBERANT AND NOISY CONDITIONS
Gomez, Randy
Kawahara, Tatsuya
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4566 - 4569
[3] Speech Enhancement Based on Spectral Subtraction for Speech Recognition System
Han, Jung-woo
Kim, Se-young
Kim, Ki-man
Jung, Ji-won
Yun, Young
IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE 2011), 2011, : 417 - 418
[4] An Auditory Based Modulation Spectral Feature for Reverberant Speech Recognition
Maganti, HariKrishna
Matassoni, Marco
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 570 - 573
[5] Spectral subtraction for speech recognition with increased spectral resolution
Zhang, Weiqiang
Xu, Chen
Journal of Information and Computational Science, 2008, 5 (05): : 2253 - 2258
[6] Q-Gaussian based spectral subtraction for robust speech recognition
Pardede, Hilman E.
Shinoda, Koichi
Iwano, Koji
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1254 - 1257
[7] Speech recognition in nonstationary noise based on parallel HMMs and spectral subtraction
Mine, R
Kobayashi, T
Shirai, K
SYSTEMS AND COMPUTERS IN JAPAN, 1996, 27 (14) : 37 - 44
[8] Robust Speech Recognition Based on Multi-band Spectral Subtraction
Wan, Yi-Long
Zhang, Tian-Qi
Wang, Zhi-Chao
Jin, Jing
2013 6TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP), VOLS 1-3, 2013, : 36 - 40
[9] Reduced Memory Viterbi Decoding for Hardware-accelerated Speech Recognition
Raj, Pani Prithvi
Reddy, Pakala Akhil
Chandrachoodan, Nitin
ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2022, 21 (03)
[10] Spectral Subtraction Based on Non-extensive Statistics for Speech Recognition
Pardede, Hilman
Iwano, Koji
Shinoda, Koichi
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (08): : 1774 - 1782

← 1 2 3 4 5 →