Weight-Space Viterbi Decoding Based Spectral Subtraction for Reverberant Speech Recognition

被引:8
|
作者
Ban, Sung Min [1 ]
Kim, Hyung Soon [1 ]
机构
[1] Pusan Natl Univ, Dept Elect Engn, Pusan 609735, South Korea
关键词
Dereverberation; spectral subtraction; speech recognition; viterbi decoding; MODEL ADAPTATION; DEREVERBERATION; SUPPRESSION;
D O I
10.1109/LSP.2015.2408371
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A single-channel blind dereverberation algorithm is proposed in this letter for distant-talking speech recognition. The proposed method is based on spectral subtraction (SS) method, in which the spectrum of a late reverberant signal is estimated using a delayed and attenuated version of the reverberant signal. Through some assumptions, the conventional SS method regards the attenuation weight as a constant that is a function of reverberation time. However, these assumptions are not valid in real situations, and the ideal weight varies with the frame. Therefore, in the proposed method, the variable weight sequence is estimated using Viterbi decoding scheme based on the reverberation model. This weight sequence is then substituted for the fixed weight in the conventional SS method without explicitly estimating the reverberation time. The proposed method performs better than the conventional SS method in both isolated word recognition and connected digit recognition experiments in reverberant environments.
引用
收藏
页码:1424 / 1428
页数:5
相关论文
共 50 条
  • [1] Speech Recognition by Denoising and Dereverberation Based on Spectral Subtraction in a Real Noisy Reverberant Environment
    Odani, Kyohei
    Wang, Longbiao
    Kai, Atsuhiko
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1250 - 1253
  • [2] OPTIMIZING SPECTRAL SUBTRACTION AND WIENER FILTERING FOR ROBUST SPEECH RECOGNITION IN REVERBERANT AND NOISY CONDITIONS
    Gomez, Randy
    Kawahara, Tatsuya
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4566 - 4569
  • [3] Speech Enhancement Based on Spectral Subtraction for Speech Recognition System
    Han, Jung-woo
    Kim, Se-young
    Kim, Ki-man
    Jung, Ji-won
    Yun, Young
    IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE 2011), 2011, : 417 - 418
  • [4] An Auditory Based Modulation Spectral Feature for Reverberant Speech Recognition
    Maganti, HariKrishna
    Matassoni, Marco
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 570 - 573
  • [5] Spectral subtraction for speech recognition with increased spectral resolution
    Zhang, Weiqiang
    Xu, Chen
    Journal of Information and Computational Science, 2008, 5 (05): : 2253 - 2258
  • [6] Q-Gaussian based spectral subtraction for robust speech recognition
    Pardede, Hilman E.
    Shinoda, Koichi
    Iwano, Koji
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1254 - 1257
  • [7] Speech recognition in nonstationary noise based on parallel HMMs and spectral subtraction
    Mine, R
    Kobayashi, T
    Shirai, K
    SYSTEMS AND COMPUTERS IN JAPAN, 1996, 27 (14) : 37 - 44
  • [8] Robust Speech Recognition Based on Multi-band Spectral Subtraction
    Wan, Yi-Long
    Zhang, Tian-Qi
    Wang, Zhi-Chao
    Jin, Jing
    2013 6TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP), VOLS 1-3, 2013, : 36 - 40
  • [9] Reduced Memory Viterbi Decoding for Hardware-accelerated Speech Recognition
    Raj, Pani Prithvi
    Reddy, Pakala Akhil
    Chandrachoodan, Nitin
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2022, 21 (03)
  • [10] Spectral Subtraction Based on Non-extensive Statistics for Speech Recognition
    Pardede, Hilman
    Iwano, Koji
    Shinoda, Koichi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (08): : 1774 - 1782