Speech pause detection for noise spectrum estimation by tracking power envelope dynamics

Cited by: 94
Authors
Marzinzik, M [1]
Kollmeier, B [1]
Affiliations
[1] Carl von Ossietzky Univ Oldenburg, Dept Phys Med, D-26111 Oldenburg, Germany
Source
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2002 / Vol. 10 / Issue 02
Keywords
envelope dynamics; envelope minima; noise estimation; noise reduction; speech pause detection;
DOI
10.1109/89.985548
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206; 082403;
Abstract
A speech pause detection algorithm is an important and sensitive part of most single-microphone noise reduction schemes for enhancing speech signals corrupted by additive noise, because the estimate of the background noise is usually determined when speech is absent. An algorithm is proposed which detects speech pauses by adaptively tracking minima in a noisy signal's power envelope, both for the broadband signal and for the high-pass and low-pass filtered signal. At poor signal-to-noise ratios (SNRs), the proposed algorithm maintains a low false-alarm rate in the detection of speech pauses, while the standardized algorithm of ITU G.729 shows an increasing false-alarm rate in unfavorable situations. These characteristics are found with different types of noise and indicate that the proposed algorithm is better suited for noise estimation in noise reduction algorithms, as speech deteriorations may thus be kept at a low level. It is shown that, in connection with the Ephraim-Malah noise reduction scheme [1], the speech pause detection performance can be increased even further by using the noise-reduced signal instead of the noisy signal as input for the speech pause decision unit.
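The idea of tracking minima in a power envelope can be illustrated with a minimal sketch. This is not the authors' algorithm: it works on the broadband signal only (the abstract also mentions high-pass and low-pass filtered versions), and the function names, the tracker update rule, and the values of `release`, `margin_db` and `min_range_db` are all assumptions chosen for illustration.

```python
import math
import random


def frame_power_db(signal, frame_len):
    """Mean power of consecutive non-overlapping frames, in dB."""
    powers = []
    for start in range(0, len(signal) - frame_len + 1, frame_len):
        frame = signal[start:start + frame_len]
        mean_sq = sum(x * x for x in frame) / frame_len
        powers.append(10.0 * math.log10(mean_sq + 1e-12))
    return powers


def detect_pauses(env_db, release=0.01, margin_db=5.0, min_range_db=6.0):
    """Flag frames whose envelope lies near an adaptively tracked minimum.

    All three parameters are illustrative assumptions, not values from the paper.
    """
    e_min = e_max = env_db[0]
    flags = []
    for e in env_db:
        # Minimum tracker: follow drops immediately, drift upward slowly.
        e_min = e if e < e_min else e_min + release * (e - e_min)
        # Maximum tracker: follow rises immediately, decay downward slowly.
        e_max = e if e > e_max else e_max + release * (e - e_max)
        # A pause needs enough observed dynamic range and an envelope
        # that sits within margin_db of the tracked minimum.
        near_minimum = (e - e_min) <= margin_db
        flags.append((e_max - e_min) >= min_range_db and near_minimum)
    return flags


# Synthetic test signal (hypothetical): noise, then a tone in noise, then noise.
random.seed(0)
FRAME = 256


def segment(n_frames, speech):
    out = []
    for n in range(n_frames * FRAME):
        tone = math.sin(2 * math.pi * 200 * n / 8000) if speech else 0.0
        out.append(tone + random.gauss(0.0, 0.1))
    return out


signal = segment(50, False) + segment(30, True) + segment(50, False)
flags = detect_pauses(frame_power_db(signal, FRAME))
```

In this sketch the `min_range_db` check suppresses pause decisions while the envelope has shown little dynamic range, which is one simple way to avoid calling everything a pause in a signal that contains only noise. A noise spectrum estimate would then be updated only in frames where `flags` is true.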
Pages: 109-118
Page count: 10
Related papers
50 in total
  • [1] Noise power spectrum estimation for speech enhancement using an autoregressive model for speech power spectrum dynamics
    Batina, Ivo
    Jensen, Jesper
    Heusdens, Richard
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 3515 - 3518
  • [2] ANALYSIS-SYNTHESIS BASED SPEECH ENHANCEMENT WITH IMPROVED SPECTRUM ENVELOPE ESTIMATION BY TRACKING SPEECH DYNAMICS
    Chen, Ruofei
    Chan, Cheung-Fat
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4644 - 4647
  • [3] Noise power spectrum estimation based on speech presence probability
    Zhao Y.-P.
    Zhao X.-H.
    Wang B.
    Zhao, Xiao-Hui (xhzhao@jlu.edu.cn), 1600, Editorial Board of Jilin University (46): : 917 - 922
  • [4] NOISE POWER SPECTRUM ESTIMATION BASED ON WEAK SPEECH PROTECTION FOR SPEECH ENHANCEMENT
    Feng, Yan
    An, Baokun
    2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 484 - 487
  • [5] Efficient methods in LPA using power spectrum estimation of envelope of speech signal
    R.G.M. College of Engineering, Nandyal, Kurnool Dist, A.P., India
    Unknown
    Unknown
    Inf. Technol. J., 2007, 2 (300-303):
  • [6] Model-Based Speech Enhancement With Improved Spectral Envelope Estimation via Dynamics Tracking
    Chen, Ruofei
    Chan, Cheung-Fat
    So, Hing Cheung
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (04): : 1324 - 1336
  • [7] ADAPTIVE NOISE POWER SPECTRUM ESTIMATION FOR COMPACT DUAL CHANNEL SPEECH ENHANCEMENT
    Jeong, So-Young
    Kim, Kyuhong
    Jeong, Jae-Hoon
    Oh, Kwang-Cheol
    Kim, Jeongsu
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 1630 - 1633
  • [8] Robust Estimation of Non-Stationary Noise Power Spectrum for Speech Enhancement
    Mai, Van-Khanh
    Pastor, Dominique
    Aissa-El-Bey, Abdeldjalil
    Le-Bidan, Raphael
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (04) : 670 - 682
  • [9] iDeepMMSE: An Improved Deep Learning Approach to MMSE Speech and Noise Power Spectrum Estimation for Speech Enhancement
    Kim, Minseung
    Song, Hyungchan
    Cheong, Sein
    Shin, Jong Won
    INTERSPEECH 2022, 2022, : 181 - 185
  • [10] Predicting binaural speech intelligibility using the signal-to-noise ratio in the envelope power spectrum domain
    Chabot-Leclerc, Alexandre
    MacDonald, Ewen N.
    Dau, Torsten
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2016, 140 (01): : 192 - 205