Speech pause detection for noise spectrum estimation by tracking power envelope dynamics

被引:94
|
作者
Marzinzik, M [1 ]
Kollmeier, B [1 ]
机构
[1] Carl von Ossietzky Univ Oldenburg, Dept Phys Med, D-26111 Oldenburg, Germany
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2002年 / 10卷 / 02期
关键词
envelope dynamics; envelope minima; noise estimation; noise reduction; speech pause detection;
D O I
10.1109/89.985548
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A speech pause detection algorithm is an important and sensitive part of most single-microphone noise reduction schemes for enhancement of speech signals corrupted by additive noise as an estimate of the background noise is usually determined when speech is absent. An algorithm is proposed which detects speech pauses by adaptively tracking minima in a noisy signal's power envelope both for the broadband signal and for the high-pass and low-pass filtered signal. In poor signal-to-noise ratios (SNRs), the proposed algorithm maintains a low false-alarm rate in the detection of speech pauses while the standardized algorithm of ITU G.729 shows an increasing false-alarm rate in unfavorable situations. These characteristics are found with different types of noise and indicate that the proposed algorithm is better suited to be used for noise estimation in noise reduction algorithms, as speech deteriorations may thus be kept at a low level. It is shown that in connection with the Ephraim-Malah noise reduction scheme [1], the speech pause detection performance can even be further increased by using the noise-reduced signal instead of the noisy signal as input for the speech pause decision unit.
引用
收藏
页码:109 / 118
页数:10
相关论文
共 50 条
  • [41] Maximum likelihood channel estimation with noise of unknown power spectrum
    Hui, D
    Zangi, KC
    IEEE VTC 53RD VEHICULAR TECHNOLOGY CONFERENCE, SPRING 2001, VOLS 1-4, PROCEEDINGS, 2001, : 1614 - 1618
  • [42] Noise estimation for speech enhancement by the estimated degree of noise without voice activity detection
    Hamid, M. Ekramul
    Ogawa, Keita
    Fukabayashi, Takeshi
    PROCEEDINGS OF THE EIGHTH IASTED INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING, 2006, : 420 - +
  • [43] Tracking of nonstationary noise based on data-driven recursive noise power estimation
    Erkelens, Jan S.
    Heusdens, Richard
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (06): : 1112 - 1123
  • [44] Speech Enhancement, Gain, and Noise Spectrum Adaptation Using Approximate Bayesian Estimation
    Hao, Jiucang
    Attias, Hagai
    Nagarajan, Srikantan
    Lee, Te-Won
    Sejnowski, Terrence J.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (01): : 24 - 37
  • [45] Noise Spectrum Estimation Using Line Spectral Frequencies for Robust Speech Recognition
    Jang, Gil-Jin
    Park, Jeong-Sik
    Kim, Sanghun
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2012, 31 (03): : 179 - 187
  • [46] Optimal Simultaneous Detection and Signal and Noise Power Estimation
    Le, Long
    Jones, Douglas L.
    2014 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2014, : 571 - 575
  • [47] SNR Wall for Energy Detection with Noise Power Estimation
    Mariani, Andrea
    Giorgetti, Andrea
    Chiani, Marco
    2011 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2011,
  • [48] Voice activity detection using density ratio estimation of speech and noise
    Tachioka, Yuuki
    Hanazawa, Toshiyuki
    Narita, Tomohiro
    Ishii, Jun
    IEEJ Transactions on Electronics, Information and Systems, 2013, 133 (08) : 1549 - 1555
  • [49] Amplitude and envelope phase noise of a modelocked laser predicted from its noise transfer function and the pump noise power spectrum
    Mulder, Theresa D.
    Scott, Ryan P.
    Kolner, Brian H.
    OPTICS EXPRESS, 2008, 16 (18) : 14186 - 14191
  • [50] TEDS Base-Station Power Amplifier Using Low-Noise Envelope Tracking Power Supply
    Hoyerby, Mikkel C. W.
    Andersen, Michael A. E.
    IEEE TRANSACTIONS ON MICROWAVE THEORY AND TECHNIQUES, 2009, 57 (07) : 1687 - 1693