Speech pause detection for noise spectrum estimation by tracking power envelope dynamics

被引:94
|
作者
Marzinzik, M [1 ]
Kollmeier, B [1 ]
机构
[1] Carl von Ossietzky Univ Oldenburg, Dept Phys Med, D-26111 Oldenburg, Germany
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2002年 / 10卷 / 02期
关键词
envelope dynamics; envelope minima; noise estimation; noise reduction; speech pause detection;
D O I
10.1109/89.985548
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A speech pause detection algorithm is an important and sensitive part of most single-microphone noise reduction schemes for enhancement of speech signals corrupted by additive noise as an estimate of the background noise is usually determined when speech is absent. An algorithm is proposed which detects speech pauses by adaptively tracking minima in a noisy signal's power envelope both for the broadband signal and for the high-pass and low-pass filtered signal. In poor signal-to-noise ratios (SNRs), the proposed algorithm maintains a low false-alarm rate in the detection of speech pauses while the standardized algorithm of ITU G.729 shows an increasing false-alarm rate in unfavorable situations. These characteristics are found with different types of noise and indicate that the proposed algorithm is better suited to be used for noise estimation in noise reduction algorithms, as speech deteriorations may thus be kept at a low level. It is shown that in connection with the Ephraim-Malah noise reduction scheme [1], the speech pause detection performance can even be further increased by using the noise-reduced signal instead of the noisy signal as input for the speech pause decision unit.
引用
收藏
页码:109 / 118
页数:10
相关论文
共 50 条
  • [31] Predicting speech intelligibility based on a correlation metric in the envelope power spectrum domain
    Relano-Iborra, Helia
    May, Tobias
    Zaar, Johannes
    Scheidiger, Christoph
    Dau, Torsten
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2016, 140 (04): : 2670 - 2679
  • [32] QRD BASED MVDR BEAMFORMING FOR FAST TRACKING OF SPEECH AND NOISE DYNAMICS
    Barnov, Anna
    Bar Bracha, Vered
    Markovich-Golan, Shmulik
    2017 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2017, : 369 - 373
  • [33] Noise estimation using speech/non-speech frame decision and subband spectral tracking
    Lin, Zhong
    Goubran, Rafik A.
    Dansereau, Richard M.
    SPEECH COMMUNICATION, 2007, 49 (7-8) : 542 - 557
  • [34] Simultaneous Speech Detection and Magnitude Squared Spectrum Estimation Approach for Speech Enhancement
    Han, Ruirui
    Ou, Shifeng
    Liu, Wei
    Chen, Chen
    Zhang, Shuo
    PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2018, : 281 - 285
  • [35] NOISE POWER ESTIMATION AND OVERLAPPED SIGNALS DETECTION
    TACCONI, EJ
    CARASI, JJ
    SIGNAL PROCESSING, 1991, 23 (02) : 145 - 155
  • [36] Speech endpoint detection based on the dynamic segmentation of power spectral envelope
    Xu, Chun-Dong
    Wang, Jing
    Zhang, Ge
    Ying, Dong-Wen
    Li, Jun-Feng
    Yan, Yong-Hong
    Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2015, 35 (11): : 1189 - 1193
  • [37] Adaptive-threshold neural spike detection by noise-envelope tracking
    Traver, L.
    Tain, C.
    Marti, P.
    Cardona, N.
    ELECTRONICS LETTERS, 2007, 43 (24) : 1333 - 1335
  • [38] Noise-robust speech recognition based on difference of power spectrum
    Xu, JF
    Wei, G
    ELECTRONICS LETTERS, 2000, 36 (14) : 1247 - 1248
  • [39] FPGA Implementation of Dynamic Quantile Tracking based Noise Estimation for Speech Enhancement
    Kumar B.
    Varma A.K.
    Journal of Engineering Science and Technology Review, 2023, 16 (04) : 171 - 179
  • [40] Noise Estimation for Speech Enhancement Based on Quasi-Gaussian Distributed Power Spectrum Series by Radical Root Transformation
    Ye, Tian
    Yokota, Yasunari
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2017, E100A (06): : 1306 - 1314