Speech pause detection for noise spectrum estimation by tracking power envelope dynamics

Cited by: 94
Authors
Marzinzik, M [1]
Kollmeier, B [1]
Affiliations
[1] Carl von Ossietzky Univ Oldenburg, Dept Phys Med, D-26111 Oldenburg, Germany
Source
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2002 / Vol. 10 / No. 02
Keywords
envelope dynamics; envelope minima; noise estimation; noise reduction; speech pause detection;
DOI
10.1109/89.985548
Chinese Library Classification
O42 [Acoustics];
Subject classification codes
070206 ; 082403 ;
Abstract
A speech pause detection algorithm is an important and sensitive part of most single-microphone noise reduction schemes for enhancement of speech signals corrupted by additive noise, as an estimate of the background noise is usually determined when speech is absent. An algorithm is proposed which detects speech pauses by adaptively tracking minima in a noisy signal's power envelope, both for the broadband signal and for the high-pass and low-pass filtered signals. At poor signal-to-noise ratios (SNRs), the proposed algorithm maintains a low false-alarm rate in the detection of speech pauses, while the standardized algorithm of ITU G.729 shows an increasing false-alarm rate in unfavorable situations. These characteristics are found with different types of noise and indicate that the proposed algorithm is better suited for noise estimation in noise reduction algorithms, as speech deterioration may thus be kept at a low level. It is shown that, in connection with the Ephraim-Malah noise reduction scheme [1], the speech pause detection performance can be increased even further by using the noise-reduced signal instead of the noisy signal as input for the speech pause decision unit.
Pages: 109 - 118
Page count: 10
Related papers
50 in total
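
The abstract describes detecting speech pauses by tracking minima in the power envelope of the noisy signal. The following is a minimal Python sketch of that general idea only, not the authors' algorithm: it uses the broadband envelope alone (no high-pass or low-pass bands), a plain sliding-window minimum and maximum instead of adaptive tracking, and hypothetical parameters (`window`, `rel_margin`, `min_range_db`) whose values are illustrative assumptions. The synthetic test signal (Gaussian noise with two sine bursts standing in for speech) is likewise an assumption made for the demonstration.

```python
import math
import random


def frame_power_db(samples, frame_len):
    """Return the mean power of each non-overlapping frame in dB."""
    powers = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        mean_sq = sum(x * x for x in frame) / frame_len
        powers.append(10.0 * math.log10(mean_sq + 1e-12))
    return powers


def detect_pauses(env_db, window=50, rel_margin=0.3, min_range_db=6.0):
    """Flag frames whose envelope lies close to the recent minimum.

    window, rel_margin and min_range_db are illustrative assumptions,
    not parameters taken from the paper.
    """
    flags = []
    for i, level in enumerate(env_db):
        recent = env_db[max(0, i - window + 1):i + 1]
        lo, hi = min(recent), max(recent)
        dyn = hi - lo  # envelope dynamic range over the recent window
        if dyn < min_range_db:
            # Assumption: very small dynamics means noise only.
            flags.append(True)
        else:
            # Pause if the level sits in the lowest part of the range.
            flags.append(level - lo <= rel_margin * dyn)
    return flags


# Synthetic demonstration: noise everywhere, "speech" in two bursts.
random.seed(0)
fs, frame_len, n_frames = 8000, 160, 100
speech_frames = set(range(30, 50)) | set(range(70, 80))
signal = []
for n in range(n_frames * frame_len):
    x = random.gauss(0.0, 0.05)
    if n // frame_len in speech_frames:
        x += math.sin(2.0 * math.pi * 200.0 * n / fs)
    signal.append(x)

env_db = frame_power_db(signal, frame_len)
flags = detect_pauses(env_db)
pause_frames = [i for i, is_pause in enumerate(flags) if is_pause]
print(f"{len(pause_frames)} of {len(flags)} frames flagged as pause")
```

In a noise reduction scheme, the frames flagged here would be the ones used to update the noise spectrum estimate. Treating a low dynamic range as "noise only" is a design choice of this sketch and would misfire on a signal that begins with sustained speech.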
  • [21] NOISE POWER ESTIMATION BASED ON THE PROBABILITY OF SPEECH PRESENCE
    Gerkmann, Timo
    Hendriks, Richard C.
    2011 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2011, : 145 - 148
  • [22] Noise Spectrum Estimation Based on SNR Discrepancy for Speech Enhancement
    Saha, Atanu
    Shimamura, Tetsuya
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (02) : 373 - 377
  • [23] Speech and noise power estimation using Gamma modeling
    Chehrehsa, Sarang
    Moir, Tom James
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2017, 31 (10) : 1491 - 1502
  • [24] MINIMUM SUBSPACE NOISE TRACKING FOR NOISE POWER SPECTRAL DENSITY ESTIMATION
    Triki, Mahdi
    Janse, Kees
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 29 - 32
  • [25] Delta-band neural envelope tracking predicts speech intelligibility in noise in preschoolers
    Van Hirtum, Tilde
    Somers, Ben
    Verschueren, Eline
    Dieudonne, Benjamin
    Francart, Tom
    HEARING RESEARCH, 2023, 434
  • [26] Speech Envelope Dynamics for Noise-Robust Auditory Scene Analysis in Robotics
    Rea, Francesco
    Kothig, Austin
    Grasse, Lukas
    Tata, Matthew
    INTERNATIONAL JOURNAL OF HUMANOID ROBOTICS, 2020, 17 (06)
  • [27] Evidence for enhanced neural tracking of the speech envelope underlying age-related speech-in-noise difficulties
    Decruy, Lien
    Vanthornhout, Jonas
    Francart, Tom
    JOURNAL OF NEUROPHYSIOLOGY, 2019, 122 (02) : 601 - 615
  • [28] Effects of manipulating the signal-to-noise envelope power ratio on speech intelligibility
    Jorgensen, Soren
    Decorsiere, Remi
    Dau, Torsten
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2015, 137 (03) : 1401 - 1410
  • [29] Cooperative spectrum sensing based on noise power estimation
    Rakovic, Valentin
    Pavlovska, Valentina
    Atanasovski, Vladimir
    Gavrilovska, Liljana
    2013 16TH INTERNATIONAL SYMPOSIUM ON WIRELESS PERSONAL MULTIMEDIA COMMUNICATIONS (WPMC), 2013
  • [30] Speech enhancement based on soft audible noise masking and noise power estimation
    Yu, Rongshan
    SPEECH COMMUNICATION, 2013, 55 (10) : 964 - 974