Voicing detection based on adaptive aperiodicity thresholding for speech enhancement in non-stationary noise

被引:4
|
作者
Cabanas-Molero, Pablo [1 ]
Martinez-Munoz, Damian [1 ]
Vera-Candeas, Pedro [1 ]
Ruiz-Reyes, Nicolas [1 ]
Jose Rodriguez-Serrano, Francisco [1 ]
机构
[1] Univ Jaen, Polytech Sch, Dept Telecommun Engn, Jaen 23700, Spain
关键词
hearing aids; speech enhancement; signal-to-noise ratios; voicing classifier; speech sentences database; fluctuating noise; signal-adaptive decision; nonstationary noise; adaptive aperiodicity thresholding; voicing detection; FUNDAMENTAL-FREQUENCY ESTIMATION; SPECTRAL SUBTRACTION; ENVIRONMENTS; ESTIMATOR;
D O I
10.1049/iet-spr.2012.0224
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this study, the authors present a novel voicing detection algorithm which employs the well-known aperiodicity measure to detect voiced speech in signals contaminated with non-stationary noise. The method computes a signal-adaptive decision threshold which takes into account the current noise level, enabling voicing detection by direct comparison with the extracted aperiodicity. This adaptive threshold is updated at each frame by making a simple estimate of the current noise power, and thus is adapted to fluctuating noise conditions. Once the aperiodicity is computed, the method only requires a small number of operations, and enables its implementation in challenging devices (such as hearing aids) if an efficient approximation of the difference function is employed to extract the aperiodicity. Evaluation over a database of speech sentences degraded by several types of noise reveals that the proposed voicing classifier is robust against different noises and signal-to-noise ratios. In addition, to evaluate the applicability of the method for speech enhancement, a simple F-0-based speech enhancement algorithm integrating the proposed classifier is implemented. The system is shown to achieve competitive results, in terms of objective measures, when compared with other well-known speech enhancement approaches.
引用
收藏
页码:119 / 130
页数:12
相关论文
共 50 条
  • [31] SPEECH ENHANCEMENT BASED ON SPATIAL AND ACOUSTICAL FEATURES IN NON-STATIONARY NOISY ENVIRONMENTS
    Mizumachi, Mitsunori
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONGRESS ON SOUND AND VIBRATION, 2010,
  • [32] Particle filter based non-stationary noise tracking for robust speech recognition
    Fujimoto, M
    Nakamura, S
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 257 - 260
  • [33] FEATURE ENHANCEMENT BY BIDIRECTIONAL LSTM NETWORKS FOR CONVERSATIONAL SPEECH RECOGNITION IN HIGHLY NON-STATIONARY NOISE
    Woellmer, Martin
    Zhang, Zixing
    Weninger, Felix
    Schuller, Bjoern
    Rigoll, Gerhard
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6822 - 6826
  • [34] Intelligent Adaptive Active Noise Control in Non-stationary Noise Environments
    Mu, Xiangbin
    Ko, JinSeok
    Rheem, JaeYeol
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2013, 32 (05): : 408 - 414
  • [35] Enhancement of Non-Stationary Speech using Harmonic Chirp Filters
    Norholm, Sidsel Marie
    Jensen, Jesper Rindom
    Christensen, Mads Graesboll
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1755 - 1759
  • [36] Non-linear feature extraction for robust speech recognition in stationary and non-stationary noise
    Zhu, QF
    Alwan, A
    COMPUTER SPEECH AND LANGUAGE, 2003, 17 (04): : 381 - 402
  • [37] MASKING AND INPAINTING: A TWO-STAGE SPEECH ENHANCEMENT APPROACH FOR LOW SNR AND NON-STATIONARY NOISE
    Hao, Xiang
    Su, Xiangdong
    Wen, Shixue
    Wang, Zhiyu
    Pan, Yiqian
    Bao, Feilong
    Chen, Wei
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6959 - 6963
  • [38] A HIGHLY NON-STATIONARY NOISE TRACKING AND COMPENSATION ALGORITHM, WITH APPLICATIONS TO SPEECH ENHANCEMENT AND ON-LINE ASR
    Chowdhury, Md Foezur Rahman
    Selouani, Sid-Ahmed
    O'Shaughnessy, Douglas
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4337 - 4340
  • [39] Adaptive identification of non-Gaussian/non-stationary glint noise
    Wu, WR
    Wu, KG
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 1999, E82A (12) : 2783 - 2792
  • [40] Ranked signals detection on the background of non-stationary noise
    Linkevichyus, S.P.
    Diangong Jishu Xuebao/Transactions of China Electrotechnical Society, 1998, 13 (05): : 3 - 5