Single-Channel Speech Enhancement Based on Improved Frame-Iterative Spectral Subtraction in the Modulation Domain

被引:0
|
作者
Li, Chao [1 ]
Jiang, Ting [1 ]
Wu, Sheng [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing 100876, Peoples R China
基金
中国国家自然科学基金; 国家自然科学基金重大项目;
关键词
short-time modulation domain; single-channel speech enhancement; modulation improved frame iterative spectral subtraction; low SNRs; MEAN-SQUARE ERROR; NOISE; MAGNITUDE;
D O I
暂无
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Aiming at the problem of music noise introduced by classical spectral subtraction, a short-time modulation domain (STM) spectral subtraction method has been successfully applied for single-channel speech enhancement. However, due to the inaccurate voice activity detection (VAD), the residual music noise and enhanced performance still need to be further improved, especially in the low signal to noise ratio (SNR) scenarios. To address this issue, an improved frame iterative spectral subtraction in the STM domain (IMModSSub) is proposed. More specifically, with the inter-frame correlation, the noise subtraction is directly applied to handle the noisy signal for each frame in the STM domain. Then, the noisy signal is classified into speech or silence frames based on a predefined threshold of segmented SNR. With these classification results, a corresponding mask function is developed for noisy speech after noise subtraction. Finally, exploiting the increased sparsity of speech signal in the modulation domain, the orthogonal matching pursuit (OMP) technique is employed to the speech frames for improving the speech quality and intelligibility. The effectiveness of the proposed method is evaluated with three types of noise, including white noise, pink noise, and hfchannel noise. The obtained results show that the proposed method outperforms some established baselines at lower SNRs (5 to +5 dB).
引用
收藏
页码:100 / 115
页数:16
相关论文
共 50 条
  • [11] Speech Enhancement Algorithm Based on Improved Spectral Subtraction
    Gao, Liuyang
    Guo, Yunfei
    Li, Shaomei
    Chen, Fucai
    2009 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INTELLIGENT SYSTEMS, PROCEEDINGS, VOL 3, 2009, : 140 - 143
  • [12] Single-Channel Speech Enhancement Based on Sub-Band Spectral Entropy
    Wei, Yi
    Zeng, Yumin
    Li, Chen
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2018, 66 (03): : 100 - 113
  • [13] Combine Waveform and Spectral Methods for Single-channel Speech Enhancement
    Li, Miao
    Zhang, Hui
    Zhang, Xueliang
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 47 - 52
  • [14] Modified Amplitude Spectral Estimator for Single-Channel Speech Enhancement
    Zhai, Zhenhui
    Ou, Shifeng
    Gao, Ying
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN MECHANICAL ENGINEERING AND INDUSTRIAL INFORMATICS (AMEII 2016), 2016, 73 : 1115 - 1120
  • [15] Single Channel Speech Enhancement Utilizing Iterative Processing of Multi-Band Spectral Subtraction Algorithm
    Upadhyay, Navneet
    Karmakar, Abhijit
    2012 2ND INTERNATIONAL CONFERENCE ON POWER, CONTROL AND EMBEDDED SYSTEMS (ICPCES 2012), 2012,
  • [16] Phase-Aware Single-Channel Speech Enhancement With Modulation-Domain Kalman Filtering
    Dionelis, Nikolaos
    Brookes, Mike
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (05) : 937 - 950
  • [17] Analysis of Optimized Spectral Subtraction Method for Single Channel Speech Enhancement
    Monika Gupta
    R. K. Singh
    Sachin Singh
    Wireless Personal Communications, 2023, 128 : 2203 - 2215
  • [18] Analysis of Optimized Spectral Subtraction Method for Single Channel Speech Enhancement
    Gupta, Monika
    Singh, R. K.
    Singh, Sachin
    WIRELESS PERSONAL COMMUNICATIONS, 2023, 128 (03) : 2203 - 2215
  • [19] STFT Phase Reconstruction in Voiced Speech for an Improved Single-Channel Speech Enhancement
    Krawczyk, Martin
    Gerkmann, Timo
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (12) : 1931 - 1940
  • [20] Single-Channel Speech Enhancement Based on Psychoacoustic Masking
    Zhou, Tingting
    Zeng, Yumin
    Wang, Rongrong
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2017, 65 (04): : 272 - 284