Harmonic enhancement using learnable comb filter for light-weight full-band speech enhancement model

Cited by: 2
Authors
Le, Xiaohuai [1, 2]
Lei, Tong [1, 3]
Chen, Li [2 ]
Guo, Yiqing [2 ]
He, Chao [2 ]
Chen, Cheng [2 ]
Xia, Xianjun [2 ]
Gao, Hua [2 ]
Xiao, Yijian [2 ]
Ding, Piao [2 ]
Song, Shenyi [2 ]
Lu, Jing [1, 3]
Affiliations
[1] Nanjing Univ, Key Lab Modern Acoust, Nanjing 210093, Peoples R China
[2] ByteDance, RTC Lab, Beijing, Peoples R China
[3] Horizon Robot, NJU Horizon Intelligent Audio Lab, Beijing 100094, Peoples R China
Source
Funding
National Natural Science Foundation of China
Keywords
Comb filter; Speech enhancement; PercepNet; DeepFilterNet; Networks
DOI
10.21437/Interspeech.2023-186
Chinese Library Classification (CLC) number
O42 [Acoustics]
Subject classification code
070206; 082403
Abstract
Because they reduce the feature dimension, filter banks are often used in light-weight full-band speech enhancement models. To further enhance the coarse speech in the sub-band domain, post-filtering is needed for harmonic retrieval. The signal processing-based comb filters used in RNNoise and PercepNet have limited performance and may degrade speech quality because of inaccurate fundamental frequency estimation. To tackle this problem, we propose a learnable comb filter to enhance harmonics. Based on the sub-band model, we design a DNN-based fundamental frequency estimator to estimate the discrete fundamental frequencies and a comb filter for harmonic enhancement, which are trained end-to-end. The experiments show the advantages of our proposed method over PercepNet and DeepFilterNet.
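To illustrate the comb-filtering idea the abstract refers to, here is a minimal sketch of a generic feed-forward comb filter in Python. It is not the paper's learnable filter: the function name `comb_filter`, the fixed tap weights, and the known pitch period are all assumptions made for illustration. In the proposed method, the fundamental frequency would come from the DNN-based estimator and the filter would be trained jointly with it.

```python
import numpy as np

def comb_filter(x, period, weights):
    """Hypothetical causal feed-forward comb filter:
    y[n] = sum_k weights[k] * x[n - k * period]."""
    x = np.asarray(x, dtype=float)
    y = np.zeros_like(x)
    for k, w in enumerate(weights):
        delay = k * period
        if delay >= len(x):
            break
        # Add the input delayed by `delay` samples (zero-padded at the start).
        y[delay:] += w * x[:len(x) - delay]
    return y

# Assumed setup: a pitch period of 80 samples (200 Hz at a 16 kHz sample rate).
period = 80
n = np.arange(1600)
harmonic = np.sin(2 * np.pi * n / period)              # lies on the fundamental
inter_harmonic = np.sin(2 * np.pi * 1.5 * n / period)  # halfway between harmonics

weights = [0.5, 0.5]  # fixed here; a learnable filter would adapt these
y_harmonic = comb_filter(harmonic, period, weights)
y_inter = comb_filter(inter_harmonic, period, weights)
```

After the first pitch period, the component at the fundamental passes through unchanged while the component halfway between two harmonics cancels. This also shows why an inaccurate pitch estimate hurts: if `period` is wrong, the delayed copies no longer align with the speech harmonics and the harmonics themselves are attenuated.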
Pages: 3894-3898
Page count: 5