Speech Enhancement using Convolution Neural Network-based Spectrogram Denoising

被引:0
|
作者
Hu Xuhong [1 ]
Yan Lin-Huang [2 ]
Lu Xun [3 ]
Guan Yuan-Sheng [2 ]
Hu Wenlin [1 ]
Wang Jie [2 ,4 ]
机构
[1] China Railway Design Corp, Natl Engn Lab Digital Construct & Evaluat Urban R, Tianjin, Peoples R China
[2] Guangzhou Univ, Sch Elect & Commun Engn, Guangzhou, Guangdong, Peoples R China
[3] Guangdong Power Grid Co, Power Grid Planning Ctr, Guangzhou, Guangdong, Peoples R China
[4] Ctr Rd Traff Noise Control, Natl Environm Protect Engn & Technol, Beijing, Peoples R China
关键词
Speech enhancement; deep learning; convolution neural network; spectrogram denoising; NOISE; EFFICIENT;
D O I
10.1109/CMMNO53328.2021.9467599
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
Regarding spectrogram as an image, this paper adopts a convolution neural network (CNN)-based image enhancement algorithm for spectrogram denoising. By doing so, speech denoising can be achieved when the spectrogram is enhanced by the proposed CNN-based image enhancement algorithm. The spectrogram clipping strategy was presented to obtain a large amount of training data, which gave rise to a smaller storage cost and avoided the limited depth development and problem of excessive complexity commonly presented in traditional speech features when training a recurrent neural network. Meanwhile, a deeper network was constructed to improve the capacity and flexibility to use the features of the spectrogram better, and it can also capture enough spatial information to make the noise reduction performance effectively. In addition, the proposed model utilized residual learning strategy in CNN training, with the combination of batch normalization, which greatly improved the performance of the model. The experimental results demonstrates that the proposed spectrogram denoising model has better learning ability and denoising performance, whether it is a known noise situation or a noise mismatch situation, so that the proposed system shows robust speech enhancement effect.
引用
收藏
页码:310 / 318
页数:9
相关论文
共 50 条
  • [41] A wavelet network-based speech enhancement system using noisy-as-clean strategy
    Hajiaghababa, Fatemeh
    Abutalebi, Hamid Reza
    INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2024, 22 (01)
  • [42] Convolution Neural Network-Based Sensitive Security Parameter Identification and Analysis
    Kim, Hyunki
    Kim, Donghyun
    Yi, Okyeon
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [43] A Convolution Neural Network-based Speckle Tracking Method for Ultrasound Elastography
    Peng, Bo
    Xian, Yuhong
    Jiang, Jingfeng
    2018 IEEE INTERNATIONAL ULTRASONICS SYMPOSIUM (IUS), 2018,
  • [44] Hyperspectral image dynamic range reconstruction using deep neural network-based denoising methods
    Cheplanov, Loran
    Avidan, Shai
    Bonfil, David J.
    Klapp, Iftach
    MACHINE VISION AND APPLICATIONS, 2024, 35 (03)
  • [45] A Convolution Neural Network-based Method for Designing Honeycomb Absorbing Material
    Han, Lingyan
    Kuang, Lei
    Liu, Huan
    Lu, Jianxia
    Liu, Qing Huo
    2019 PHOTONICS & ELECTROMAGNETICS RESEARCH SYMPOSIUM - FALL (PIERS - FALL), 2019, : 1401 - 1404
  • [46] A Convolution Neural Network-based Approach for Metal Surface Roughness Evaluation
    Pan Z.
    Liu Y.
    Li Z.
    Xun Q.
    Wu Y.
    Current Materials Science, 2024, 17 (02) : 148 - 166
  • [47] Investigation of Convolution Neural Network-based Wavefront Correction for FSO Systems
    Chen, Minan
    Jin, Xianqing
    Xu, Zhengyuan
    2019 11TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2019,
  • [48] Hyperspectral image dynamic range reconstruction using deep neural network-based denoising methods
    Loran Cheplanov
    Shai Avidan
    David J. Bonfil
    Iftach Klapp
    Machine Vision and Applications, 2024, 35
  • [49] 3D convolution neural network-based person identification using gait cycles
    P. Supraja
    Rijo Jackson Tom
    Ravi Shekhar Tiwari
    V. Vijayakumar
    Yan Liu
    Evolving Systems, 2021, 12 : 1045 - 1056
  • [50] 3D convolution neural network-based person identification using gait cycles
    Supraja, P.
    Tom, Rijo Jackson
    Tiwari, Ravi Shekhar
    Vijayakumar, V.
    Liu, Yan
    EVOLVING SYSTEMS, 2021, 12 (04) : 1045 - 1056