Speech Enhancement using Convolution Neural Network-based Spectrogram Denoising

被引:0
|
作者
Hu Xuhong [1 ]
Yan Lin-Huang [2 ]
Lu Xun [3 ]
Guan Yuan-Sheng [2 ]
Hu Wenlin [1 ]
Wang Jie [2 ,4 ]
机构
[1] China Railway Design Corp, Natl Engn Lab Digital Construct & Evaluat Urban R, Tianjin, Peoples R China
[2] Guangzhou Univ, Sch Elect & Commun Engn, Guangzhou, Guangdong, Peoples R China
[3] Guangdong Power Grid Co, Power Grid Planning Ctr, Guangzhou, Guangdong, Peoples R China
[4] Ctr Rd Traff Noise Control, Natl Environm Protect Engn & Technol, Beijing, Peoples R China
关键词
Speech enhancement; deep learning; convolution neural network; spectrogram denoising; NOISE; EFFICIENT;
D O I
10.1109/CMMNO53328.2021.9467599
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
Regarding spectrogram as an image, this paper adopts a convolution neural network (CNN)-based image enhancement algorithm for spectrogram denoising. By doing so, speech denoising can be achieved when the spectrogram is enhanced by the proposed CNN-based image enhancement algorithm. The spectrogram clipping strategy was presented to obtain a large amount of training data, which gave rise to a smaller storage cost and avoided the limited depth development and problem of excessive complexity commonly presented in traditional speech features when training a recurrent neural network. Meanwhile, a deeper network was constructed to improve the capacity and flexibility to use the features of the spectrogram better, and it can also capture enough spatial information to make the noise reduction performance effectively. In addition, the proposed model utilized residual learning strategy in CNN training, with the combination of batch normalization, which greatly improved the performance of the model. The experimental results demonstrates that the proposed spectrogram denoising model has better learning ability and denoising performance, whether it is a known noise situation or a noise mismatch situation, so that the proposed system shows robust speech enhancement effect.
引用
收藏
页码:310 / 318
页数:9
相关论文
共 50 条
  • [31] Auditory filterbank denoising neural network for speech enhancement in wearable auditory device
    Kim, Seon Man
    ELECTRONICS LETTERS, 2024, 60 (10)
  • [32] Neural network-based artificial bandwidth expansion of speech
    Kontio, Juho
    Laaksonen, Laura
    Alku, Paavo
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (03): : 873 - 881
  • [33] Biophysically-inspired features improve the generalizability of neural network-based speech enhancement systems
    Baby, Deepak
    Verhulst, Sarah
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3264 - 3268
  • [34] Unsupervised Speech Denoising Method based on Deep Neural Network
    Chen, Xiaohan
    2018 11TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2018, : 254 - 258
  • [35] Deep Convolution Neural Network Based Speech Recognition for Chhattisgarhi
    Londhe, Narendra D.
    Kshirsagar, Ghanahshyam B.
    Tekchandani, Hitesh
    2018 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2018, : 667 - 671
  • [36] Deep Neural Network Based Complex Spectrogram Reconstruction for Speech Bandwidth Expansion
    Yu, Hongjiang
    Zhu, Wei-Ping
    2020 18TH IEEE INTERNATIONAL NEW CIRCUITS AND SYSTEMS CONFERENCE (NEWCAS'20), 2020, : 110 - 113
  • [37] Fusing traditionally extracted features with deep learned features from the speech spectrogram for anger and stress detection using convolution neural network
    Kapoor, Shalini
    Kumar, Tarun
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (21) : 31107 - 31128
  • [38] Fusing traditionally extracted features with deep learned features from the speech spectrogram for anger and stress detection using convolution neural network
    Shalini Kapoor
    Tarun Kumar
    Multimedia Tools and Applications, 2022, 81 : 31107 - 31128
  • [39] A neural network-based nonlinear filter for image enhancement
    Zhang, S
    Salari, E
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2002, 12 (02) : 56 - 62
  • [40] Enhancement and Expansion of the Neural Network-Based Compact Model Using a Binning Method
    Choi, Jinyoung
    Jeong, Hyunjoon
    Woo, Sangmin
    Cho, Hyungmin
    Kim, Yohan
    Kong, Jeong-Taek
    Kim, Soyoung
    IEEE JOURNAL OF THE ELECTRON DEVICES SOCIETY, 2024, 12 : 65 - 73