ILMSAF based speech enhancement with DNN and noise classification

被引:16
|
作者
Li, Ruwei [1 ]
Liu, Yanan [1 ]
Shi, Yongqiang [1 ]
Dong, Liang [2 ]
Cui, Weili [3 ]
机构
[1] Beijing Univ Technol, Sch Elect Informat & Control Engn, Beijing 100124, Peoples R China
[2] Baylor Univ, Elect & Comp Engn, Waco, TX 76798 USA
[3] Wilkes Univ, Wilkes Barre, PA 18704 USA
关键词
Speech enhancement; Deep Belief Network; Noise classification; Improved Least Mean Square Adaptive; Filtering; Deep Neural Network;
D O I
10.1016/j.specom.2016.10.008
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In order to improve the performance of speech enhancement algorithm in low Signal-to-Noise Ratio (SNR) complex noise environments, a novel Improved Least Mean Square Adaptive Filtering (ILMSAF) based speech enhancement algorithm with Deep Neural Network (DNN) and noise classification is proposed. An adaptive coefficient of filter's parameters is introduced into conventional Least Mean Square Adaptive Filtering (LMSAF). First, the adaptive coefficient of filter's parameters is estimated by Deep Belief Network (DBN). Then, the enhanced speech is obtained by ILMSAF. In addition, in order to make the presented approach suitable for various kinds of noise environments, a new noise classification method based on DNN is presented. According to the result of noise classification, the corresponding ILMSAF model is selected in the enhancement process. The performance test results under ITU-TG.160 show that, the performance of the proposed algorithm tends to achieve significant improvements in terms of various speech subjective and objective quality measures than the wiener filtering based speech enhancement approach with Weighted Denoising Auto-encoder and noise classification. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:53 / 70
页数:18
相关论文
共 50 条
  • [21] Low SNR speech enhancement with DNN based phase estimation
    Chiluveru, Samba Raju
    Tripathy, Manoj
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2019, 22 (01) : 283 - 292
  • [22] Speech Enhancement with Phase Correction based on Modified DNN Architecture
    Cheng, Rui
    Bao, Changchun
    Xiang, Yang
    2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 1222 - 1227
  • [23] Binaural Speech Enhancement based on DNN for the Application of Virtual Reality
    Wang, Jin
    Wang, Jing
    Liu, Ming
    Yan, Zhaoyu
    PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2018, : 629 - 633
  • [24] DNN-Based Cepstral Excitation Manipulation for Speech Enhancement
    Elshamy, Samy
    Fingscheidt, Tim
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (11) : 1803 - 1814
  • [25] A Soft Decision-based Speech Enhancement using Acoustic Noise Classification
    Choi, Jae-Hun
    Kim, Sang-Kyun
    Chang, Joon-Hyuk
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1200 - 1203
  • [26] A COMPOSITE DNN ARCHITECTURE FOR SPEECH ENHANCEMENT
    Yemini, Yochai
    Chazan, Shlomo E.
    Goldberger, Jacob
    Gannot, Sharon
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 841 - 845
  • [27] JOINT NOISE AND MASK AWARE TRAINING FOR DNN-BASED SPEECH ENHANCEMENT WITH SUB-BAND FEATURES
    Wang, Qing
    Du, Jun
    Dai, Li-Rong
    Lee, Chin-Hui
    2017 HANDS-FREE SPEECH COMMUNICATIONS AND MICROPHONE ARRAYS (HSCMA 2017), 2017, : 101 - 105
  • [28] Speech enhancement using a DNN-augmented colored-noise Kalman filter
    Yu, Hongjiang
    Zhu, Wei-Ping
    Champagne, Benoit
    SPEECH COMMUNICATION, 2020, 125 : 142 - 151
  • [29] A Statistical Model-Based Speech Enhancement Using Acoustic Noise Classification for Robust Speech Communication
    Choi, Jae-Hun
    Chang, Joon-Hyuk
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2012, E95B (07) : 2513 - 2516
  • [30] Speech enhancement based on emphasizing the fundamental frequency integrated with SNMF/DNN
    Tao Shi
    Rizwan Ullah
    Hongbo Jia
    Multimedia Tools and Applications, 2025, 84 (14) : 13157 - 13175