ILMSAF based speech enhancement with DNN and noise classification

Cited by: 16

Authors:

Li, Ruwei [1]

Liu, Yanan [1]

Shi, Yongqiang [1]

Dong, Liang [2]

Cui, Weili [3]

Affiliations:

[1] Beijing Univ Technol, Sch Elect Informat & Control Engn, Beijing 100124, Peoples R China

[2] Baylor Univ, Elect & Comp Engn, Waco, TX 76798 USA

[3] Wilkes Univ, Wilkes Barre, PA 18704 USA

Source:

SPEECH COMMUNICATION | 2016 / Vol. 85

Keywords:

Speech enhancement; Deep Belief Network; Noise classification; Improved Least Mean Square Adaptive Filtering; Deep Neural Network;

DOI:

10.1016/j.specom.2016.10.008

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

To improve speech enhancement performance in complex noise environments with low Signal-to-Noise Ratio (SNR), a novel speech enhancement algorithm based on Improved Least Mean Square Adaptive Filtering (ILMSAF) with a Deep Neural Network (DNN) and noise classification is proposed. An adaptive coefficient for the filter's parameters is introduced into conventional Least Mean Square Adaptive Filtering (LMSAF). First, this adaptive coefficient is estimated by a Deep Belief Network (DBN). Then, the enhanced speech is obtained by ILMSAF. In addition, to make the approach suitable for various kinds of noise environments, a new DNN-based noise classification method is presented. According to the noise classification result, the corresponding ILMSAF model is selected in the enhancement process. Performance tests under ITU-T G.160 show that the proposed algorithm tends to achieve significant improvements on various subjective and objective speech quality measures over the Wiener filtering based speech enhancement approach with a Weighted Denoising Auto-encoder and noise classification. (C) 2016 Elsevier B.V. All rights reserved.
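
For orientation, the conventional LMSAF that the abstract names as its starting point can be sketched as follows. This is a minimal illustration of the textbook LMS update only, not the paper's ILMSAF: the abstract does not say how the DBN-estimated adaptive coefficient enters the filter, so that step is deliberately left out. The names `lms_filter`, `num_taps`, and `step_size` are assumptions made for this sketch, and the demo identifies a synthetic FIR system rather than processing speech.

```python
import random


def lms_filter(reference, desired, num_taps=8, step_size=0.01):
    """Conventional LMS: adapt FIR weights so the filtered
    reference signal tracks the desired signal.

    Returns (output, error, weights).
    """
    weights = [0.0] * num_taps
    buffer = [0.0] * num_taps  # most recent reference samples, newest first
    output, error = [], []
    for x, d in zip(reference, desired):
        buffer = [x] + buffer[:-1]
        y = sum(w * b for w, b in zip(weights, buffer))  # filter output
        e = d - y                                        # instantaneous error
        # Gradient-descent step on the squared error.
        weights = [w + 2.0 * step_size * e * b
                   for w, b in zip(weights, buffer)]
        output.append(y)
        error.append(e)
    return output, error, weights


# Synthetic demo (hypothetical data): recover an unknown 3-tap FIR system.
random.seed(0)
true_taps = [0.5, -0.3, 0.2]
reference = [random.uniform(-1.0, 1.0) for _ in range(2000)]
desired = [
    sum(h * reference[n - k] for k, h in enumerate(true_taps) if n - k >= 0)
    for n in range(len(reference))
]
_, error, weights = lms_filter(reference, desired, num_taps=3, step_size=0.05)
```

In this noiseless setup the learned `weights` approach `true_taps` and the error shrinks toward zero. The fixed `step_size` is the kind of filter parameter that an improved variant might adapt, though how the paper does so is not stated in the abstract.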


Pages: 53-70

Number of pages: 18

Related records (50 in total)

[21] Low SNR speech enhancement with DNN based phase estimation
Chiluveru, Samba Raju
Tripathy, Manoj
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2019, 22 (01) : 283 - 292
[22] Speech Enhancement with Phase Correction based on Modified DNN Architecture
Cheng, Rui
Bao, Changchun
Xiang, Yang
2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 1222 - 1227
[23] Binaural Speech Enhancement based on DNN for the Application of Virtual Reality
Wang, Jin
Wang, Jing
Liu, Ming
Yan, Zhaoyu
PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2018, : 629 - 633
[24] DNN-Based Cepstral Excitation Manipulation for Speech Enhancement
Elshamy, Samy
Fingscheidt, Tim
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (11) : 1803 - 1814
[25] A Soft Decision-based Speech Enhancement using Acoustic Noise Classification
Choi, Jae-Hun
Kim, Sang-Kyun
Chang, Joon-Hyuk
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1200 - 1203
[26] A COMPOSITE DNN ARCHITECTURE FOR SPEECH ENHANCEMENT
Yemini, Yochai
Chazan, Shlomo E.
Goldberger, Jacob
Gannot, Sharon
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 841 - 845
[27] JOINT NOISE AND MASK AWARE TRAINING FOR DNN-BASED SPEECH ENHANCEMENT WITH SUB-BAND FEATURES
Wang, Qing
Du, Jun
Dai, Li-Rong
Lee, Chin-Hui
2017 HANDS-FREE SPEECH COMMUNICATIONS AND MICROPHONE ARRAYS (HSCMA 2017), 2017, : 101 - 105
[28] Speech enhancement using a DNN-augmented colored-noise Kalman filter
Yu, Hongjiang
Zhu, Wei-Ping
Champagne, Benoit
SPEECH COMMUNICATION, 2020, 125 : 142 - 151
[29] A Statistical Model-Based Speech Enhancement Using Acoustic Noise Classification for Robust Speech Communication
Choi, Jae-Hun
Chang, Joon-Hyuk
IEICE TRANSACTIONS ON COMMUNICATIONS, 2012, E95B (07) : 2513 - 2516
[30] Speech enhancement based on emphasizing the fundamental frequency integrated with SNMF/DNN
Tao Shi
Rizwan Ullah
Hongbo Jia
Multimedia Tools and Applications, 2025, 84 (14) : 13157 - 13175
