An Adaptation Method in Noise Mismatch Conditions for DNN-based Speech Enhancement

被引：1

作者：

Xu Si-Ying ^{[1
]}

Niu Tong ^{[1
]}

Qu Dan ^{[1
]}

Long Xing-Yan ^{[1
]}

机构：

[1] Natl Digital Switching Syst Engn & Technol R&D Ct, Zhengzhou, Henan, Peoples R China

来源：

KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS | 2018年 / 12卷 / 10期

关键词：

Noise-aware Training; identity-vector; L-2; regularization; speech enhancement; DNN; condition mismatch; INTELLIGIBILITY; RECOGNITION; SUPPRESSION; SELECTION; MODEL;

D O I：

10.3837/tiis.2018.10.017

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The deep learning based speech enhancement has shown considerable success. However, it still suffers performance degradation under mismatch conditions. In this paper, an adaptation method is proposed to improve the performance under noise mismatch conditions. Firstly, we advise a noise aware training by supplying identity vectors (i-vectors) as parallel input features to adapt deep neural network (DNN) acoustic models with the target noise. Secondly, given a small amount of adaptation data, the noise-dependent DNN is obtained by using L-2 regularization from a noise-independent DNN, and forcing the estimated masks to be close to the unadapted condition. Finally, experiments were carried out on different noise and SNR conditions, and the proposed method has achieved significantly 0.1%-9.6% benefits of STOI, and provided consistent improvement in PESQ and segSNR against the baseline systems.

引用

页码：4930 / 4951

页数：22

共 50 条

[1] DNN-BASED ENHANCEMENT OF NOISY AND REVERBERANT SPEECH
Zhao, Yan
Wang, DeLiang
Merks, Ivo
Zhang, Tao
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 6525 - 6529
[2] A DNN-based emotional speech synthesis by speaker adaptation
Yang, Hongwu
Zhang, Weizhao
Zhi, Pengpeng
2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 633 - 637
[3] DNN-Based Speech Enhancement Using Soft Audible Noise Masking for Wind Noise Reduction
Haichuan Bai
Fengpei Ge
Yonghong Yan
中国通信, 2018, 15 (09) : 235 - 243
[4] DNN-Based Speech Enhancement Using Soft Audible Noise Masking for Wind Noise Reduction
Bai, Haichuan
Ge, Fengpei
Yan, Yonghong
CHINA COMMUNICATIONS, 2018, 15 (09) : 235 - 243
[5] INTEGRATED DNN-BASED MODEL ADAPTATION TECHNIQUE FOR NOISE-ROBUST SPEECH RECOGNITION
Lee, Kang Hyun
Kang, Woo Hyun
Kang, Tae Gyoon
Kim, Nam Soo
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5245 - 5249
[6] Concatenated Identical DNN (CI-DNN) to Reduce Noise-Type Dependence in DNN-Based Speech Enhancement
Xu, Ziyi
Strake, Maximilian
Fingscheidt, Tim
2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
[7] A study of speaker adaptation for DNN-based speech synthesis
Wu, Zhizheng
Swietojanski, Pawel
Veaux, Christophe
Renals, Steve
King, Simon
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 879 - 883
[8] DNN-based phase estimation for online speech enhancement
Nguyen, Binh Thien
Wakabayashi, Yukoh
Geng, Yuting
Iwai, Kenta
Nishiura, Takanobu
ACOUSTICAL SCIENCE AND TECHNOLOGY, 2025, 46 (02) : 186 - 190
[9] DNN-BASED SPEECH ENHANCEMENT USING MBE MODEL
Huang, Qizheng
Bao, Changchun
Wang, Xianyun
Xiang, Yang
2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 196 - 200
[10] DNN-Based Cepstral Excitation Manipulation for Speech Enhancement
Elshamy, Samy
Fingscheidt, Tim
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (11) : 1803 - 1814

← 1 2 3 4 5 →