DNN TRAINING BASED ON CLASSIC GAIN FUNCTION FOR SINGLE-CHANNEL SPEECH ENHANCEMENT AND RECOGNITION

Cited by: 0
Authors
Tu, Yan-Hui [1]
Du, Jun [1]
Lee, Chin-Hui [2]
Affiliations
[1] Univ Sci & Technol China, Hefei, Anhui, Peoples R China
[2] Georgia Inst Technol, Atlanta, GA 30332 USA
Source
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2019
Funding
National Key R&D Program of China; National Natural Science Foundation of China
Keywords
statistical speech enhancement; ideal ratio mask; deep learning; gain function; speech recognition; NOISE;
DOI
Not available
Chinese Library Classification
O42 [Acoustics]
Subject Classification Codes
070206; 082403
Abstract
In conventional single-channel speech enhancement based on a noise power spectrum estimate, the speech gain function, which suppresses background noise at each time-frequency bin, is computed from the a priori signal-to-noise ratio (SNR). Accurate a priori SNR estimation is therefore paramount for successful noise suppression. Accordingly, we recently proposed a single-channel approach that combines conventional and deep learning techniques for speech enhancement and automatic speech recognition (ASR). However, that combination takes place at the testing stage, which makes the procedure complicated and time-consuming. In this study, the gain function of classic speech enhancement is instead used to optimize the ideal-ratio-mask-based deep neural network (DNN-IRM) at the training stage, yielding a model denoted GF-DNN-IRM. At the testing stage, the IRM estimated by the GF-DNN-IRM model is used directly to generate enhanced speech, without involving the conventional speech enhancement process. In addition, DNNs with fewer parameters operating in a causal processing mode are also discussed. Experiments on the CHiME-4 challenge task show that the proposed algorithm achieves a relative word error rate reduction of 6.57% on the RealData test set compared with unprocessed speech, in causal mode and without acoustic model retraining, whereas the traditional DNN-IRM method fails to improve ASR performance in this case.
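The abstract does not spell out the exact training objective, so the following is a minimal sketch, not the authors' released code, of how a classic gain function could be folded into DNN-IRM training. It assumes a Wiener-type gain G = xi / (1 + xi) driven by a decision-directed a priori SNR estimate, and a hypothetical interpolation weight beta between the oracle IRM and the gain function; the function names, alpha, and beta are illustrative assumptions, not values from the paper.

```python
# Sketch: gain-function-regularized IRM training target and test-time masking.
import numpy as np

def decision_directed_gain(noisy_power, noise_power, alpha=0.98, xi_min=1e-3):
    """Per-frame Wiener gain from a decision-directed a priori SNR estimate."""
    num_frames, num_bins = noisy_power.shape
    gain = np.ones((num_frames, num_bins))
    prev_clean_power = np.zeros(num_bins)
    for t in range(num_frames):
        post_snr = noisy_power[t] / np.maximum(noise_power[t], 1e-10)
        # Decision-directed smoothing of the a priori SNR (Ephraim-Malah style).
        xi = alpha * prev_clean_power / np.maximum(noise_power[t], 1e-10) \
             + (1.0 - alpha) * np.maximum(post_snr - 1.0, 0.0)
        xi = np.maximum(xi, xi_min)
        gain[t] = xi / (1.0 + xi)              # Wiener gain function
        prev_clean_power = (gain[t] ** 2) * noisy_power[t]
    return gain

def ideal_ratio_mask(clean_power, noise_power):
    """Oracle IRM computed from parallel clean/noise spectra at training time."""
    return np.sqrt(clean_power / np.maximum(clean_power + noise_power, 1e-10))

def gf_irm_target(clean_power, noise_power, noisy_power, beta=0.5):
    """Hypothetical GF-weighted training target: interpolate the oracle IRM
    with the classic gain function so the DNN learns a gain-regularized mask."""
    irm = ideal_ratio_mask(clean_power, noise_power)
    gf = decision_directed_gain(noisy_power, noise_power)
    return beta * irm + (1.0 - beta) * gf

# At test time the DNN-estimated mask is applied directly to the noisy STFT:
#   enhanced_stft = estimated_mask * noisy_stft
# so no conventional enhancement pass is needed during inference.
```

The interpolation in gf_irm_target is only one plausible way to inject the gain function into the training target; the paper may instead use the gain function inside the loss or as an auxiliary input, which the abstract alone does not determine.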
Pages: 910-914
Number of pages: 5