SNR-BASED TEACHERS-STUDENT TECHNIQUE FOR SPEECH ENHANCEMENT

Cited by: 7

Authors:

Hao, Xiang [1]

Su, Xiangdong [1]

Wang, Zhiyu [1]

Zhang, Qiang [1]

Xu, Huali [1]

Gao, Guanglai [1]

Affiliations:

[1] Inner Mongolia Univ, Inner Mongolia Key Lab Mongolian Informat Proc Te, Coll Comp Sci, Hohhot, Peoples R China

Source:

2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME) | 2020

Funding:

National Natural Science Foundation of China;

Keywords:

speech enhancement; knowledge distillation; teacher-student technique; low SNR; NEURAL-NETWORK;

DOI:

10.1109/icme46284.2020.9102846

Chinese Library Classification:

TP31 [Computer Software];

Discipline classification code:

081202 ; 0835 ;

Abstract:

It is very challenging for speech enhancement methods to achieve robust performance under both high signal-to-noise ratio (SNR) and low SNR simultaneously. In this paper, we propose a method that integrates an SNR-based teachers-student technique and a time-domain U-Net to deal with this problem. Specifically, this method consists of multiple teacher models and a student model. We first train the teacher models under multiple small-range SNRs that do not overlap with each other, so that each teacher performs speech enhancement well within its specific SNR range. Then, we choose different teacher models to supervise the training of the student model according to the SNR of the training data. Eventually, the student model can perform speech enhancement under both high SNR and low SNR. To evaluate the proposed method, we constructed a dataset with SNRs ranging from -20 dB to 20 dB based on a public dataset. We experimentally analyzed the effectiveness of the SNR-based teachers-student technique and compared the proposed method with several state-of-the-art methods.
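
The teacher-selection step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The four SNR ranges, the helper names (`mix_at_snr`, `select_teacher`, `student_loss`), the weight `alpha`, and the mean-squared-error form of the loss are all assumptions made for illustration; the abstract specifies only that the teacher ranges do not overlap and that the data spans -20 dB to 20 dB.

```python
import math

# Hypothetical, non-overlapping teacher SNR ranges in dB (assumed, not from the paper).
TEACHER_RANGES = [(-20.0, -10.0), (-10.0, 0.0), (0.0, 10.0), (10.0, 20.0)]


def power(signal):
    """Mean power of a sequence of samples."""
    return sum(x * x for x in signal) / len(signal)


def mix_at_snr(clean, noise, snr_db):
    """Scale the noise so the clean-to-noise power ratio equals snr_db, then add it."""
    scale = math.sqrt(power(clean) / (power(noise) * 10.0 ** (snr_db / 10.0)))
    return [c + scale * n for c, n in zip(clean, noise)]


def select_teacher(snr_db, ranges=TEACHER_RANGES):
    """Return the index of the teacher whose range [low, high) contains snr_db."""
    for index, (low, high) in enumerate(ranges):
        if low <= snr_db < high:
            return index
    # Treat the upper edge of the last range as belonging to the last teacher.
    if snr_db == ranges[-1][1]:
        return len(ranges) - 1
    raise ValueError(f"SNR {snr_db} dB is outside all teacher ranges")


def mse(a, b):
    """Mean squared error between two equal-length sequences."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def student_loss(student_out, teacher_out, clean, alpha=0.5):
    """Assumed loss: weighted sum of error to the clean target and to the teacher output."""
    return (1.0 - alpha) * mse(student_out, clean) + alpha * mse(student_out, teacher_out)
```

In a training loop built on this sketch, each noisy example would be created with `mix_at_snr`, routed to one teacher by `select_teacher`, and the student would be updated with `student_loss` using that teacher's output.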


Pages: 6
