SNR-BASED TEACHERS-STUDENT TECHNIQUE FOR SPEECH ENHANCEMENT

被引:7
|
作者
Hao, Xiang [1 ]
Su, Xiangdong [1 ]
Wang, Zhiyu [1 ]
Zhang, Qiang [1 ]
Xu, Huali [1 ]
Gao, Guanglai [1 ]
机构
[1] Inner Mongolia Univ, Inner Mongolia Key Lab Mongolian Informat Proc Te, Coll Comp Sci, Hohhot, Peoples R China
基金
中国国家自然科学基金;
关键词
speech enhancement; knowledge distillation; teacher-student technique; low SNR; NEURAL-NETWORK;
D O I
10.1109/icme46284.2020.9102846
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
It is very challenging for speech enhancement methods to achieves robust performance under both high signal-to-noise ratio (SNR) and low SNR simultaneously. In this paper, we propose a method that integrates an SNR-based teachers-student technique and time-domain U-Net to deal with this problem. Specifically, this method consists of multiple teacher models and a student model. We first train the teacher models under multiple small-range SNRs that do not coincide with each other so that they can perform speech enhancement well within the specific SNR range. Then, we choose different teacher models to supervise the training of the student model according to the SNR of the training data. Eventually, the student model can perform speech enhancement under both high SNR and low SNR. To evaluate the proposed method, we constructed a dataset with an SNR ranging from -20dB to 20dB based on the public dataset. We experimentally analyzed the effectiveness of the SNR-based teachers-student technique and compared the proposed method with several state-of-the-art methods.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] SNR-Based Progressive Learning of Deep Neural Network for Speech Enhancement
    Gao, Tian
    Du, Jun
    Dai, Li-Rong
    Lee, Chin-Hui
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3713 - 3717
  • [2] SNR-Based Features and Diverse Training Data for Robust DNN-Based Speech Enhancement
    Rehr, Robert
    Gerkmann, Timo
    IEEE/ACM Transactions on Audio Speech and Language Processing, 2021, 29 : 1937 - 1949
  • [3] SNR-Based Features and Diverse Training Data for Robust DNN-Based Speech Enhancement
    Rehr, Robert
    Gerkmann, Timo
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 1937 - 1949
  • [4] Low distortion SNR-based speech enhancement employing critical band filter banks
    Westerlund, N
    de Haan, JM
    Dahl, M
    Claesson, I
    ICICS-PCM 2003, VOLS 1-3, PROCEEDINGS, 2003, : 129 - 133
  • [5] EFFICIENT SNR-BASED SUBBAND POST-PROCESSING FOR RESIDUAL NOISE REDUCTION IN SPEECH ENHANCEMENT ALGORITHMS
    Mustiere, Frederic
    Bouchard, Martin
    Bolic, Miodrag
    18TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2010), 2010, : 1558 - 1561
  • [6] Adaptive SNR-based carrier phase multipath mitigation technique
    Comp, CJ
    Axelrad, P
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 1998, 34 (01) : 264 - 276
  • [7] A series of SNR-based speech intelligibility models in the Auditory Modeling Toolbox
    Lavandier, Mathieu
    Vicente, Thibault
    Prud'homme, Luna
    ACTA ACUSTICA, 2022, 6
  • [8] SNR-based queue observations at CFHT
    Devost, Daniel
    Moutou, Claire
    Manset, Nadine
    Mahoney, Billy
    Burdullis, Todd
    Cuillandre, Jean-Charles
    Racine, Rene
    OBSERVATORY OPERATIONS: STRATEGIES, PROCESSES, AND SYSTEMS VI, 2016, 9910
  • [9] SNR-based Link Quality Estimation
    Tan, Wee Lum
    Hu, Peizhao
    Portmann, Marius
    2012 IEEE 75TH VEHICULAR TECHNOLOGY CONFERENCE (VTC SPRING), 2012,
  • [10] A new a priori SNR estimator based on multiple linear regression technique for speech enhancement
    Lee, Soojeong
    Lim, Chungsoo
    Chang, Joon-Hyuk
    DIGITAL SIGNAL PROCESSING, 2014, 30 : 154 - 164