DNN-Based Low-Musical-Noise Single-Channel Speech Enhancement Based on Higher-Order-Moments Matching

Cited by: 0
Authors
Mizoguchi, Satoshi [1]
Saito, Yuki [1]
Takamichi, Shinnosuke [1]
Saruwatari, Hiroshi [1]
Affiliations
[1] Univ Tokyo, Tokyo 1138656, Japan
Keywords
speech enhancement; musical noise; kurtosis; moment matching; deep learning
DOI
10.1587/transinf.2021EDP7041
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
We propose deep neural network (DNN)-based speech enhancement that reduces musical noise and achieves better auditory impressions. Musical noise is an artifact generated by nonlinear signal processing, and it degrades auditory impressions. We aim to develop musical-noise-free speech enhancement methods that suppress musical noise generation and produce perceptually comfortable enhanced speech. DNN-based speech enhancement using a soft mask achieves high noise reduction but generates musical noise in non-speech regions. Therefore, we first define kurtosis matching for DNN-based low-musical-noise speech enhancement. Kurtosis is the fourth-order moment and is known to correlate with the amount of musical noise. Kurtosis matching is a penalty term in DNN training that works to reduce the amount of musical noise. We further extend this scheme to standardized-moment matching. The extended scheme uses moments of higher order than kurtosis and generalizes the conventional musical-noise-free method based on kurtosis matching. We formulate standardized-moment matching and explore how effectively the higher-order moments reduce the amount of musical noise. Experimental evaluation results 1) demonstrate that kurtosis matching can reduce musical noise without negatively affecting noise suppression and 2) newly reveal that sixth-moment matching achieves low-musical-noise speech enhancement as well as kurtosis matching does.
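The idea of adding a moment-matching penalty to a training loss can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' formulation: the abstract does not give the exact penalty, so the squared-difference form, the function names (`standardized_moment`, `moment_matching_penalty`, `total_loss`), the `weight` parameter, and the choice of what the two signals represent are all hypothetical. Setting `order=4` corresponds to kurtosis matching and `order=6` to sixth-moment matching.

```python
import numpy as np


def standardized_moment(x, order):
    """Return the order-th standardized moment E[(x - mu)^n] / sigma^n."""
    x = np.asarray(x, dtype=float)
    centered = x - x.mean()
    sigma = centered.std()  # population standard deviation (ddof=0)
    return np.mean(centered ** order) / sigma ** order


def moment_matching_penalty(enhanced, reference, order=4):
    """Squared gap between the standardized moments of two signals.

    Assumption: a squared difference is used here only for illustration;
    the paper may define the penalty differently.
    """
    gap = standardized_moment(enhanced, order) - standardized_moment(reference, order)
    return gap ** 2


def total_loss(enhancement_loss, enhanced, reference, weight=0.1, order=4):
    """Add the weighted penalty to an already computed enhancement loss.

    `weight` is a hypothetical hyperparameter balancing the two terms.
    """
    return enhancement_loss + weight * moment_matching_penalty(enhanced, reference, order)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    reference = rng.normal(size=10_000)        # stand-in for a reference signal
    enhanced = rng.laplace(size=10_000)        # heavier tails, so larger kurtosis
    print(moment_matching_penalty(enhanced, reference, order=4))
    print(moment_matching_penalty(enhanced, reference, order=6))
```

In an actual DNN training loop the same computation would be written with a differentiable array library so that gradients flow through the penalty; NumPy is used here only to keep the sketch short.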
Pages: 1971-1980
Number of pages: 10