DNN-Based Low-Musical-Noise Single-Channel Speech Enhancement Based on Higher-Order-Moments Matching

被引：0

作者：

Mizoguchi, Satoshi ^{[1
]}

Saito, Yuki ^{[1
]}

Takamichi, Shinnosuke ^{[1
]}

Saruwatari, Hiroshi ^{[1
]}

机构：

[1] Univ Tokyo, Tokyo 1138656, Japan

来源：

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 2021年 / E104D卷 / 11期

关键词：

speech enhancement; musical noise; kurtosis; moment matching; deep learning;

D O I：

10.1587/transinf.2021EDP7041

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We propose deep neural network (DNN)-based speech enhancement that reduces musical noise and achieves better auditory impressions. The musical noise is an artifact generated by nonlinear signal processing and negatively affects the auditory impressions. We aim to develop musical-noise-free speech enhancement methods that suppress the musical noise generation and produce perceptually-comfortable enhanced speech. DNN-based speech enhancement using a soft mask achieves high noise reduction but generates musical noise in non-speech regions. Therefore, first, we define kurtosis matching for DNN-based low-musical-noise speech enhancement. Kurtosis is the fourth-order moment and is known to correlate with the amount of musical noise. The kurtosis matching is a penalty term of the DNN training and works to reduce the amount of musical noise. We further extend this scheme to standardized-moment matching. The extended scheme involves using moments whose orders are higher than kurtosis and generalizes the conventional musical-noise-free method based on kurtosis matching. We formulate standardized-moment matching and explore how effectively the higher-order moments reduce the amount of musical noise. Experimental evaluation results 1) demonstrate that kurtosis matching can reduce musical noise without negatively affecting noise suppression and 2) newly reveal that the sixth-moment matching also achieves low-musical-noise speech enhancement as well as kurtosis matching.

引用

页码：1971 / 1980

页数：10

共 50 条

[41] Robust Constrained MFMVDR Filters for Single-Channel Speech Enhancement Based on Spherical Uncertainty Set
Fischer, Dorte
Doclo, Simon
IEEE/ACM Transactions on Audio Speech and Language Processing, 2021, 29 : 618 - 631
[42] Comparative Evaluation of Single-Channel MMSE-Based Noise Reduction Schemes for Speech Recognition
Principi, Emanuele
Cifani, Simone
Rotili, Rudy
Squartini, Stefano
Piazza, Francesco
JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2010, 2010
[43] Single Channel Speech Enhancement Algorithm based on BLSTM-DNN Bidirectional Optimized Hybrid Model
Sun, Xiaoyue
Li, Ruwei
Li, Tao
Yang, Dengcai
3RD ANNUAL INTERNATIONAL CONFERENCE ON CLOUD TECHNOLOGY AND COMMUNICATION ENGINEERING, 2020, 719
[44] SINGLE CHANNEL SPEECH ENHANCEMENT TECHNIQUE FOR LOW SNR QUASI-PERIODIC NOISE BASED ON REDUCED ORDER LINEAR PREDICTION1
Reddy, Chandan K. A.
Montazeri, Vahid
Rao, Yu
Panahi, Issa M. S.
2015 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2015, : 712 - 716
[45] Robustness and sensitivity metrics-based tuning of the augmented Kalman filter for single-channel speech enhancement
Roy, Sujan Kumar
Paliwal, Kuldip K.
APPLIED ACOUSTICS, 2022, 185
[46] Kernel Machines Beat Deep Neural Networks on Mask-based Single-channel Speech Enhancement
Hui, Like
Ma, Siyuan
Belkin, Mikhail
INTERSPEECH 2019, 2019, : 2748 - 2752
[47] Single-Channel Speech Enhancement Based on Improved Frame-Iterative Spectral Subtraction in the Modulation Domain
Li, Chao
Jiang, Ting
Wu, Sheng
CHINA COMMUNICATIONS, 2021, 18 (09) : 100 - 115
[48] On DCT-based MMSE estimation of short time spectral amplitude for single-channel speech enhancement
Shi, Sisi
Paliwal, Kuldip
Busch, Andrew
APPLIED ACOUSTICS, 2023, 202
[49] A Speech Distortion Weighted Single-Channel Wiener Filter Based STFT-Domain Noise Reduction
Zhang, Jie
Tao, Rui
Dai, Li-Rong
2023 IEEE STATISTICAL SIGNAL PROCESSING WORKSHOP, SSP, 2023, : 527 - 531
[50] Single-Channel Speech Enhancement Based on Improved Frame-Iterative Spectral Subtraction in the Modulation Domain
Chao Li
Ting Jiang
Sheng Wu
ChinaCommunications, 2021, 18 (09) : 100 - 115

← 1 2 3 4 5 →