Speech Enhancement with Nonstationary Acoustic Noise Detection in Time Domain

被引：23

作者：

Tavares, R. ^{[1
]}

Coelho, R. ^{[1
]}

机构：

[1] Mil Inst Engn IME, Lab Acoust Signal Proc, Rio De Janeiro, RJ, Brazil

来源：

IEEE SIGNAL PROCESSING LETTERS | 2016年 / 23卷 / 01期

关键词：

Index of nonstationarity; robust estimation; speech enhancement; SPECTRUM; ENVIRONMENTS; RECOGNITION;

D O I：

10.1109/LSP.2015.2495102

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This letter proposes a new time domain speech enhancement technique for signals corrupted by nonstationary acoustic noises. In this method, the noise components are detected and attenuated directly from the corrupted speech samples. They are obtained with a robust estimation of the noise standard deviation considering any speech and noise amplitude distribution. These values are used to define a noise selection threshold. Additionally, this solution does not require the usage of any spectral analysis or temporal decomposition as a pre-processing phase. The experiments results show that the proposed scheme leads to significant improvement in the speech quality and intelligibility when compared to competing enhancement approaches.

引用

页码：6 / 10

页数：5

共 50 条

[41] Residual noise compensation for robust speech recognition in nonstationary noise
Yao, KS
Shi, BE
Fung, P
Cao, ZG
2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1125 - 1128
[42] Noise-Tolerant Time-Domain Speech Separation with Noise Bases
Ozamoto, Kohei
Uto, Kuniaki
Iwano, Koji
Shinoda, Koichi
2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 624 - 629
[43] Two-Stage Learning and Fusion Network With Noise Aware for Time-Domain Monaural Speech Enhancement
Xiang, Xiaoxiao
Zhang, Xiaojuan
Chen, Haozhe
IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1754 - 1758
[44] Time-Reversal Enhancement Network With Cross-Domain Information for Noise-Robust Speech Recognition
Chao, Fu-An
Hung, Jeih-Weih
Sheu, Tommy
Chen, Berlin
IEEE MULTIMEDIA, 2022, 29 (01) : 114 - 124
[45] IMPROVING NOISE ROBUST AUTOMATIC SPEECH RECOGNITION WITH SINGLE-CHANNEL TIME-DOMAIN ENHANCEMENT NETWORK
Kinoshita, Keisuke
Ochiai, Tsubasa
Delcroix, Marc
Nakatani, Tomohiro
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7009 - 7013
[46] UNetGAN: A Robust Speech Enhancement Approach in Time Domain for Extremely Low Signal-to-noise Ratio Condition
Hao, Xiang
Su, Xiangdong
Wang, Zhiyu
Zhang, Hui
Batushiren
INTERSPEECH 2019, 2019, : 1786 - 1790
[47] Speech preprocessing and enhancement based on joint time domain and time-frequency domain analysis
Zhang, Wenbo
Xie, Xuefeng
Du, Yanling
Huang, Dongmei
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2024, 155 (06): : 3580 - 3588
[48] A Soft Decision-based Speech Enhancement using Acoustic Noise Classification
Choi, Jae-Hun
Kim, Sang-Kyun
Chang, Joon-Hyuk
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1200 - 1203
[49] ACOUSTIC NOISE-ANALYSIS AND SPEECH ENHANCEMENT TECHNIQUES FOR MOBILE RADIO APPLICATIONS
DALDEGAN, N
PRATI, C
SIGNAL PROCESSING, 1988, 15 (01) : 43 - 56
[50] DOUBLE PSEUDO AFFINE PROJECTION ALGORITHM FOR SPEECH ENHANCEMENT AND ACOUSTIC NOISE REDUCTION
Djendi, Mohamed
Scalart, Pascal
2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 2080 - 2084

← 1 2 3 4 5 →