Speech Enhancement with Nonstationary Acoustic Noise Detection in Time Domain

被引:23
|
作者
Tavares, R. [1 ]
Coelho, R. [1 ]
机构
[1] Mil Inst Engn IME, Lab Acoust Signal Proc, Rio De Janeiro, RJ, Brazil
关键词
Index of nonstationarity; robust estimation; speech enhancement; SPECTRUM; ENVIRONMENTS; RECOGNITION;
D O I
10.1109/LSP.2015.2495102
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This letter proposes a new time domain speech enhancement technique for signals corrupted by nonstationary acoustic noises. In this method, the noise components are detected and attenuated directly from the corrupted speech samples. They are obtained with a robust estimation of the noise standard deviation considering any speech and noise amplitude distribution. These values are used to define a noise selection threshold. Additionally, this solution does not require the usage of any spectral analysis or temporal decomposition as a pre-processing phase. The experiments results show that the proposed scheme leads to significant improvement in the speech quality and intelligibility when compared to competing enhancement approaches.
引用
收藏
页码:6 / 10
页数:5
相关论文
共 50 条
  • [41] Residual noise compensation for robust speech recognition in nonstationary noise
    Yao, KS
    Shi, BE
    Fung, P
    Cao, ZG
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1125 - 1128
  • [42] Noise-Tolerant Time-Domain Speech Separation with Noise Bases
    Ozamoto, Kohei
    Uto, Kuniaki
    Iwano, Koji
    Shinoda, Koichi
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 624 - 629
  • [43] Two-Stage Learning and Fusion Network With Noise Aware for Time-Domain Monaural Speech Enhancement
    Xiang, Xiaoxiao
    Zhang, Xiaojuan
    Chen, Haozhe
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1754 - 1758
  • [44] Time-Reversal Enhancement Network With Cross-Domain Information for Noise-Robust Speech Recognition
    Chao, Fu-An
    Hung, Jeih-Weih
    Sheu, Tommy
    Chen, Berlin
    IEEE MULTIMEDIA, 2022, 29 (01) : 114 - 124
  • [45] IMPROVING NOISE ROBUST AUTOMATIC SPEECH RECOGNITION WITH SINGLE-CHANNEL TIME-DOMAIN ENHANCEMENT NETWORK
    Kinoshita, Keisuke
    Ochiai, Tsubasa
    Delcroix, Marc
    Nakatani, Tomohiro
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7009 - 7013
  • [46] UNetGAN: A Robust Speech Enhancement Approach in Time Domain for Extremely Low Signal-to-noise Ratio Condition
    Hao, Xiang
    Su, Xiangdong
    Wang, Zhiyu
    Zhang, Hui
    Batushiren
    INTERSPEECH 2019, 2019, : 1786 - 1790
  • [47] Speech preprocessing and enhancement based on joint time domain and time-frequency domain analysis
    Zhang, Wenbo
    Xie, Xuefeng
    Du, Yanling
    Huang, Dongmei
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2024, 155 (06): : 3580 - 3588
  • [48] A Soft Decision-based Speech Enhancement using Acoustic Noise Classification
    Choi, Jae-Hun
    Kim, Sang-Kyun
    Chang, Joon-Hyuk
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1200 - 1203
  • [49] ACOUSTIC NOISE-ANALYSIS AND SPEECH ENHANCEMENT TECHNIQUES FOR MOBILE RADIO APPLICATIONS
    DALDEGAN, N
    PRATI, C
    SIGNAL PROCESSING, 1988, 15 (01) : 43 - 56
  • [50] DOUBLE PSEUDO AFFINE PROJECTION ALGORITHM FOR SPEECH ENHANCEMENT AND ACOUSTIC NOISE REDUCTION
    Djendi, Mohamed
    Scalart, Pascal
    2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 2080 - 2084