Noise Suppression based on nonnegative matrix factorization for robust speech recognition

Authors
Fan, Hao-teng [1]
Lin, Pao-han [1]
Hung, Jeih-weih [1]
Affiliations
[1] Natl Chi Nan Univ, Dept Elect Engn, Puli, Taiwan
Keywords
nonnegative matrix factorization; noise suppression; speech recognition; noise-robustness
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
This paper presents a novel noise-robustness method, nonnegative matrix factorization-based noise suppression (NNS), which enhances the magnitude spectrum of speech signals to improve speech recognition performance in noise-corrupted environments. In the presented approach, the clean speech and the noise in the training set are first converted to spectrograms via the short-time Fourier transform (STFT), and the basis spectral matrices of the speech and the noise are learned from the corresponding spectrograms. Then, the magnitude spectrogram of the noise-corrupted test data is factorized via the basis matrices of the clean data, and the resulting noise components are removed from the original magnitude spectrogram. Finally, the noise-reduced magnitude spectrogram is combined with the original noisy phase spectrogram and converted back to a time-domain signal, which is subsequently converted to a sequence of MFCC speech features. When NNS is used as a pre-processing stage of the speech recognition system, the recognition accuracy can exceed that of the MFCC baseline, especially at medium and low SNRs. Furthermore, applying NNS to separate sub-band spectrograms further improves the recognition results relative to the original NNS applied to the full-band spectrogram, indicating that sub-band NNS can produce more robust speech features for noisy speech recognition.
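The processing chain described in the abstract can be illustrated with a minimal sketch. The abstract does not state the NMF cost function, the update rule, or exactly how the noise components are obtained, so everything below is an assumption: Euclidean-cost multiplicative updates, a test-time factorization over the concatenated speech and noise bases, and plain subtraction of the estimated noise magnitude with flooring at zero. The function names (`learn_basis`, `suppress_noise`, `recombine`) are hypothetical and are not taken from the paper. The STFT and MFCC stages are omitted; the code operates on magnitude spectrograms that are assumed to be already computed.

```python
import numpy as np

EPS = 1e-10  # guards against division by zero in the multiplicative updates


def learn_basis(V, rank, n_iter=200, seed=0):
    """Factorize a nonnegative magnitude spectrogram V (freq x time) as W @ H.

    Uses multiplicative updates for the Euclidean cost (an assumption;
    the abstract does not name the cost function).
    """
    rng = np.random.default_rng(seed)
    n_freq, n_frames = V.shape
    W = rng.random((n_freq, rank)) + EPS
    H = rng.random((rank, n_frames)) + EPS
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + EPS)
        W *= (V @ H.T) / (W @ H @ H.T + EPS)
    return W, H


def suppress_noise(V_noisy, W_speech, W_noise, n_iter=200, seed=0):
    """Estimate the noise part of V_noisy with fixed bases and subtract it."""
    W = np.hstack([W_speech, W_noise])  # bases stay fixed at test time
    rng = np.random.default_rng(seed)
    H = rng.random((W.shape[1], V_noisy.shape[1])) + EPS
    for _ in range(n_iter):
        H *= (W.T @ V_noisy) / (W.T @ W @ H + EPS)  # update activations only
    n_speech = W_speech.shape[1]
    noise_est = W_noise @ H[n_speech:]  # noise-only reconstruction
    return np.maximum(V_noisy - noise_est, 0.0)  # floor at zero


def recombine(magnitude, noisy_phase):
    """Attach the original noisy phase to the enhanced magnitude."""
    return magnitude * np.exp(1j * noisy_phase)
```

In this sketch, `learn_basis` would be run once on the clean-speech spectrogram and once on the noise spectrogram from the training set. The complex spectrogram returned by `recombine` would then be passed to an inverse STFT before MFCC extraction. The sub-band variant mentioned in the abstract could be approximated by calling the same functions on separate blocks of frequency rows, although the abstract does not say how the sub-bands are defined.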
Pages: 1731 / +
Number of pages: 2