Noise Suppression based on nonnegative matrix factorization for robust speech recognition

被引:0
|
作者
Fan, Hao-teng [1 ]
Lin, Pao-han [1 ]
Hung, Jeih-weih [1 ]
机构
[1] Natl Chi Nan Univ, Dept Elect Engn, Puli, Taiwan
关键词
nonnegative matrix factorization; noise suppression; speech recognition; noise-robustness;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a novel noise robustness method, nonnegative matrix factorization-based noise suppression (NNS), to enhance the magnitude spectrum of speech signals for better speech recognition performance in noise-corrupted environments. In the presented approach, the clean data and noise in the training set are firstly converted to the spectrograms via short-time Fourier transform (STFT), and the basis spectral matrices of the speech data and noise are learned from the corresponding spectrograms accordingly. Then, the magnitude spectrogram of the noise-corrupted testing data is factorized via the basis matrices of the clean data, and the resulting noise components are alleviated from the original magnitude spectrogram. Finally, the new noisereduced magnitude spectrogram is integrated with the original noisy phase spectrogram and then converted back to a time-domain signal, which is subsequently converted to a sequence of MFCC speech features. By using the presented NNS as a pre-processing stage of the speech recognition system, the obtained recognition accuracy can outperform the MFCC baseline especially at median and low SNR cases. Furthermore, performing NNS on the different sub-band spectrograms can further improve the recognition results relative to the original NNS performing on the full-band spectrogram, indicating that sub-band NNS can produce more robust speech features suitable for noisy speech recognition.
引用
收藏
页码:1731 / +
页数:2
相关论文
共 50 条
  • [41] CONSTRAINED NONNEGATIVE MATRIX FACTORIZATION FOR ROBUST HYPERSPECTRAL UNMIXING
    Feng, Fan
    Deng, Chenwei
    Wang, Wenzheng
    Dai, Jiahui
    Li, Zhenzhen
    Zhao, Baojun
    IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 4221 - 4224
  • [42] Robust Structured Nonnegative Matrix Factorization for Image Representation
    Li, Zechao
    Tang, Jinhui
    He, Xiaofei
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (05) : 1947 - 1960
  • [43] Nonlinear Hyperspectral Unmixing With Robust Nonnegative Matrix Factorization
    Fevotte, Cedric
    Dobigeon, Nicolas
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (12) : 4810 - 4819
  • [44] Robust Collaborative Nonnegative Matrix Factorization for Hyperspectral Unmixing
    Li, Jun
    Bioucas-Dias, Jose M.
    Plaza, Antonio
    Liu, Lin
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (10): : 6076 - 6090
  • [45] Robust Graph Regularized Nonnegative Matrix Factorization for Clustering
    Peng, Chong
    Kang, Zhao
    Hu, Yunhong
    Cheng, Jie
    Cheng, Qiang
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2017, 11 (03)
  • [46] ROBUST NONNEGATIVE MATRIX FACTORIZATION WITH DISCRIMINABILITY FOR IMAGE REPRESENTATION
    Guo, Yuchen
    Ding, Guiguang
    Zhou, Jile
    2015 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2015,
  • [47] Online Nonnegative Matrix Factorization With Robust Stochastic Approximation
    Guan, Naiyang
    Tao, Dacheng
    Luo, Zhigang
    Yuan, Bo
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (07) : 1087 - 1099
  • [48] Robust graph regularized nonnegative matrix factorization for clustering
    Shudong Huang
    Hongjun Wang
    Tao Li
    Tianrui Li
    Zenglin Xu
    Data Mining and Knowledge Discovery, 2018, 32 : 483 - 503
  • [49] Robust Nonnegative Matrix Factorization with Ordered Structure Constraints
    Wang, Jing
    Tian, Feng
    Liu, Chang Hong
    Yu, Hongchuan
    Wang, Xiao
    Tang, Xianchao
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 478 - 485
  • [50] Noise suppression based on neurophysiologically-motivated SNR estimation for robust speech recognition
    Tchorz, J
    Kleinschmidt, M
    Kollmeier, B
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 13, 2001, 13 : 821 - 827