共 22 条
- [1] Huang P S, Kim M, Hasegawa-Johnson M, Smaragdis P., Deep learning for monaural speech separation, Proceedings of the 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 1562-1566, (2014)
- [2] Huang P S, Kim M, Hasegawa-Johnson M, Smaragdis P., Joint optimization of masks and deep recurrent neural networks for monaural source separation, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 23, 12, pp. 2136-2147, (2015)
- [3] Liu Wen-Ju, Nie Shuai, Liang Shan, Zhang Xue-Liang, Deep learning based speech separation technology and its developments, Acta Automatica Sinica, 42, 6, pp. 819-833, (2016)
- [4] Lee D D, Seung H S., Learning the parts of objects by non-negative matrix factorization, Nature, 401, 6755, pp. 788-791, (1999)
- [5] Wang D L, Brown G J., Computational Auditory Scene Analysis: Principles, Algorithms, and Applications, (2006)
- [6] Han Wei, Zhang Xiong-Wei, Min Gang, Zhang Qi-Ye, A single-channel speech enhancement approach based on perceptual masking deep neural network, Acta Automatica Sinica, 43, 2, pp. 248-258, (2017)
- [7] Yuan Wen-Hao, Sun Wen-Zhu, Xia Bin, Ou Shi-Feng, Improving speech enhancement in unseen noise using deep convolutional neural network, Acta Automatica Sinica, 44, 4, pp. 751-759, (2018)
- [8] Smaragdis P., Convolutive speech bases and their application to supervised speech separation, IEEE Transactions on Audio, Speech, and Language Processing, 15, 1, pp. 1-12, (2007)
- [9] O'Grady P D, Pearlmutter B A., Discovering speech phones using convolutive non-negative matrix factorisation with a sparseness constraint, Neurocomputing, 72, 1-3, pp. 88-101, (2008)
- [10] Sun M, Li Y N, Gemmeke J F, Zhang X W., Speech enhancement under low SNR conditions via noise estimation using sparse and low-rank NMF with Kullback--Leibler divergence, IEEE Transactions on Audio, Speech, and Language Processing, 23, 7, pp. 1233-1242, (2015)