Blind source separation with optimal transport non-negative matrix factorization

被引:9
|
作者
Rolet, Antoine [1 ]
Seguy, Vivien [1 ]
Blondel, Mathieu [2 ]
Sawada, Hiroshi [2 ]
机构
[1] Kyoto Univ, Grad Sch Informat, Yoshida Honmachi, Kyoto, Japan
[2] NTT Commun Sci Labs, Kyoto, Japan
关键词
NMF; Speech; BSS; Optimal transport; ALGORITHMS;
D O I
10.1186/s13634-018-0576-2
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Optimal transport as a loss for machine learning optimization problems has recently gained a lot of attention. Building upon recent advances in computational optimal transport, we develop an optimal transport non-negative matrix factorization (NMF) algorithm for supervised speech blind source separation (BSS). Optimal transport allows us to design and leverage a cost between short-time Fourier transform (SIFT) spectrogram frequencies, which takes into account how humans perceive sound. We give empirical evidence that using our proposed optimal transport, NMF leads to perceptually better results than NMF with other losses, for both isolated voice reconstruction and speech denoising using BSS. Finally, we demonstrate how to use optimal transport for cross-domain sound processing tasks, where frequencies represented in the input spectrograms may be different from one spectrogram to another.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Robust Non-negative Matrix Factorization with β-Divergence for Speech Separation
    Li, Yinan
    Zhang, Xiongwei
    Sun, Meng
    ETRI JOURNAL, 2017, 39 (01) : 21 - 29
  • [32] Blind primary colorant spectral separation combining ICA and POCS non-negative matrix factorization
    Kuo, CH
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004, : 489 - 492
  • [33] Separation of reflection components by sparse non-negative matrix factorization
    Akashi, Yasushi
    Okatani, Takayuki
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2016, 146 : 77 - 85
  • [34] SHIFTED AND CONVOLUTIVE SOURCE-FILTER NON-NEGATIVE MATRIX FACTORIZATION FOR MONAURAL AUDIO SOURCE SEPARATION
    Nakamura, Tomohiko
    Kameoka, Hirokazu
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 489 - 493
  • [35] Kernel Non-Negative Matrix Factorization for Seismic Signature Separation
    Mehmood, Asif
    Damarla, Thyagaraju
    JOURNAL OF PATTERN RECOGNITION RESEARCH, 2013, 8 (01): : 13 - 24
  • [36] Optimal Recovery of Missing Values for Non-Negative Matrix Factorization
    Dean, Rebecca Chen
    Varshney, Lav R.
    IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2021, 2 : 207 - 216
  • [37] Optimal Bayesian clustering using non-negative matrix factorization
    Wang, Ketong
    Porter, Michael D.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2018, 128 : 395 - 411
  • [38] An optimal method for the initialization of non-negative matrix factorization (NMF)
    College of Information and Science Technology, Jinan University, Guangzhou, China
    J. Inf. Comput. Sci., 5 (1765-1778):
  • [39] Spatial Sparsity-based Blind Source Separation Method including Non-negative Matrix Factorization for Multispectral Image Unmixing
    Karoui, Moussa Sofiane
    Deville, Yannick
    Hosseini, Shahram
    Ouamri, Abdelaziz
    2011 10TH INTERNATIONAL WORKSHOP ON ELECTRONICS, CONTROL, MEASUREMENT AND SIGNALS (ECMS), 2011, : 14 - 19
  • [40] Non-negative Matrix Factorization using Stable Alternating Direction Method of Multipliers for Source Separation
    Zhang, Shaofei
    Huang, Dongyan
    Xie, Lei
    Chng, Eng Siong
    Li, Haizhou
    Dong, Minghui
    2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 222 - 228