Blind source separation with optimal transport non-negative matrix factorization

Cited by: 9
Authors
Rolet, Antoine [1]
Seguy, Vivien [1]
Blondel, Mathieu [2]
Sawada, Hiroshi [2]
Affiliations
[1] Kyoto Univ, Grad Sch Informat, Yoshida Honmachi, Kyoto, Japan
[2] NTT Commun Sci Labs, Kyoto, Japan
Keywords
NMF; Speech; BSS; Optimal transport; Algorithms
DOI
10.1186/s13634-018-0576-2
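Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract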
Optimal transport as a loss for machine learning optimization problems has recently attracted considerable attention. Building upon recent advances in computational optimal transport, we develop an optimal transport non-negative matrix factorization (NMF) algorithm for supervised speech blind source separation (BSS). Optimal transport allows us to design and leverage a cost between short-time Fourier transform (STFT) spectrogram frequencies, which takes into account how humans perceive sound. We give empirical evidence that using our proposed optimal transport NMF leads to perceptually better results than NMF with other losses, for both isolated voice reconstruction and speech denoising using BSS. Finally, we demonstrate how to use optimal transport for cross-domain sound processing tasks, where the frequencies represented in the input spectrograms may differ from one spectrogram to another.
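As a rough illustration of why a transport cost between frequencies can behave differently from an element-wise loss, the sketch below compares an entropy-regularized optimal transport cost (computed with plain Sinkhorn iterations) against the Euclidean distance on toy one-dimensional "spectra". It is not the authors' algorithm: the squared-distance cost matrix `C`, the normalized frequency grid `freqs`, the helper names `sinkhorn_cost` and `peak`, and the values of `reg` and `n_iter` are all assumptions chosen for the example.

```python
import numpy as np

def sinkhorn_cost(a, b, C, reg=0.05, n_iter=500):
    """Transport cost <P, C> of an entropy-regularized plan between histograms a and b."""
    K = np.exp(-C / reg)                  # Gibbs kernel derived from the cost matrix
    u = np.ones_like(a)
    for _ in range(n_iter):               # alternating Sinkhorn scaling updates
        v = b / (K.T @ u)
        u = a / (K @ v)
    P = u[:, None] * K * v[None, :]       # approximate transport plan
    return float(np.sum(P * C))

# Hypothetical normalized frequency grid and squared-distance ground cost.
n = 64
freqs = np.linspace(0.0, 1.0, n)
C = (freqs[:, None] - freqs[None, :]) ** 2

def peak(center, width=0.03):
    """Toy spectrum: a single Gaussian bump, normalized to sum to 1."""
    h = np.exp(-0.5 * ((freqs - center) / width) ** 2)
    return h / h.sum()

ref = peak(0.30)     # reference spectrum
near = peak(0.35)    # same shape, slightly shifted in frequency
far = peak(0.70)     # same shape, strongly shifted in frequency

ot_near = sinkhorn_cost(ref, near, C)
ot_far = sinkhorn_cost(ref, far, C)
l2_near = float(np.linalg.norm(ref - near))
l2_far = float(np.linalg.norm(ref - far))

print(f"OT cost:  near={ot_near:.4f}  far={ot_far:.4f}")
print(f"L2 dist:  near={l2_near:.4f}  far={l2_far:.4f}")
```

In this toy setting the transport cost grows with the size of the frequency shift, while the Euclidean distance changes comparatively little once the two peaks stop overlapping. A perceptually motivated cost, as described in the abstract, would replace the squared-distance matrix used here.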
Pages: 16
Related papers
50 in total
  • [21] Adaptive Group Sparsity for Non-negative Matrix Factorization with Application to Unsupervised Source Separation
    Li, Xu
    Wang, Ziteng
    Wang, Xiaofei
    Fu, Qiang
    Yan, Yonghong
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3349 - 3353
  • [22] Adaptive Sparsity Non-Negative Matrix Factorization for Single-Channel Source Separation
    Gao, Bin
    Woo, W. L.
    Dlay, S. S.
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2011, 5 (05) : 989 - 1001
  • [23] Blind source separation of molecular components of the human skin in vivo: non-negative matrix factorization of Raman microspectroscopy data
    Yakimov, B. P.
    Venets, A. V.
    Schleusener, J.
    Fadeev, V. V.
    Lademann, J.
    Shirshin, E. A.
    Darvin, M. E.
    ANALYST, 2021, 146 (10) : 3185 - 3196
  • [24] Non-negative matrix factorisation for blind source separation in wavelet transform domain
    Hattay, Jamel
    Belaid, Samir
    Naanaa, Wady
    IET SIGNAL PROCESSING, 2015, 9 (02) : 111 - 119
  • [25] Perceptual Single-Channel Audio Source Separation by Non-negative Matrix Factorization
    Kirbiz, Serap
    Gunsel, Bilge
    2009 IEEE 17TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2009, : 654 - 657
  • [26] Sound Source Separation Based on Multichannel Non-negative Matrix Factorization with Weighted Averaging
    Yamamoto, Tsuyoshi
    Uenohara, Shingo
    Nishijima, Keisuke
    Furuya, Ken'ichi
    COMPLEX, INTELLIGENT AND SOFTWARE INTENSIVE SYSTEMS, 2021, 1194 : 177 - 187
  • [27] Dynamic Group Sparsity for Non-negative Matrix Factorization with Application to Unsupervised Source Separation
    Li, Xu
    Wang, Xiaofei
    Fu, Qiang
    Yan, Yonghong
    2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016,
  • [28] Blind Source Separation in Dynamic Cell Imaging Using Non-negative Matrix Factorization Applied to Breast Cancer Biopsies
    Mandache, D.
    Benoit a la Guillaume, E.
    Olivo-Marin, J-C
    Meas-Yedid, V
    2021 IEEE 18TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2021, : 1605 - 1608
  • [29] Non-negative matrix factorization approach to blind image deconvolution
    Kopriva, I
    Nuzillard, D
    INDEPENDENT COMPONENT ANALYSIS AND BLIND SIGNAL SEPARATION, PROCEEDINGS, 2006, 3889 : 966 - 973
  • [30] Separation of Reflection Components by Sparse Non-negative Matrix Factorization
    Akashi, Yasuhiro
    Okatani, Takayuki
    COMPUTER VISION - ACCV 2014, PT V, 2015, 9007 : 611 - 625