SPARSENESS-BASED MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION FOR BLIND SOURCE SEPARATION

Cited by: 0
Authors
Higuchi, Takuya [1 ]
Yoshioka, Takuya [1 ]
Nakatani, Tomohiro [1 ]
Affiliations
[1] NTT Corp, NTT Commun Sci Labs, Tokyo, Japan
Keywords
audio source separation; sparseness; nonnegative matrix factorization; MIXTURES;
DOI
Not available
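Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code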
0808 ; 0809 ;
Abstract
This paper addresses audio source separation from multichannel observations. Exploiting the sparseness of sound signals in the time-frequency domain is a successful approach to source separation, enabling separation based on spatial features obtained from a microphone array. Nonnegative matrix factorization (NMF) is another promising approach to audio source separation, which performs separation based on spectral features. This paper incorporates the idea of NMF into sparseness-based source separation and proposes a novel approach to multichannel source separation based on both spatial and spectral features. Experimental results show that the proposed method improves the signal-to-distortion ratio (SDR) by 0.26 dB and the signal-to-interference ratio (SIR) by 1.96 dB compared with a conventional sparseness-based approach. In addition, thanks to the sparseness assumption, the proposed model eliminates the need for a number of matrix inversions and therefore requires a much lower computational cost than a previously proposed multichannel NMF approach that also uses spectral and spatial features.
Pages: 5
Related papers
50 in total
  • [21] Beamspace-Domain Multichannel Nonnegative Matrix Factorization for Audio Source Separation
    Lee, Seokjin
    Park, Sang Ha
    Sung, Koeng-Mo
    IEEE SIGNAL PROCESSING LETTERS, 2012, 19 (01) : 43 - 46
  • [22] Multichannel Blind Sound Source Separation Using Spatial Covariance Model With Level and Time Differences and Nonnegative Matrix Factorization
    Carabias-Orti, Julio Jose
    Nikunen, Joonas
    Virtanen, Tuomas
    Vera-Candeas, Pedro
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (09) : 1512 - 1527
  • [23] Ray-Space constrained multichannel Nonnegative Matrix Factorization for Audio Source Separation
    Munoz-Montoro, Antonio J.
    Olivieri, Marco
    Pezzoli, Mirco
    Carabias-Orti, Julio
    Antonacci, Fabio
    Sarti, Augusto
    32ND EUROPEAN SIGNAL PROCESSING CONFERENCE, EUSIPCO 2024, 2024, : 396 - 400
  • [24] Joint Nonnegative Matrix Factorization for Underdetermined Blind Source Separation in Nonlinear Mixtures
    Kopriva, Ivica
    LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2018), 2018, 10891 : 107 - 115
  • [25] Underdetermined Blind Source Separation Combining Tensor Decomposition and Nonnegative Matrix Factorization
    Xie, Yuan
    Xie, Kan
    Yang, Junjie
    Xie, Shengli
    SYMMETRY-BASEL, 2018, 10 (10):
  • [26] A STRUCTURED NONNEGATIVE MATRIX FACTORIZATION FOR SOURCE SEPARATION
    Laroche, Clement
    Kowalski, Matthieu
    Papadopoulos, Helene
    Richard, Gael
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 2033 - 2037
  • [27] Orthogonal Nonnegative Matrix Factorization for Blind Image Separation
    Mirzal, Andri
    ADVANCES IN VISUAL INFORMATICS, 2013, 8237 : 25 - 35
  • [28] Determined Blind Source Separation Unifying Independent Vector Analysis and Nonnegative Matrix Factorization
    Kitamura, Daichi
    Ono, Nobutaka
    Sawada, Hiroshi
    Kameoka, Hirokazu
    Saruwatari, Hiroshi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (09) : 1626 - 1641
  • [29] Online Blind Source Separation Using Incremental Nonnegative Matrix Factorization with Volume Constraint
    Zhou, Guoxu
    Yang, Zuyuan
    Xie, Shengli
    Yang, Jun-Mei
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2011, 22 (04) : 550 - 560
  • [30] Fast Multichannel Nonnegative Matrix Factorization With Directivity-Aware Jointly-Diagonalizable Spatial Covariance Matrices for Blind Source Separation
    Sekiguchi, Kouhei
    Bando, Yoshiaki
    Nugraha, Aditya Arie
    Yoshii, Kazuyoshi
    Kawahara, Tatsuya
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 2610 - 2625