Blind Audio Source Separation With Minimum-Volume Beta-Divergence NMF

被引:38
|
作者
Leplat, Valentin [1 ]
Gillis, Nicolas [1 ]
Ang, Andersen M. S. [1 ]
机构
[1] Univ Mons, Dept Math & Operat Res, Fac Polytech, B-7000 Mons, Belgium
基金
欧洲研究理事会;
关键词
Nonnegative matrix factorization; beta-divergences; minimum-volume regularization; identifiability; blind audio source separation; model order selection; NONNEGATIVE MATRIX FACTORIZATION; IDENTIFIABILITY;
D O I
10.1109/TSP.2020.2991801
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Considering a mixed signal composed of various audio sources and recorded with a single microphone, we consider in this paper the blind audio source separation problem which consists in isolating and extracting each of the sources. To perform this task, nonnegative matrix factorization (NMF) based on the Kullback-Leibler and Itakura-Saito beta-divergences is a standard and state-of-the-art technique that uses the time-frequency representation of the signal. We present a new NMF model better suited for this task. It is based on the minimization of beta-divergences along with a penalty term that promotes the columns of the dictionary matrix to have a small volume. Under some mild assumptions and in noiseless conditions, we prove that this model is provably able to identify the sources. In order to solve this problem, we propose multiplicative updates whose derivations are based on the standard majorization-minimization framework. We show on several numerical experiments that our new model is able to obtain more interpretable results than standard NMF models. Moreover, we show that it is able to recover the sources even when the number of sources present into the mixed signal is overestimated. In fact, our model automatically sets sources to zero in this situation, hence performs model order selection automatically.
引用
收藏
页码:3400 / 3410
页数:11
相关论文
共 50 条
  • [1] Minimum-volume regularized ILRMA for blind audio source separation
    Wang, Jianyu
    Guan, Shanzheng
    Zhang, Xiao-Lei
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 630 - 634
  • [2] Minimum-Volume Multichannel Nonnegative Matrix Factorization for Blind Audio Source Separation
    Wang, Jianyu
    Guan, Shanzheng
    Liu, Shupei
    Zhang, Xiao-Lei
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 (29) : 3089 - 3103
  • [3] Multi-resolution beta-divergence NMF for blind spectral unmixing
    Leplat, Valentin
    Gillis, Nicolas
    Fevotte, Cedric
    SIGNAL PROCESSING, 2022, 193
  • [4] MAJORIZATION-MINIMIZATION ALGORITHMS FOR CONVOLUTIVE NMF WITH THE BETA-DIVERGENCE
    Fagot, Dylan
    Wendt, Herwig
    Fevotte, Cedric
    Smaragdis, Paris
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 8202 - 8206
  • [5] Inertial Majorization-Minimization Algorithm for Minimum-Volume NMF
    Thanh, Olivier Vu
    Ang, Andersen
    Gillis, Nicolas
    Le Thi Khanh Hien
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 1065 - 1069
  • [6] Robust blind source separation by beta divergence
    Mihoko, M
    Eguchi, S
    NEURAL COMPUTATION, 2002, 14 (08) : 1859 - 1886
  • [7] Adaptively robust blind audio signals separation by the minimum β-divergence method
    Mollah, Md. Nurul Haque
    Eguchi, Shinto
    PROCEEDINGS OF 10TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (ICCIT 2007), 2007, : 221 - 226
  • [8] NMF versus ICA for blind source separation
    Mirzal, Andri
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2017, 11 (01) : 25 - 48
  • [9] Blind source separation with pattern expression NMF
    Zhang, Junying
    Zhang Hongyi
    Wei, Le
    Wang, Yue Joseph
    ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 1, 2006, 3971 : 1159 - 1164
  • [10] Single Channel Audio Source Separation by Clustered NMF
    Kirbiz, Serap
    Gunsel, Bilge
    2014 22ND SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2014, : 469 - 472