Blind Audio Source Separation With Minimum-Volume Beta-Divergence NMF

被引：38

作者：

Leplat, Valentin ^{[1
]}

Gillis, Nicolas ^{[1
]}

Ang, Andersen M. S. ^{[1
]}

机构：

[1] Univ Mons, Dept Math & Operat Res, Fac Polytech, B-7000 Mons, Belgium

来源：

IEEE TRANSACTIONS ON SIGNAL PROCESSING | 2020年 / 68卷

基金：

欧洲研究理事会;

关键词：

Nonnegative matrix factorization; beta-divergences; minimum-volume regularization; identifiability; blind audio source separation; model order selection; NONNEGATIVE MATRIX FACTORIZATION; IDENTIFIABILITY;

D O I：

10.1109/TSP.2020.2991801

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Considering a mixed signal composed of various audio sources and recorded with a single microphone, we consider in this paper the blind audio source separation problem which consists in isolating and extracting each of the sources. To perform this task, nonnegative matrix factorization (NMF) based on the Kullback-Leibler and Itakura-Saito beta-divergences is a standard and state-of-the-art technique that uses the time-frequency representation of the signal. We present a new NMF model better suited for this task. It is based on the minimization of beta-divergences along with a penalty term that promotes the columns of the dictionary matrix to have a small volume. Under some mild assumptions and in noiseless conditions, we prove that this model is provably able to identify the sources. In order to solve this problem, we propose multiplicative updates whose derivations are based on the standard majorization-minimization framework. We show on several numerical experiments that our new model is able to obtain more interpretable results than standard NMF models. Moreover, we show that it is able to recover the sources even when the number of sources present into the mixed signal is overestimated. In fact, our model automatically sets sources to zero in this situation, hence performs model order selection automatically.

引用

页码：3400 / 3410

页数：11

共 50 条

[1] Minimum-volume regularized ILRMA for blind audio source separation
Wang, Jianyu
Guan, Shanzheng
Zhang, Xiao-Lei
2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 630 - 634
[2] Minimum-Volume Multichannel Nonnegative Matrix Factorization for Blind Audio Source Separation
Wang, Jianyu
Guan, Shanzheng
Liu, Shupei
Zhang, Xiao-Lei
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 (29) : 3089 - 3103
[3] Multi-resolution beta-divergence NMF for blind spectral unmixing
Leplat, Valentin
Gillis, Nicolas
Fevotte, Cedric
SIGNAL PROCESSING, 2022, 193
[4] MAJORIZATION-MINIMIZATION ALGORITHMS FOR CONVOLUTIVE NMF WITH THE BETA-DIVERGENCE
Fagot, Dylan
Wendt, Herwig
Fevotte, Cedric
Smaragdis, Paris
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 8202 - 8206
[5] Inertial Majorization-Minimization Algorithm for Minimum-Volume NMF
Thanh, Olivier Vu
Ang, Andersen
Gillis, Nicolas
Le Thi Khanh Hien
29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 1065 - 1069
[6] Robust blind source separation by beta divergence
Mihoko, M
Eguchi, S
NEURAL COMPUTATION, 2002, 14 (08) : 1859 - 1886
[7] Adaptively robust blind audio signals separation by the minimum β-divergence method
Mollah, Md. Nurul Haque
Eguchi, Shinto
PROCEEDINGS OF 10TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (ICCIT 2007), 2007, : 221 - 226
[8] NMF versus ICA for blind source separation
Mirzal, Andri
ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2017, 11 (01) : 25 - 48
[9] Blind source separation with pattern expression NMF
Zhang, Junying
Zhang Hongyi
Wei, Le
Wang, Yue Joseph
ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 1, 2006, 3971 : 1159 - 1164
[10] Single Channel Audio Source Separation by Clustered NMF
Kirbiz, Serap
Gunsel, Bilge
2014 22ND SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2014, : 469 - 472

← 1 2 3 4 5 →