Blind Audio Source Separation With Minimum-Volume Beta-Divergence NMF

被引：38

作者：

Leplat, Valentin ^{[1
]}

Gillis, Nicolas ^{[1
]}

Ang, Andersen M. S. ^{[1
]}

机构：

[1] Univ Mons, Dept Math & Operat Res, Fac Polytech, B-7000 Mons, Belgium

来源：

IEEE TRANSACTIONS ON SIGNAL PROCESSING | 2020年 / 68卷

基金：

欧洲研究理事会;

关键词：

Nonnegative matrix factorization; beta-divergences; minimum-volume regularization; identifiability; blind audio source separation; model order selection; NONNEGATIVE MATRIX FACTORIZATION; IDENTIFIABILITY;

D O I：

10.1109/TSP.2020.2991801

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Considering a mixed signal composed of various audio sources and recorded with a single microphone, we consider in this paper the blind audio source separation problem which consists in isolating and extracting each of the sources. To perform this task, nonnegative matrix factorization (NMF) based on the Kullback-Leibler and Itakura-Saito beta-divergences is a standard and state-of-the-art technique that uses the time-frequency representation of the signal. We present a new NMF model better suited for this task. It is based on the minimization of beta-divergences along with a penalty term that promotes the columns of the dictionary matrix to have a small volume. Under some mild assumptions and in noiseless conditions, we prove that this model is provably able to identify the sources. In order to solve this problem, we propose multiplicative updates whose derivations are based on the standard majorization-minimization framework. We show on several numerical experiments that our new model is able to obtain more interpretable results than standard NMF models. Moreover, we show that it is able to recover the sources even when the number of sources present into the mixed signal is overestimated. In fact, our model automatically sets sources to zero in this situation, hence performs model order selection automatically.

引用

页码：3400 / 3410

页数：11

共 50 条

[31] Underdetermined Blind Audio Source Separation Using Modal Decomposition
Abdeldjalil Aïssa-El-Bey
Karim Abed-Meraim
Yves Grenier
EURASIP Journal on Audio, Speech, and Music Processing, 2007
[32] Underdetermined Blind Audio Source Separation Using Modal Decomposition
Aissa-El-Bey, Abdeldjalil
Abed-Meraim, Karim
Grenier, Yves
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2007, 2007 (1)
[33] Blind Audio Source Separation Using Wiener Filtering Approach
Sharma, Pardeep
Mehra, Rajesh
Dubey, Naveen
2015 4TH INTERNATIONAL CONFERENCE ON RELIABILITY, INFOCOM TECHNOLOGIES AND OPTIMIZATION (ICRITO) (TRENDS AND FUTURE DIRECTIONS), 2015,
[34] A digital audio watermarking scheme based on blind source separation
Ma, XH
Wang, C
Cong, XP
Yin, FL
ADVANCES IN NEURAL NETWORKS - ISNN 2005, PT 2, PROCEEDINGS, 2005, 3497 : 550 - 555
[35] Blind audio source separation based on independent component analysis
Makino, Shoji
Sawada, Hiroshi
Araki, Shoko
INDEPENDENT COMPONENT ANALYSIS AND SIGNAL SEPARATION, PROCEEDINGS, 2007, 4666 : 843 - 843
[36] Improving the Performance of the Instantaneous Blind Audio Source Separation Algorithms
Mahmoud, Amr E.
Ammar, Reda A.
Eladawy, Mohamed I.
Hussien, Medhat
2009 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT 2009), 2009, : 519 - +
[37] An Efficient Algorithm for Underdetermined Blind Source Separation of Audio Mixtures
Dutta, Malay Kishore
Gupta, Phalguni
Pathak, Vinay K.
2009 INTERNATIONAL CONFERENCE ON ADVANCES IN RECENT TECHNOLOGIES IN COMMUNICATION AND COMPUTING (ARTCOM 2009), 2009, : 136 - +
[38] Towards a model of perceived quality of blind audio source separation
Fox, Brendan
Pardo, Bryan
2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 1898 - 1901
[39] Determined Reverberant Blind Source Separation of Audio Mixing Signals
Yang, Senquan
Ding, Fan
Liu, Jianjun
Li, Pu
Hu, Songxi
INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 36 (03): : 3309 - 3323
[40] A multiple audio watermarking for database based on blind source separation
Cui, Xin-Chun
He, Jie
Qin, Xiao-Lin
Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2012, 40 (01): : 78 - 83

← 1 2 3 4 5 →