A STRUCTURED NONNEGATIVE MATRIX FACTORIZATION FOR SOURCE SEPARATION

Cited by: 0
Authors
Laroche, Clement [1, 2]
Kowalski, Matthieu [2, 3]
Papadopoulos, Helene [2]
Richard, Gael [1]
Affiliations
[1] Telecom ParisTech, Inst Mines Telecom, CNRS LTCE, Paris, France
[2] Univ Paris 11, CNRS, Cent Supelec, L2S, Gif Sur Yvette, France
[3] CEA Saclay, INRIA, Parietal Project Team, F-91191 Gif Sur Yvette, France
Keywords
nonnegative matrix factorization; projective nonnegative matrix factorization; audio source separation; harmonic/percussive decomposition
DOI
Not available
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract
In this paper, we propose a new unconstrained nonnegative matrix factorization method designed to exploit the multilayer structure of audio signals to improve the quality of source separation. The tonal layer is sparse in frequency and temporally stable, while the transient layer is composed of short-term broadband sounds. Our method has one part well suited to tonal extraction, which decomposes the signal into sparse orthogonal components, while the transient part is represented by a regular nonnegative matrix factorization. Experiments on synthetic and real music data in a source separation context show that such a decomposition is suitable for audio signals. Compared with three state-of-the-art harmonic/percussive decomposition algorithms, the proposed method shows competitive performance.
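The abstract does not state the model or its update rules, so the following is only an illustrative sketch of the kind of two-part decomposition it describes. It assumes a nonnegative spectrogram V approximated as W_t W_t^T V (a projective NMF term for the tonal layer) plus W_p H_p (a regular NMF term for the transient layer), a Euclidean cost, and heuristic multiplicative updates with no convergence guarantee. The function name `structured_nmf` and all of its parameters are hypothetical and are not taken from the paper.

```python
import numpy as np


def structured_nmf(V, k_tonal, k_transient, n_iter=100, eps=1e-9, seed=0):
    """Sketch: V ~= W_t @ W_t.T @ V (tonal) + W_p @ H_p (transient).

    Assumed model and heuristic updates, not the authors' algorithm.
    """
    rng = np.random.default_rng(seed)
    n_freq, n_frames = V.shape
    W_t = rng.random((n_freq, k_tonal)) + eps       # tonal (projective) basis
    W_p = rng.random((n_freq, k_transient)) + eps   # transient basis
    H_p = rng.random((k_transient, n_frames)) + eps # transient activations

    def model():
        # Current approximation of V from both parts.
        return W_t @ (W_t.T @ V) + W_p @ H_p

    for _ in range(n_iter):
        # Tonal part: ratio of the negative and positive parts of the
        # gradient of ||V - V_hat||_F^2 with respect to W_t.
        V_hat = model()
        num = 2.0 * (V @ (V.T @ W_t))
        den = V_hat @ (V.T @ W_t) + V @ (V_hat.T @ W_t) + eps
        W_t *= num / den

        # Transient part: standard Euclidean NMF multiplicative updates,
        # applied against the full two-part model.
        V_hat = model()
        W_p *= (V @ H_p.T) / (V_hat @ H_p.T + eps)
        V_hat = model()
        H_p *= (W_p.T @ V) / (W_p.T @ V_hat + eps)

    tonal = W_t @ (W_t.T @ V)
    transient = W_p @ H_p
    return tonal, transient


# Usage on a small random nonnegative matrix standing in for a spectrogram.
V_demo = np.random.default_rng(1).random((12, 30))
tonal_demo, transient_demo = structured_nmf(V_demo, k_tonal=3, k_transient=2, n_iter=50)
```

In a source separation setting, the two returned estimates would typically be turned into soft masks applied to the mixture spectrogram before inversion; that step is omitted here.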
Pages: 2033-2037
Number of pages: 5
Related papers
50 in total
  • [21] Robust Structured Nonnegative Matrix Factorization for Image Representation
    Li, Zechao
    Tang, Jinhui
    He, Xiaofei
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (05) : 1947 - 1960
  • [22] STRUCTURED DISCRIMINATIVE NONNEGATIVE MATRIX FACTORIZATION FOR HYPERSPECTRAL UNMIXING
    Li, Xue
    Zhou, Jun
    Tong, Lei
    Yu, Xun
    Guo, Jianhui
    Zhao, Chunxia
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 1848 - 1852
  • [23] Generic Uniqueness of a Structured Matrix Factorization and Applications in Blind Source Separation
    Domanov, Ignat
    De Lathauwer, Lieven
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2016, 10 (04) : 701 - 711
  • [24] β-Divergence Two-Dimensional Sparse Nonnegative Matrix Factorization for Audio Source Separation
    Darsono, A. M.
    Haron, N. Z.
    Jaafar, A. S.
    Ahmad, M. I.
    2013 IEEE CONFERENCE ON WIRELESS SENSOR (ICWISE), 2013, : 119 - 123
  • [25] MULTICHANNEL NONNEGATIVE TENSOR FACTORIZATION WITH STRUCTURED CONSTRAINTS FOR USER-GUIDED AUDIO SOURCE SEPARATION
    Ozerov, Alexey
    Fevotte, Cedric
    Blouet, Raphael
    Durrieu, Jean-Louis
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 257 - 260
  • [26] Discriminative Nonnegative Matrix Factorization Using Cross-Reconstruction Error for Source Separation
    Kwon, Kisoo
    Shin, Jong Won
    Kim, Hyung Yong
    Kim, Nam Soo
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1513 - 1516
  • [27] Ray-Space-Based Multichannel Nonnegative Matrix Factorization for Audio Source Separation
    Pezzoli, Mirco
    Carabias-Orti, Julio Jose
    Cobos, Maximo
    Antonacci, Fabio
    Sarti, Augusto
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 369 - 373
  • [28] Minimum-Volume Multichannel Nonnegative Matrix Factorization for Blind Audio Source Separation
    Wang, Jianyu
    Guan, Shanzheng
    Liu, Shupei
    Zhang, Xiao-Lei
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 (29) : 3089 - 3103
  • [29] Hybrid Projective Nonnegative Matrix Factorization With Drum Dictionaries for Harmonic/Percussive Source Separation
    Laroche, Clement
    Kowalski, Matthieu
    Papadopoulos, Helene
    Richard, Gael
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (09) : 1499 - 1511
  • [30] Supervised Audio Source Separation Based on Nonnegative Matrix Factorization with Cosine Similarity Penalty
    Iwase, Yuta
    Kitamura, Daichi
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2022, E105A (06) : 906 - 913