Hybrid Projective Nonnegative Matrix Factorization With Drum Dictionaries for Harmonic/Percussive Source Separation

被引:3
|
作者
Laroche, Clement [1 ,2 ]
Kowalski, Matthieu [2 ]
Papadopoulos, Helene [2 ]
Richard, Gael [1 ]
机构
[1] Univ Paris Saclay, Telecom ParisTech, LTCI, F-75013 Paris, France
[2] Univ Paris Sud, Cent Supelec, CNRS, UMR 8506,Lab Signaux & Syst, F-91192 Gif Sur Yvette, France
关键词
Nonnegative matrix factorization; projective nonnegative matrix factorization; audio source separation; harmonic/percussive decomposition; POLYPHONIC MUSIC; MELODY EXTRACTION; SPEECH SIGNALS; TRANSCRIPTION; DECOMPOSITION; ALGORITHMS;
D O I
10.1109/TASLP.2018.2830116
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
One of the most general models of music signals considers that such signals can be represented as a sum of two distinct components: a tonal part that is sparse in frequency and temporally stable and a transient (or percussive) part that is composed of short-term broadband sounds. In this paper, we propose a novel hybrid method built upon nonnegative matrix factorization (NMF) that decomposes the time frequency representation of an audio signal into such two components. The tonal part is estimated by a sparse and orthogonal nonnegative decomposition, and the transient part is estimated by a straightforward NMF decomposition constrained by a pre-learned dictionary of smooth spectra. The optimization problem at the heart of our method remains simple with very few hyperparameters and can be solved thanks to simple multiplicative update rules. The extensive benchmark on a large and varied music database against four state of the art harmonic/percussive source separation algorithms demonstrate the merit of the proposed approach.
引用
收藏
页码:1499 / 1511
页数:13
相关论文
共 50 条
  • [31] Underdetermined Blind Source Separation Combining Tensor Decomposition and Nonnegative Matrix Factorization
    Xie, Yuan
    Xie, Kan
    Yang, Junjie
    Xie, Shengli
    SYMMETRY-BASEL, 2018, 10 (10):
  • [32] SPARSENESS-BASED MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION FOR BLIND SOURCE SEPARATION
    Higuchi, Takuya
    Yoshioka, Takuya
    Nakatani, Tomohiro
    2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016,
  • [33] Beamspace-Domain Multichannel Nonnegative Matrix Factorization for Audio Source Separation
    Lee, Seokjin
    Park, Sang Ha
    Sung, Koeng-Mo
    IEEE SIGNAL PROCESSING LETTERS, 2012, 19 (01) : 43 - 46
  • [34] Machine Learning Source Separation Using Maximum A Posteriori Nonnegative Matrix Factorization
    Gao, Bin
    Woo, W. L.
    Ling, Bingo W-K.
    IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (07) : 1169 - 1179
  • [35] ADAPTATION OF SOURCE-SPECIFIC DICTIONARIES IN NON-NEGATIVE MATRIX FACTORIZATION FOR SOURCE SEPARATION
    Jaureguiberry, Xabier
    Leveau, Pierre
    Maller, Simon
    Burred, Juan Jose
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5 - 8
  • [36] Dual-Transform Source Separation Using Sparse Nonnegative Matrix Factorization
    Md. Imran Hossain
    Md. Shohidul Islam
    Mst. Titasa Khatun
    Rizwan Ullah
    Asim Masood
    Zhongfu Ye
    Circuits, Systems, and Signal Processing, 2021, 40 : 1868 - 1891
  • [37] Layered Nonnegative Matrix Factorization for Speech Separation
    Hsu, Chung-Chien
    Chien, Jen-Tzung
    Chi, Tai-Shih
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 628 - 632
  • [38] Projective nonnegative matrix factorization for image compression and feature extraction
    Yuan, ZJ
    Oja, E
    IMAGE ANALYSIS, PROCEEDINGS, 2005, 3540 : 333 - 342
  • [39] Feature matching using modified projective nonnegative matrix factorization
    Yan, Weidong
    Tian, Zheng
    Wen, Jinhuan
    Pan, Lulu
    JOURNAL OF ELECTRONIC IMAGING, 2012, 21 (01)
  • [40] A HYBRID ITERATIVE ALGORITHM FOR NONNEGATIVE MATRIX FACTORIZATION
    Soltuz, Stefan M.
    Wang, Wenwu
    Jackson, Philip J. B.
    2009 IEEE/SP 15TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING, VOLS 1 AND 2, 2009, : 409 - 412