Nonnegative Matrix Partial Co-Factorization for Spectral and Temporal Drum Source Separation

Cited by: 31
Authors
Kim, Minje [1]
Yoo, Jiho [2]
Kang, Kyeongok [1]
Choi, Seungjin [2, 3]
Affiliations
[1] Elect & Telecommun Res Inst, Realist Acoust Res Team, Taejon 305700, South Korea
[2] Pohang Univ Sci & Technol, Dept Comp Sci, Pohang 790784, South Korea
[3] Pohang Univ Sci & Technol, Div IT Convergence Engn, Pohang 790784, South Korea
Keywords
Blind source separation; music source separation (MSS); nonnegative matrix factorization (NMF); nonnegative matrix partial co-factorization (NMPCF)
DOI
10.1109/JSTSP.2011.2158803
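Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract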
We address the problem of separating drum sources from monaural mixtures of polyphonic music that contain various pitched instruments as well as drums. We consider a spectrogram of music, described by a matrix in which each row holds the intensities of one frequency over time. We jointly decompose several spectrogram matrices: two or more column-blocks of the mixture spectrogram (its columns are partitioned into two or more blocks) and a drum-only (drum solo) matrix constructed a priori from various drums. To this end, we apply nonnegative matrix partial co-factorization (NMPCF) to these target matrices: the column-blocks of the mixture spectrogram and the drum-only matrix are jointly decomposed, partially sharing a factor matrix, in order to determine common basis vectors that capture the spectral and temporal characteristics of drum sources. The common basis vectors learned by NMPCF capture spectral patterns of drums because they are shared with the decomposition of the drum-only matrix, and they accommodate temporal patterns of drums because repetitive characteristics are captured by factorizing column-blocks of the mixture spectrogram, each of which is associated with a different time period. Experimental results on real-world commercial music signals demonstrate the performance of the proposed method.
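The joint decomposition described in the abstract can be illustrated with a small NumPy sketch. It is not the authors' algorithm: it assumes a squared-error (Frobenius) objective, Lee-Seung style multiplicative updates, drum bases `U_d` shared by every mixture block and by the drum-only matrix, and separate non-drum bases per block. The function name `nmpcf`, the weight `lam`, the rank parameters, and the synthetic data are all hypothetical choices made for illustration.

```python
import numpy as np

def nmpcf(blocks, Y, n_drum=4, n_other=4, lam=1.0, n_iter=200, eps=1e-9, seed=0):
    """Sketch of partial co-factorization (assumed model, not the paper's exact one):
       X_l ~ U_d V_d[l]^T + U_h[l] V_h[l]^T   for each mixture column-block X_l
       Y   ~ U_d W^T                          for the drum-only matrix
    """
    rng = np.random.default_rng(seed)
    F = Y.shape[0]
    U_d = rng.random((F, n_drum)) + eps            # shared drum bases
    W = rng.random((Y.shape[1], n_drum)) + eps     # activations for Y
    U_h = [rng.random((F, n_other)) + eps for _ in blocks]
    V_d = [rng.random((X.shape[1], n_drum)) + eps for X in blocks]
    V_h = [rng.random((X.shape[1], n_other)) + eps for X in blocks]

    def approx(l):
        # Current model of mixture block l: drum part plus non-drum part.
        return U_d @ V_d[l].T + U_h[l] @ V_h[l].T

    def objective():
        err = lam * np.sum((Y - U_d @ W.T) ** 2)
        for l, X in enumerate(blocks):
            err += np.sum((X - approx(l)) ** 2)
        return err

    history = [objective()]
    for _ in range(n_iter):
        # Shared drum bases: gather terms from every block and from Y.
        num = lam * (Y @ W)
        den = lam * (U_d @ (W.T @ W))
        for l, X in enumerate(blocks):
            num += X @ V_d[l]
            den += approx(l) @ V_d[l]
        U_d *= num / (den + eps)
        W *= (Y.T @ U_d) / (W @ (U_d.T @ U_d) + eps)
        # Block-specific factors, updated one at a time.
        for l, X in enumerate(blocks):
            V_d[l] *= (X.T @ U_d) / (approx(l).T @ U_d + eps)
            U_h[l] *= (X @ V_h[l]) / (approx(l) @ V_h[l] + eps)
            V_h[l] *= (X.T @ U_h[l]) / (approx(l).T @ U_h[l] + eps)
        history.append(objective())

    # Assumed reconstruction: the drum estimate of each block is U_d V_d^T.
    drums = [U_d @ Vd.T for Vd in V_d]
    return drums, U_d, history

# Synthetic stand-ins for magnitude spectrograms (16 frequency bins).
rng = np.random.default_rng(1)
true_drum = rng.random((16, 2))
Y = true_drum @ rng.random((2, 30))
blocks = [true_drum @ rng.random((2, 20)) + rng.random((16, 2)) @ rng.random((2, 20))
          for _ in range(3)]
drums, U_d, history = nmpcf(blocks, Y, n_drum=2, n_other=2)
```

In this sketch the drum-only matrix `Y` ties `U_d` to drum spectra, while the repeated appearance of `U_d` across the time blocks favors components that recur over time. A real pipeline would additionally need an STFT front end and a way to recover phase, neither of which is shown here.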
Pages: 1192-1204
Page count: 13
Related papers
50 in total
  • [1] NONNEGATIVE MATRIX PARTIAL CO-FACTORIZATION FOR DRUM SOURCE SEPARATION
    Yoo, Jiho
    Kim, Minje
    Kang, Kyeongok
    Choi, Seungjin
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 1942 - 1945
  • [2] TEXT-INFORMED AUDIO SOURCE SEPARATION USING NONNEGATIVE MATRIX PARTIAL CO-FACTORIZATION
    Le Magoarou, Luc
    Ozerov, Alexey
    Duong, Ngoc Q. K.
    2013 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2013
  • [3] Multimodal Soft Nonnegative Matrix Co-Factorization for Convolutive Source Separation
    Sedighin, Farnaz
    Babaie-Zadeh, Massoud
    Rivet, Bertrand
    Jutten, Christian
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2017, 65 (12) : 3179 - 3190
  • [4] Separation of Singing Voice Using Nonnegative Matrix Partial Co-Factorization for Singer Identification
    Hu, Ying
    Liu, Guizhong
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (04) : 643 - 653
  • [5] Soft Nonnegative Matrix Co-Factorization
    Seichepine, Nicolas
    Essid, Slim
    Fevotte, Cedric
    Cappe, Olivier
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2014, 62 (22) : 5940 - 5949
  • [6] Monaural Speech Separation by Means of Convolutive Nonnegative Matrix Partial Co-factorization in Low SNR Condition
    Dong X.-L.
    Hu Y.
    Huang H.
    Wushour S.
    Zidonghua Xuebao/Acta Automatica Sinica, 2020, 46 (06) : 1200 - 1209
  • [7] Hybrid Projective Nonnegative Matrix Factorization With Drum Dictionaries for Harmonic/Percussive Source Separation
    Laroche, Clement
    Kowalski, Matthieu
    Papadopoulos, Helene
    Richard, Gael
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (09) : 1499 - 1511
  • [8] SOFT NONNEGATIVE MATRIX CO-FACTORIZATION WITH APPLICATION TO MULTIMODAL SPEAKER DIARIZATION
    Seichepine, N.
    Essid, S.
    Fevotte, C.
    Cappe, O.
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 3537 - 3541
  • [9] Monaural Singing Voice Separation by Non-negative Matrix Partial Co-Factorization with Temporal Continuity and Sparsity Criteria
    Hu, Ying
    Wang, Liejun
    Huang, Hao
    Zhou, Gang
    INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2016, PT III, 2016, 9773 : 33 - 43
  • [10] A STRUCTURED NONNEGATIVE MATRIX FACTORIZATION FOR SOURCE SEPARATION
    Laroche, Clement
    Kowalski, Matthieu
    Papadopoulos, Helene
    Richard, Gael
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 2033 - 2037