A STRUCTURED NONNEGATIVE MATRIX FACTORIZATION FOR SOURCE SEPARATION

被引:0
|
作者
Laroche, Clement [1 ,2 ]
Kowalski, Matthieu [2 ,3 ]
Papadopoulos, Helene [2 ]
Richard, Gael [1 ]
机构
[1] Telecom ParisTech, Inst Mines Telecom, CNRS LTCE, Paris, France
[2] Univ Paris 11, CNRS, Cent Supelec, L2S, Gif Sur Yvette, France
[3] CEA Saclay, INRIA, Parietal Project Team, F-91191 Gif Sur Yvette, France
关键词
nonnegative matrix factorization; projective nonnegative matrix factorization; audio source separation; harmonic/percussive decomposition;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose a new unconstrained nonnegative matrix factorization method designed to utilize the multilayer structure of audio signals to improve the quality of the source separation. The tonal layer is sparse in frequency and temporally stable, while the transient layer is composed of short term broadband sounds. Our method has a part well suited for tonal extraction which decomposes the signals in sparse orthogonal components, while the transient part is represented by a regular nonnegative matrix factorization decomposition. Experiments on synthetic and real music data in a source separation context show that such decomposition is suitable for audio signal. Compared with three state-of-the-art harmonic/percussive decomposition algorithms, the proposed method shows competitive performances.
引用
收藏
页码:2033 / 2037
页数:5
相关论文
共 50 条
  • [31] FLOW-BASED FAST MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION FOR BLIND SOURCE SEPARATION
    Nugraha, Aditya Arie
    Sekiguchi, Kouhei
    Fontaine, Mathieu
    Bando, Yoshiaki
    Yoshii, Kazuyoshi
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 501 - 505
  • [32] Determined Blind Source Separation Unifying Independent Vector Analysis and Nonnegative Matrix Factorization
    Kitamura, Daichi
    Ono, Nobutaka
    Sawada, Hiroshi
    Kameoka, Hirokazu
    Saruwatari, Hiroshi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (09) : 1626 - 1641
  • [33] AUTOREGRESSIVE FAST MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION FOR JOINT BLIND SOURCE SEPARATION AND DEREVERBERATION
    Sekiguchi, Kouhei
    Bando, Yoshiaki
    Nugraha, Aditya Arie
    Fontaine, Mathieu
    Yoshii, Kazuyoshi
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 511 - 515
  • [34] Ray-Space constrained multichannel Nonnegative Matrix Factorization for Audio Source Separation
    Munoz-Montoro, Antonio J.
    Olivieri, Marco
    Pezzoli, Mirco
    Carabias-Orti, Julio
    Antonacci, Fabio
    Sarti, Augusto
    32ND EUROPEAN SIGNAL PROCESSING CONFERENCE, EUSIPCO 2024, 2024, : 396 - 400
  • [35] Nonnegative Matrix Partial Co-Factorization for Spectral and Temporal Drum Source Separation
    Kim, Minje
    Yoo, Jiho
    Kang, Kyeongok
    Choi, Seungjin
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2011, 5 (06) : 1192 - 1204
  • [36] Online Blind Source Separation Using Incremental Nonnegative Matrix Factorization with Volume Constraint
    Zhou, Guoxu
    Yang, Zuyuan
    Xie, Shengli
    Yang, Jun-Mei
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2011, 22 (04): : 550 - 560
  • [37] Harmonic Sparse Structured Nonnegative Matrix Factorization: A Novel Method for the Separation of Coupled Fault Feature
    Zhang, Boyao
    Lin, Jing
    Miao, Yonghao
    Jiao, Jinyang
    Liu, Hanyang
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (04) : 6209 - 6221
  • [38] Monaural sound source separation by nonnegative matrix factorization with tempora continuity and sparseness criteria
    Virtanen, Tuomas
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (03): : 1066 - 1074
  • [39] Underdetermined blind source separation using normalized spatial covariance matrix and multichannel nonnegative matrix factorization
    Oh, Son-hook
    Kim, Jung-Han
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2020, 39 (02): : 120 - 130
  • [40] Transductive Convolutive Nonnegative Matrix Factorization for Speech Separation
    Mai, Yaodan
    Lan, Long
    Guan, Naiyang
    Zhang, Xiang
    Luo, Zhigang
    PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 1400 - 1404