Hybrid Multichannel Signal Separation Using Supervised Nonnegative Matrix Factorization with Spectrogram Restoration

被引:0
|
作者
Kitamura, Daichi [1 ]
Saruwatari, Hiroshi [2 ]
Nakamura, Satoshi [3 ]
Takahashi, Yu [4 ]
Kondo, Kazunobu [4 ]
Kameoka, Hirokazu [2 ]
机构
[1] Grad Univ Adv Studies, Chiyoda Ku, 2-1-2 Hitotsubashi, Tokyo 1018430, Japan
[2] Univ Tokyo, Bunkyo Ku, Tokyo 1138656, Japan
[3] Nara Inst Sci & Technol, Ikoma, Nara 6300192, Japan
[4] Yamaha Corp, Iwata, Shizuoka 4380192, Japan
关键词
ALGORITHMS;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper, we propose a new hybrid method that concatenates directional clustering and advanced nonnegative matrix factorization (NMF) for the purpose of the specific sound extraction from the multichannel music signal. Multichannel music signal separation technology is aimed to extract a specific target signal from observed multichannel signals that contain multiple instrumental sounds. In the previous studies, various methods using NMF have been proposed, but they remain many problems, e.g., poor convergence in update rules in NMF and lack of robustness. To solve these problems, we propose a new supervised NMF (SNMF) with spectrogram restoration and its hybrid method that concatenates the proposed SNMF after directional clustering. Via extrapolation of supervised spectral bases, the proposed SNMF attempts both target signal separation and reconstruction of the lost target components, which are generated by preceding directional clustering. In addition, we theoretically reveal the trade-off between separation and extrapolation abilities and propose a new scheme for multi-divergence, where optimal divergence can be automatically changed in each time frame according to the local spatial conditions. The results of an evaluation experiment show that our proposed hybrid method outperforms the conventional music signal separation methods.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Semi-Supervised Nonnegative Matrix Factorization
    Lee, Hyekyoung
    Yoo, Jiho
    Choi, Seungjin
    IEEE SIGNAL PROCESSING LETTERS, 2010, 17 (01) : 4 - 7
  • [22] Ray-Space-Based Multichannel Nonnegative Matrix Factorization for Audio Source Separation
    Pezzoli, Mirco
    Carabias-Orti, Julio Jose
    Cobos, Maximo
    Antonacci, Fabio
    Sarti, Augusto
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 369 - 373
  • [23] Minimum-Volume Multichannel Nonnegative Matrix Factorization for Blind Audio Source Separation
    Wang, Jianyu
    Guan, Shanzheng
    Liu, Shupei
    Zhang, Xiao-Lei
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 (29) : 3089 - 3103
  • [24] FLOW-BASED FAST MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION FOR BLIND SOURCE SEPARATION
    Nugraha, Aditya Arie
    Sekiguchi, Kouhei
    Fontaine, Mathieu
    Bando, Yoshiaki
    Yoshii, Kazuyoshi
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 501 - 505
  • [25] AUTOREGRESSIVE FAST MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION FOR JOINT BLIND SOURCE SEPARATION AND DEREVERBERATION
    Sekiguchi, Kouhei
    Bando, Yoshiaki
    Nugraha, Aditya Arie
    Fontaine, Mathieu
    Yoshii, Kazuyoshi
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 511 - 515
  • [26] Ray-Space constrained multichannel Nonnegative Matrix Factorization for Audio Source Separation
    Munoz-Montoro, Antonio J.
    Olivieri, Marco
    Pezzoli, Mirco
    Carabias-Orti, Julio
    Antonacci, Fabio
    Sarti, Augusto
    32ND EUROPEAN SIGNAL PROCESSING CONFERENCE, EUSIPCO 2024, 2024, : 396 - 400
  • [27] Regularized nonnegative matrix factorization using Gaussian mixture priors for supervised single channel source separation
    Grais, Emad M.
    Erdogan, Hakan
    COMPUTER SPEECH AND LANGUAGE, 2013, 27 (03): : 746 - 762
  • [28] Supervised and Semi-supervised Speech Enhancement Using Weighted Nonnegative Matrix Factorization
    Zou, Xia
    Hu, Yonggang
    Zhang, Xiongwei
    2017 9TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2017,
  • [29] Supervised Audio Source Separation Based on Nonnegative Matrix Factorization with Cosine Similarity Penalty
    Iwase, Yuta
    Kitamura, Daichi
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2022, E105A (06) : 906 - 913
  • [30] Continuous Semi-Supervised Nonnegative Matrix Factorization
    Lindstrom, Michael R. R.
    Ding, Xiaofu
    Liu, Feng
    Somayajula, Anand
    Needell, Deanna
    ALGORITHMS, 2023, 16 (04)