Hybrid Multichannel Signal Separation Using Supervised Nonnegative Matrix Factorization with Spectrogram Restoration

被引:0
|
作者
Kitamura, Daichi [1 ]
Saruwatari, Hiroshi [2 ]
Nakamura, Satoshi [3 ]
Takahashi, Yu [4 ]
Kondo, Kazunobu [4 ]
Kameoka, Hirokazu [2 ]
机构
[1] Grad Univ Adv Studies, Chiyoda Ku, 2-1-2 Hitotsubashi, Tokyo 1018430, Japan
[2] Univ Tokyo, Bunkyo Ku, Tokyo 1138656, Japan
[3] Nara Inst Sci & Technol, Ikoma, Nara 6300192, Japan
[4] Yamaha Corp, Iwata, Shizuoka 4380192, Japan
关键词
ALGORITHMS;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper, we propose a new hybrid method that concatenates directional clustering and advanced nonnegative matrix factorization (NMF) for the purpose of the specific sound extraction from the multichannel music signal. Multichannel music signal separation technology is aimed to extract a specific target signal from observed multichannel signals that contain multiple instrumental sounds. In the previous studies, various methods using NMF have been proposed, but they remain many problems, e.g., poor convergence in update rules in NMF and lack of robustness. To solve these problems, we propose a new supervised NMF (SNMF) with spectrogram restoration and its hybrid method that concatenates the proposed SNMF after directional clustering. Via extrapolation of supervised spectral bases, the proposed SNMF attempts both target signal separation and reconstruction of the lost target components, which are generated by preceding directional clustering. In addition, we theoretically reveal the trade-off between separation and extrapolation abilities and propose a new scheme for multi-divergence, where optimal divergence can be automatically changed in each time frame according to the local spatial conditions. The results of an evaluation experiment show that our proposed hybrid method outperforms the conventional music signal separation methods.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] A Generalization of Laplace Nonnegative Matrix Factorization and Its Multichannel Extension
    Tanji, Hiroki
    Murakami, Takahiro
    Kamata, Hiroyuki
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1694 - 1699
  • [42] TRANSDUCTIVE NONNEGATIVE MATRIX FACTORIZATION FOR SEMI-SUPERVISED HIGH-PERFORMANCE SPEECH SEPARATION
    Guan, Naiyang
    Lan, Long
    Tao, Dacheng
    Luo, Zhigang
    Yang, Xuejun
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [43] Blind image separation using Nonnegative Matrix Factorization with Gibbs smoothing
    Zdunek, Rafal
    Cichocki, Andrzej
    NEURAL INFORMATION PROCESSING, PART II, 2008, 4985 : 519 - +
  • [44] A HYBRID ITERATIVE ALGORITHM FOR NONNEGATIVE MATRIX FACTORIZATION
    Soltuz, Stefan M.
    Wang, Wenwu
    Jackson, Philip J. B.
    2009 IEEE/SP 15TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING, VOLS 1 AND 2, 2009, : 409 - 412
  • [45] Generalized Fast Multichannel Nonnegative Matrix Factorization Based on Gaussian Scale Mixtures for Blind Source Separation
    Fontaine, Mathieu
    Sekiguchi, Kouhei
    Nugraha, Aditya Arie
    Bando, Yoshiaki
    Yoshii, Kazuyoshi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 1734 - 1748
  • [46] Convolutive Transfer Function-Based Multichannel Nonnegative Matrix Factorization for Overdetermined Blind Source Separation
    Wang, Taihui
    Yang, Feiran
    Yang, Jun
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 802 - 815
  • [47] Nonnegative Matrix Factorization Using Nonnegative Polynomial Approximations
    Debals, Otto
    Van Barel, Marc
    De Lathauwer, Lieven
    IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (07) : 948 - 952
  • [48] MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION IN CONVOLUTIVE MIXTURES. WITH APPLICATION TO BLIND AUDIO SOURCE SEPARATION.
    Ozerov, Alexey
    Fevotte, Cedric
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3137 - +
  • [49] Spectrogram analysis of fission chamber's outputs signals using nonnegative matrix and tensor factorization algorithms
    Arahmane, Hanane
    Cherkaoui El Moursli, Rajaa
    Hamzaoui, El-Mehdi
    2018 15TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS AND DEVICES (SSD), 2018, : 642 - 647
  • [50] Hybrid Projective Nonnegative Matrix Factorization With Drum Dictionaries for Harmonic/Percussive Source Separation
    Laroche, Clement
    Kowalski, Matthieu
    Papadopoulos, Helene
    Richard, Gael
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (09) : 1499 - 1511