Hybrid Multichannel Signal Separation Using Supervised Nonnegative Matrix Factorization with Spectrogram Restoration

被引：0

作者：

Kitamura, Daichi ^{[1
]}

Saruwatari, Hiroshi ^{[2
]}

Nakamura, Satoshi ^{[3
]}

Takahashi, Yu ^{[4
]}

Kondo, Kazunobu ^{[4
]}

Kameoka, Hirokazu ^{[2
]}

机构：

[1] Grad Univ Adv Studies, Chiyoda Ku, 2-1-2 Hitotsubashi, Tokyo 1018430, Japan

[2] Univ Tokyo, Bunkyo Ku, Tokyo 1138656, Japan

[3] Nara Inst Sci & Technol, Ikoma, Nara 6300192, Japan

[4] Yamaha Corp, Iwata, Shizuoka 4380192, Japan

来源：

2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA) | 2014年

关键词：

ALGORITHMS;

D O I：

暂无

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

In this paper, we propose a new hybrid method that concatenates directional clustering and advanced nonnegative matrix factorization (NMF) for the purpose of the specific sound extraction from the multichannel music signal. Multichannel music signal separation technology is aimed to extract a specific target signal from observed multichannel signals that contain multiple instrumental sounds. In the previous studies, various methods using NMF have been proposed, but they remain many problems, e.g., poor convergence in update rules in NMF and lack of robustness. To solve these problems, we propose a new supervised NMF (SNMF) with spectrogram restoration and its hybrid method that concatenates the proposed SNMF after directional clustering. Via extrapolation of supervised spectral bases, the proposed SNMF attempts both target signal separation and reconstruction of the lost target components, which are generated by preceding directional clustering. In addition, we theoretically reveal the trade-off between separation and extrapolation abilities and propose a new scheme for multi-divergence, where optimal divergence can be automatically changed in each time frame according to the local spatial conditions. The results of an evaluation experiment show that our proposed hybrid method outperforms the conventional music signal separation methods.

引用

页数：10

共 50 条

[21] Semi-Supervised Nonnegative Matrix Factorization
Lee, Hyekyoung
Yoo, Jiho
Choi, Seungjin
IEEE SIGNAL PROCESSING LETTERS, 2010, 17 (01) : 4 - 7
[22] Ray-Space-Based Multichannel Nonnegative Matrix Factorization for Audio Source Separation
Pezzoli, Mirco
Carabias-Orti, Julio Jose
Cobos, Maximo
Antonacci, Fabio
Sarti, Augusto
IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 369 - 373
[23] Minimum-Volume Multichannel Nonnegative Matrix Factorization for Blind Audio Source Separation
Wang, Jianyu
Guan, Shanzheng
Liu, Shupei
Zhang, Xiao-Lei
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 (29) : 3089 - 3103
[24] FLOW-BASED FAST MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION FOR BLIND SOURCE SEPARATION
Nugraha, Aditya Arie
Sekiguchi, Kouhei
Fontaine, Mathieu
Bando, Yoshiaki
Yoshii, Kazuyoshi
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 501 - 505
[25] AUTOREGRESSIVE FAST MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION FOR JOINT BLIND SOURCE SEPARATION AND DEREVERBERATION
Sekiguchi, Kouhei
Bando, Yoshiaki
Nugraha, Aditya Arie
Fontaine, Mathieu
Yoshii, Kazuyoshi
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 511 - 515
[26] Ray-Space constrained multichannel Nonnegative Matrix Factorization for Audio Source Separation
Munoz-Montoro, Antonio J.
Olivieri, Marco
Pezzoli, Mirco
Carabias-Orti, Julio
Antonacci, Fabio
Sarti, Augusto
32ND EUROPEAN SIGNAL PROCESSING CONFERENCE, EUSIPCO 2024, 2024, : 396 - 400
[27] Regularized nonnegative matrix factorization using Gaussian mixture priors for supervised single channel source separation
Grais, Emad M.
Erdogan, Hakan
COMPUTER SPEECH AND LANGUAGE, 2013, 27 (03): : 746 - 762
[28] Supervised and Semi-supervised Speech Enhancement Using Weighted Nonnegative Matrix Factorization
Zou, Xia
Hu, Yonggang
Zhang, Xiongwei
2017 9TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2017,
[29] Supervised Audio Source Separation Based on Nonnegative Matrix Factorization with Cosine Similarity Penalty
Iwase, Yuta
Kitamura, Daichi
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2022, E105A (06) : 906 - 913
[30] Continuous Semi-Supervised Nonnegative Matrix Factorization
Lindstrom, Michael R. R.
Ding, Xiaofu
Liu, Feng
Somayajula, Anand
Needell, Deanna
ALGORITHMS, 2023, 16 (04)

← 1 2 3 4 5 →