Underdetermined Source Separation Based on Generalized Multichannel Variational Autoencoder

被引:18
|
作者
Seki, Shogo [1 ]
Kameoka, Hirokazu [2 ]
Li, Li [3 ]
Toda, Tomoki [4 ]
Takeda, Kazuya [5 ]
机构
[1] Nagoya Univ, Grad Sch Informat, Nagoya, Aichi 4640861, Japan
[2] NTT Corp, Atsugi, Kanagawa 2430198, Japan
[3] Univ Tsukuba, Grad Sch Syst & Informat Engn, Tsukuba, Ibaraki 3058573, Japan
[4] Nagoya Univ, Informat Technol Ctr, Nagoya, Aichi 4640861, Japan
[5] Nagoya Univ, Inst Innovat Future Soc, Nagoya, Aichi 4648603, Japan
来源
IEEE ACCESS | 2019年 / 7卷
关键词
Underdetermined source separation; variational audoencoder; non-negative matrix factorization; AUDIO SOURCE SEPARATION; NONNEGATIVE MATRIX FACTORIZATION;
D O I
10.1109/ACCESS.2019.2954120
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper deals with a multichannel audio source separation problem under underdetermined conditions. Multichannel non-negative matrix factorization (MNMF) is a powerful method for underdetermined audio source separation, which adopts the NMF concept to model and estimate the power spectrograms of the sound sources in a mixture signal. This concept is also used in independent low-rank matrix analysis (ILRMA), a special class of the MNMF formulated under determined conditions. While these methods work reasonably well for particular types of sound sources, one limitation is that they can fail to work for sources with spectrograms that do not comply with the NMF model. To address this limitation, an extension of ILRMA called the multichannel variational autoencoder (MVAE) method was recently proposed, where a conditional VAE (CVAE) is used instead of the NMF model for expressing source power spectrograms. This approach has performed impressively in determined source separation tasks thanks to the representation power of deep neural networks. While the original MVAE method was formulated under determined mixing conditions, this paper proposes a generalized version of it by combining the ideas of MNMF and MVAE so that it can also deal with underdetermined cases. We call this method the generalized MVAE (GMVAE) method. In underdetermined source separation and speech enhancement experiments, the proposed method performed better than baseline methods.
引用
收藏
页码:168104 / 168115
页数:12
相关论文
共 50 条
  • [41] GENERALIZED VARIATIONAL BOUNDS FOR MULTICHANNEL SCATTERINGS
    HAHN, Y
    PHYSICAL REVIEW C, 1970, 1 (01): : 12 - +
  • [42] Wavelet-based underdetermined blind source separation of speech mixtures
    Hamadal, Takehiro
    Nakano, Kazushi
    Ichijo, Akihiro
    2007 INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS, VOLS 1-6, 2007, : 1629 - 1633
  • [43] Underdetermined Blind Source Separation Based on Third-order Statistics
    Zou Liang
    Zhang Peng
    Chen Xun
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2022, 44 (11) : 3960 - 3966
  • [44] PDOA BASED UNDERDETERMINED BLIND SOURCE SEPARATION USING TWO MICROPHONES
    Levi, Avram
    2013 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2013), 2013,
  • [45] A Source Signal Recovery Method for Underdetermined Blind Source Separation based on Shortest Path
    Wang, Chuanchuan
    Jia, Rui
    APPLIED COMPUTATIONAL ELECTROMAGNETICS SOCIETY JOURNAL, 2020, 35 (04): : 406 - 414
  • [46] Underdetermined blind delayed source separation based on single source intervals in frequency domain
    Xiao, Ming
    Xie, Sheng-Li
    Fu, Yu-Li
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2007, 35 (12): : 2279 - 2283
  • [47] Underdetermined Blind Source Separation Algorithm Based on Directional Amplitude Ratio
    Ji C.
    Jiang Y.-T.
    Dongbei Daxue Xuebao/Journal of Northeastern University, 2019, 40 (07): : 920 - 924
  • [48] Underdetermined Blind Source Separation Based on OS-SASP Algorithm
    Ji C.
    Zhang H.
    Geng R.
    Li B.-Q.
    Dongbei Daxue Xuebao/Journal of Northeastern University, 2021, 42 (04): : 501 - 508
  • [49] Underdetermined Blind Source Separation of Adjacent Satellite Interference Based on Sparseness
    Li, Chengjie
    Zhu, Lidong
    Luo, Zhongqiang
    CHINA COMMUNICATIONS, 2017, 14 (04) : 140 - 149
  • [50] Nonnegative Mixture for Underdetermined Blind Source Separation Based on a Tensor Algorithm
    Sunan Ge
    Jie Han
    Min Han
    Circuits, Systems, and Signal Processing, 2015, 34 : 2935 - 2950