LOCAL GAUSSIAN MODEL WITH SOURCE-SET CONSTRAINTS IN AUDIO SOURCE SEPARATION

被引:0
|
作者
Ikeshita, Rintaro [1 ]
Togami, Masahito [1 ]
Kawaguchi, Yohei [1 ]
Fujita, Yusuke [1 ]
Nagamatsu, Kenji [1 ]
机构
[1] Hitachi Ltd, Res & Dev Grp, Tokyo, Japan
关键词
Blind audio source separation; local Gaussian model; time-frequency mask; diffusion noise; permutation alignment; NONNEGATIVE MATRIX FACTORIZATION; MIXTURES;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
To improve the performance of blind audio source separation of convolutive mixtures, the local Gaussian model (LGM) having full rank covariance matrices proposed by Duong et al. is extended. The previous model basically assumes that all sources contribute to each time-frequency slot, which may fail to capture the characteristic of signals with many intermittent silent periods. A constraint on source sets that contribute to each time-frequency slot is therefore explicitly introduced. This approach can be regarded as a relaxation of the sparsity constraint in the conventional time-frequency mask. The proposed model is jointly optimized among the original local Gaussian model parameters, the relaxed version of the time-frequency mask, and a permutation alignment, leading to a robust permutation-free algorithm. We also present a novel multi-channel Wiener filter weighted by a relaxed version of the time-frequency mask. Experimental results over noisy speech signals show that the proposed model is effective compared with the original local Gaussian model and is comparable to its extension, the multi-channel nonnegative matrix factorization.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Blind source separation based on generalized Gaussian model
    Information Engineering College, Shanghai Maritime University, Shanghai 200135, China
    不详
    J. Harbin Inst. Technol., 2007, 3 (362-367):
  • [22] Maximum likelihood approach for blind audio source separation using time-frequency Gaussian source models
    Févotte, C
    Cardoso, JF
    2005 WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2005, : 78 - 81
  • [23] Audio source separation with a signal-adaptive local cosine transform
    Nesbit, Andrew
    Plumbley, Mark D.
    Davies, Mike E.
    SIGNAL PROCESSING, 2007, 87 (08) : 1848 - 1858
  • [24] Gaussian processes for source separation
    Park, Sunho
    Choi, Seungjin
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 1909 - 1912
  • [25] Towards a model of perceived quality of blind audio source separation
    Fox, Brendan
    Pardo, Bryan
    2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 1898 - 1901
  • [26] Joint Audio Inpainting and Source Separation
    Bilen, Cagdas
    Ozerov, Alexey
    Perez, Patrick
    LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION, LVA/ICA 2015, 2015, 9237 : 251 - 258
  • [27] Single channel audio source separation
    Gao, Bin
    Woo, W.L.
    Dlay, S.S.
    WSEAS Transactions on Signal Processing, 2008, 4 (04): : 173 - 182
  • [28] Audio source separation of convolutive mixtures
    Mitianoudis, N
    Davies, ME
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (05): : 489 - 497
  • [29] MOTION INFORMED AUDIO SOURCE SEPARATION
    Parekh, Sanjeel
    Essid, Slim
    Ozerov, Alexey
    Duong, Ngoc Q. K.
    Perez, Patrick
    Richard, Gael
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 6 - 10
  • [30] AN OVERVIEW OF INFORMED AUDIO SOURCE SEPARATION
    Liutkus, Antoine
    Durrieu, Jean-Louis
    Daudet, Laurent
    Richard, Gael
    2013 14TH INTERNATIONAL WORKSHOP ON IMAGE ANALYSIS FOR MULTIMEDIA INTERACTIVE SERVICES (WIAMIS), 2013,