Independent Vector Analysis for Blind Speech Separation Using Complex Generalized Gaussian Mixture Model with Weighted Variance

Cited by: 0
Authors
Tang, Xinyu [1, 2]
Chen, Rilin [1]
Wang, Xiyuan [3]
Zhou, Yi [2]
Su, Dan [1]
Affiliations
[1] Tencent AI Lab, Beijing 100193, Peoples R China
[2] Chongqing Univ Posts & Telecommun, Sch Commun & Informat Engn, Chongqing 400065, Peoples R China
[3] Beijing Informat Sci & Technol Univ, Sch Informat & Commun Engn, Beijing 100101, Peoples R China
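Keywords
DOI
Not available
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract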
In this paper, we propose using a complex generalized Gaussian mixture distribution with weighted variance for speech modelling and devise an improved independent vector analysis (IVA) algorithm for blind speech separation (BSS). Capable of capturing both non-Gaussianity and non-stationarity, the proposed complex generalized Gaussian mixture model (CGGMM) allows for a much more flexible characterization of practical speech signals. The majorization-minimization (MM) framework is adopted for the IVA algorithm design. Each iteration of the algorithm comprises updates of the demixing matrices and of the mixture model parameters. For the demixing matrices, the update operates in a manner similar to that of the auxiliary-function-based IVA (AuxIVA) method, and for the mixture parameters, an expectation-maximization (EM) update is performed. As both updates are in closed form and pre-whitening is not a prerequisite, the IVA algorithm under CGGMM is of low complexity and can be carried out efficiently. Experimental results show that the proposed algorithm outperforms existing ones in terms of separation accuracy and also enjoys a fast convergence rate in both simulated and real environments.
Pages: 720 - 726
Number of pages: 7
Related papers
50 in total
  • [11] Emotion Recognition from Speech using Gaussian Mixture Model and Vector Quantization
    Agrawal, Surabhi
    Dongaonkar, Shabda
    2015 4TH INTERNATIONAL CONFERENCE ON RELIABILITY, INFOCOM TECHNOLOGIES AND OPTIMIZATION (ICRITO) (TRENDS AND FUTURE DIRECTIONS), 2015,
  • [12] Hybrid Source Prior Based Independent Vector Analysis for Blind Separation of Speech Signals
    Khan, Junaid Bahadar
    Jan, Tariqullah
    Khalil, Ruhul Amin
    Altalbe, Ali
    IEEE ACCESS, 2020, 8 : 132871 - 132881
  • [13] Research of City Engineering Speech Blind Separation Algorithms Based on Independent Vector Analysis
    Yang, Zhuo
    Li, Chun-ming
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ELECTRONIC & MECHANICAL ENGINEERING AND INFORMATION TECHNOLOGY (EMEIT-2012), 2012, 23
  • [14] Experiment of Blind Signal Separation of Wireless Mixture using Complex Valued Fast Independent Component Analysis
    Shiomi, Hidehisa
    Yata, Tatsuro
    Okamura, Yasuyuki
    2008 IEEE ANTENNAS AND PROPAGATION SOCIETY INTERNATIONAL SYMPOSIUM, VOLS 1-9, 2008, : 1516 - 1519
  • [15] Blind source separation based on generalized Gaussian model
    杨斌
    孔薇
    周越
    Journal of Harbin Institute of Technology, 2007, (03) : 362 - 367
  • [16] Blind source separation based on generalized Gaussian model
    Information Engineering College, Shanghai Maritime University, Shanghai 200135, China
    Unknown
    J. Harbin Inst. Technol., 2007, 3 (362-367):
  • [17] Independent Vector Analysis for Source Separation Using a Mixture of Gaussians Prior
    Hao, Jiucang
    Lee, Intae
    Lee, Te-Won
    Sejnowski, Terrence J.
    NEURAL COMPUTATION, 2010, 22 (06) : 1646 - 1673
  • [18] Parallel structured independent component analysis for SIMO-model-based blind separation and deconvolution of convolutive speech mixture
    Saruwatari, H
    Yamajo, H
    Takatani, T
    Nishikawa, T
    Shikano, K
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003, : 714 - 719
  • [19] Nonorthogonal Independent Vector Analysis Using Multivariate Gaussian Model
    Anderson, Matthew
    Li, Xi-Lin
    Adali, Tuelay
    LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION, 2010, 6365 : 354 - 361
  • [20] Speech enhancement based on speech spectral complex Gaussian Mixture Model
    Ding, GH
    Wang, X
    Cao, Y
    Ding, F
    Tang, YZ
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 165 - 168