A SUPERVISED MULTI-CHANNEL SPEECH ENHANCEMENT ALGORITHM BASED ON BAYESIAN NMF MODEL

被引:0
|
作者
Chung, Hanwook [1 ]
Plourde, Eric [2 ]
Champagne, Benoit [1 ]
机构
[1] McGill Univ, Dept Elect & Comp Engn, Montreal, PQ, Canada
[2] Sherbrooke Univ, Dept Elect & Comp Engn, Sherbrooke, PQ, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Multi-channel speech enhancement; MVDR beamforming; non-negative matrix factorization; probabilistic generative model; variational Bayesian expectation-maximization; CONVOLUTIVE MIXTURES; ENVIRONMENT;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we introduce a supervised multi-channel speech enhancement algorithm based on a Bayesian multi-channel non-negative matrix factorization (MNMF) model. In the proposed framework, we consider the probabilistic generative model (PGM) of MNMF, specified by Poisson-distributed latent variables and gamma-distributed priors. In the training stage, the MNMF parameters of the speech and noise sources are estimated via the variational Bayesian expectation-maximization (VBEM) algorithm. In the enhancement stage, the clean speech signal is estimated via the MNMF-based minimum variance distortionless response (MVDR) beamformer. To further improve the enhanced speech quality, we efficiently combine the MNMF-based beamforming technique with a classical unsupervised single-channel enhancement method. Experiments show that the proposed method can provide better enhancement performance than the selected benchmarks.
引用
收藏
页码:221 / 225
页数:5
相关论文
共 50 条
  • [31] Factorized MVDR Deep Beamforming for Multi-Channel Speech Enhancement
    Kim, Hansol
    Kang, Kyeongmuk
    Shin, Jong Won
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1898 - 1902
  • [32] A separation and interaction framework for causal multi-channel speech enhancement
    Liu, Wenzhe
    Li, Andong
    Zheng, Chengshi
    Li, Xiaodong
    DIGITAL SIGNAL PROCESSING, 2022, 126
  • [33] Multi-channel Speech Enhancement with Multiple-target GANs
    Yuan, Jing
    Bao, Changchun
    2020 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (IEEE ICSPCC 2020), 2020,
  • [34] Multi-channel speech dereverberation based on a statistical model of late reverberation
    Habets, EAP
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 173 - 176
  • [35] Blind dereverberation algorithm for speech signals based on multi-channel linear prediction
    Delcroix, Marc
    Hikichi, Takafumi
    Miyoshi, Masato
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2005, 26 (05) : 432 - 439
  • [36] Multi-scale decomposition based supervised single channel deep speech enhancement
    Saleem, Nasir
    Khattak, Muhammad Irfan
    APPLIED SOFT COMPUTING, 2020, 95
  • [37] Noise eigenvalue modification methods for spatial subspace based multi-channel speech enhancement
    Kim, Gibak
    Cho, Nam Ik
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 573 - +
  • [38] MULTI-CHANNEL SIGNAL ENHANCEMENT WITH SPEECH AND NOISE COVARIANCE ESTIMATES COMPUTED BY A PROBABILISTIC LOCALIZATION MODEL
    Anemueller, Joern
    Kayser, Hendrik
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 156 - 160
  • [39] Discriminative Training of NMF Model Based on Class Probabilities for Speech Enhancement
    Chung, Hanwook
    Plourde, Eric
    Champagne, Benoit
    IEEE SIGNAL PROCESSING LETTERS, 2016, 23 (04) : 502 - 506
  • [40] Speech enhancement by multi-channel crosstalk resistant adaptive noise cancellation
    Zeng, Qingning
    Abdulla, Waleed H.
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 485 - 488