A SUPERVISED MULTI-CHANNEL SPEECH ENHANCEMENT ALGORITHM BASED ON BAYESIAN NMF MODEL

被引:0
|
作者
Chung, Hanwook [1 ]
Plourde, Eric [2 ]
Champagne, Benoit [1 ]
机构
[1] McGill Univ, Dept Elect & Comp Engn, Montreal, PQ, Canada
[2] Sherbrooke Univ, Dept Elect & Comp Engn, Sherbrooke, PQ, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Multi-channel speech enhancement; MVDR beamforming; non-negative matrix factorization; probabilistic generative model; variational Bayesian expectation-maximization; CONVOLUTIVE MIXTURES; ENVIRONMENT;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we introduce a supervised multi-channel speech enhancement algorithm based on a Bayesian multi-channel non-negative matrix factorization (MNMF) model. In the proposed framework, we consider the probabilistic generative model (PGM) of MNMF, specified by Poisson-distributed latent variables and gamma-distributed priors. In the training stage, the MNMF parameters of the speech and noise sources are estimated via the variational Bayesian expectation-maximization (VBEM) algorithm. In the enhancement stage, the clean speech signal is estimated via the MNMF-based minimum variance distortionless response (MVDR) beamformer. To further improve the enhanced speech quality, we efficiently combine the MNMF-based beamforming technique with a classical unsupervised single-channel enhancement method. Experiments show that the proposed method can provide better enhancement performance than the selected benchmarks.
引用
收藏
页码:221 / 225
页数:5
相关论文
共 50 条
  • [41] DESNET: A MULTI-CHANNEL NETWORK FOR SIMULTANEOUS SPEECH DEREVERBERATION, ENHANCEMENT AND SEPARATION
    Fu, Yihui
    Wu, Jian
    Hu, Yanxin
    Xing, Mengtao
    Xie, Lei
    2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 857 - 864
  • [42] Construction of microphone arrays for the optimization of multi-channel speech enhancement systems
    Drews, M
    FREQUENZ, 1996, 50 (9-10) : 223 - 227
  • [43] Improved Semi-Supervised NMF Based Real-Time Capable Speech Enhancement
    Hu, Yonggang
    Zhang, Xiongwei
    Zou, Xia
    Sun, Meng
    Min, Gang
    Li, Yinan
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2016, E99A (01) : 402 - 406
  • [44] A Cross-channel Attention-based Wave-U-Net for Multi-channel Speech Enhancement
    Ho, Minh Tri
    Lee, Jinyoung
    Lee, Bong-Ki
    Yi, Dong Hoon
    Kang, Hong-Goo
    INTERSPEECH 2020, 2020, : 4049 - 4053
  • [45] A Multi-Channel Noise Estimator Based on Improved Minima Controlled Recursive Averaging for Speech Enhancement
    Tangsangiumvisai, Nisachon
    ENGINEERING JOURNAL-THAILAND, 2023, 27 (11): : 99 - 112
  • [46] Online LSTM-based Iterative Mask Estimation for Multi-Channel Speech Enhancement and ASR
    Tu, Yan-Hui
    Du, Jun
    Zhou, Nan
    Lee, Chin-Hui
    2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 362 - 366
  • [47] Speech enhancement based on a combined multi-channel array with constrained interative and auditory masked processing
    Zhang, XX
    Hansen, JHL
    Arehart, K
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 229 - 232
  • [48] A Two-step NMF Based Algorithm for Single Channel Speech Separation
    Wang, Shuo
    Wu, Wenjun
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1987 - 1990
  • [49] Single Channel Blind Source Separation Based on NMF and Its Application to Speech Enhancement
    Chen, Yongqiang
    2017 IEEE 9TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN), 2017, : 1066 - 1069
  • [50] Signed Convex Combination of Fast Convergence Algorithm to Generalized Sidelobe Canceller Beamformer for Multi-Channel Speech Enhancement
    Priyanka, Siva S.
    Kumar, Kishore T.
    TRAITEMENT DU SIGNAL, 2021, 38 (03) : 785 - 795