Post-filtering with multichannel power spectral estimation using joint diagonalization in multi-speaker environments

被引:0
|
作者
Dam, Hai Quang [1 ]
Nordholm, Sven [1 ]
Dam, Hai Huyen [1 ]
Low, Siow Yong [1 ]
机构
[1] Univ Western Australia, Western Australian Telecommun Res Inst, Nedlands, WA 6009, Australia
基金
澳大利亚研究理事会;
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper addresses the problem of extracting a desired speech signal from the received signals by a microphone array in multi-speaker environments. A new beamformer structure is proposed, which combines a fixed beamformer with a post-filtering technique. In the first stage, a fixed beamformer is designed to spatially extract the desired speech signal by suppressing other undesired speech signals. In the second stage, a post-filter employs a power spectral estimator using the joint diagonalization is proposed to increase the suppression capability. Evaluations using recordings from a real room environment show that the proposed beamformer offers a good interference suppression level whilst maintaining a low distortion level of the desired source.
引用
收藏
页码:938 / +
页数:2
相关论文
共 36 条
  • [1] Multichannel post-filtering in nonstationary noise environments
    Cohen, I
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2004, 52 (05) : 1149 - 1160
  • [2] Speaker Verification in Multi-Speaker Environments Using Temporal Feature Fusion
    Aloradi, Ahmad
    Mack, Wolfgang
    Elminshawi, Mohamed
    Habets, EmanuM A. P.
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 354 - 358
  • [3] Multi-speaker DoA Estimation Using Audio and Visual Modality
    Yulin Wu
    Ruimin Hu
    Xiaochen Wang
    Shanfa Ke
    Neural Processing Letters, 2023, 55 : 8887 - 8901
  • [4] Multi-speaker DoA Estimation Using Audio and Visual Modality
    Wu, Yulin
    Hu, Ruimin
    Wang, Xiaochen
    Ke, Shanfa
    NEURAL PROCESSING LETTERS, 2023, 55 (07) : 8887 - 8901
  • [5] Application of combined temporal and spectral processing methods for speaker recognition under noisy, reverberant or multi-speaker environments
    P. Krishnamoorthy
    S. R. Mahadeva Prasanna
    Sadhana, 2009, 34 : 729 - 754
  • [6] Application of combined temporal and spectral processing methods for speaker recognition under noisy, reverberant or multi-speaker environments
    Krishnamoorthy, P.
    Prasanna, S. R. Mahadeva
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2009, 34 (05): : 729 - 754
  • [7] Joint estimation of pitch and direction of arrival: improving robustness and accuracy for multi-speaker scenarios
    Gerlach, Stephan
    Bitzer, Joerg
    Goetze, Stefan
    Doclo, Simon
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2014,
  • [8] Joint iterative multi-speaker identification and source separation using expectation propagation
    Walsh, John MacLaren
    Kim, Youngmoo E.
    Doll, Travis M.
    2007 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 2007, : 245 - 248
  • [9] Postfiltering Using Multichannel Spectral Estimation in Multispeaker Environments
    Hai Quang Dam
    Sven Nordholm
    Hai Huyen Dam
    Siow Yong Low
    EURASIP Journal on Advances in Signal Processing, 2008
  • [10] Joint estimation of pitch and direction of arrival: improving robustness and accuracy for multi-speaker scenarios
    Stephan Gerlach
    Jörg Bitzer
    Stefan Goetze
    Simon Doclo
    EURASIP Journal on Audio, Speech, and Music Processing, 2014 (1)