Post-filtering with multichannel power spectral estimation using joint diagonalization in multi-speaker environments

Cited by: 0

Authors:

Dam, Hai Quang [1]

Nordholm, Sven [1]

Dam, Hai Huyen [1]

Low, Siow Yong [1]

Affiliations:

[1] Univ Western Australia, Western Australian Telecommun Res Inst, Nedlands, WA 6009, Australia

Source:

2006 ASIA-PACIFIC CONFERENCE ON COMMUNICATION, VOLS 1 AND 2 | 2006

Funding:

Australian Research Council

Keywords:

DOI:

Not available

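Chinese Library Classification:

TM [Electrical engineering]; TN [Electronic technology, communication technology]

Discipline classification codes:

0808; 0809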

Abstract:

This paper addresses the problem of extracting a desired speech signal from the signals received by a microphone array in multi-speaker environments. A new beamformer structure is proposed, which combines a fixed beamformer with a post-filtering technique. In the first stage, a fixed beamformer is designed to spatially extract the desired speech signal by suppressing the other, undesired speech signals. In the second stage, a post-filter that employs a power spectral estimator based on joint diagonalization is applied to increase the suppression capability. Evaluations using recordings from a real room environment show that the proposed beamformer offers a good interference suppression level whilst maintaining a low distortion level of the desired source.
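
The abstract names joint diagonalization but does not spell out the estimator, so the following is only a generic illustration of that linear-algebra step, not the authors' method. It is a minimal sketch, assuming two Hermitian spatial correlation matrices at a single frequency bin, and it finds one matrix V that diagonalizes both. The function name `joint_diagonalize`, the helper `random_corr`, the microphone count, and the synthetic data are all hypothetical.

```python
import numpy as np


def joint_diagonalize(R_a, R_b):
    """Return (V, d) with V^H R_a V = I and V^H R_b V = diag(d).

    Assumes R_a is Hermitian positive definite and R_b is Hermitian.
    """
    # Whitening step: factor R_a = L L^H and invert the factor.
    L = np.linalg.cholesky(R_a)
    L_inv = np.linalg.inv(L)
    # R_b expressed in the whitened coordinates is still Hermitian.
    C = L_inv @ R_b @ L_inv.conj().T
    C = (C + C.conj().T) / 2  # remove rounding asymmetry
    # A unitary eigenbasis of C completes the joint diagonalization.
    d, U = np.linalg.eigh(C)
    V = L_inv.conj().T @ U
    return V, d


def random_corr(rng, n_mics, n_frames=200):
    """Sample correlation matrix of synthetic complex multichannel frames."""
    X = rng.standard_normal((n_mics, n_frames)) \
        + 1j * rng.standard_normal((n_mics, n_frames))
    return X @ X.conj().T / n_frames


rng = np.random.default_rng(0)
n_mics = 4  # hypothetical array size
R_a = random_corr(rng, n_mics)
R_b = random_corr(rng, n_mics)

V, d = joint_diagonalize(R_a, R_b)
D_a = V.conj().T @ R_a @ V  # identity, up to rounding
D_b = V.conj().T @ R_b @ V  # diagonal, with d on the diagonal
```

In this sketch the entries of `d` are the generalized eigenvalues of the pair (R_b, R_a). How such quantities would feed a power spectral estimate for the post-filter is not stated in the abstract and is left open here.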

Pages: 938 / +

Page count: 2

