Post-filtering with multichannel power spectral estimation using joint diagonalization in multi-speaker environments

被引：0

作者：

Dam, Hai Quang ^{[1
]}

Nordholm, Sven ^{[1
]}

Dam, Hai Huyen ^{[1
]}

Low, Siow Yong ^{[1
]}

机构：

[1] Univ Western Australia, Western Australian Telecommun Res Inst, Nedlands, WA 6009, Australia

来源：

2006 ASIA-PACIFIC CONFERENCE ON COMMUNICATION, VOLS 1 AND 2 | 2006年

基金：

澳大利亚研究理事会;

关键词：

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This paper addresses the problem of extracting a desired speech signal from the received signals by a microphone array in multi-speaker environments. A new beamformer structure is proposed, which combines a fixed beamformer with a post-filtering technique. In the first stage, a fixed beamformer is designed to spatially extract the desired speech signal by suppressing other undesired speech signals. In the second stage, a post-filter employs a power spectral estimator using the joint diagonalization is proposed to increase the suppression capability. Evaluations using recordings from a real room environment show that the proposed beamformer offers a good interference suppression level whilst maintaining a low distortion level of the desired source.

引用

页码：938 / +

页数：2

共 36 条

[21] JOINT ESTIMATION OF LATE REVERBERANT AND SPEECH POWER SPECTRAL DENSITIES IN NOISY ENVIRONMENTS USING FROBENIUS NORM
Schwartz, Ofer
Gannot, Sharon
Habets, Emanuel A. P.
2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 1123 - 1127
[22] Localization-Driven Speech Enhancement in Noisy Multi-Speaker Hospital Environments Using Deep Learning and Meta Learning
Barhoush, Mahdi
Hallawa, Ahmed
Peine, Arne
Martin, Lukas
Schmeink, Anke
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 670 - 683
[23] Speech Enhancement Using Multi-channel Post-Filtering with Modified Signal Presence Probability in Reverberant Environment
Wang Xiaofei
Guo Yanmeng
Fu Qiang
Yan Yonghong
CHINESE JOURNAL OF ELECTRONICS, 2016, 25 (03) : 512 - 519
[24] Speech Enhancement Using Multi-channel Post-Filtering with Modified Signal Presence Probability in Reverberant Environment
WANG Xiaofei
GUO Yanmeng
FU Qiang
YAN Yonghong
Chinese Journal of Electronics, 2016, 25 (03) : 512 - 519
[25] Learning soft mask with DNN and DNN-SVM for multi-speaker DOA estimation using an acoustic vector sensor
Wang, Disong
Zou, Yuexian
Wang, Wenwu
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2018, 355 (04): : 1692 - 1709
[26] Deep Learning Based Audio-Visual Multi-Speaker DOA Estimation Using Permutation-Free Loss Function
Wang, Qing
Chen, Hang
Jiang, Ya
Wang, Zhe
Wang, Yuyang
Du, Jun
Lee, Chin-Hui
2022 13TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2022, : 250 - 254
[27] JOINT MAXIMUM LIKELIHOOD ESTIMATION OF LATE REVERBERANT AND SPEECH POWER SPECTRAL DENSITY IN NOISY ENVIRONMENTS
Schwartz, Ofer
Gannot, Sharon
Habets, Emanueel A. P.
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 151 - 155
[28] Recognition of Dysarthric Speech Using Voice Parameters for Speaker Adaptation and Multi-taper Spectral Estimation
Bhat, Chitralekha
Vachhani, Bhavik
Kopparapu, Sunil
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 228 - 232
[29] Deep Learning Post-Filtering Using Multi-Head Attention and Multiresolution Feature Fusion for Image and Intra-Video Quality Enhancement
Schiopu, Ionut
Munteanu, Adrian
SENSORS, 2022, 22 (04)
[30] 2-D DOAs estimation in impulsive noise environments using joint diagonalization fractional lower-order spatio-temporal matrices
TieQi Xia
Qun Wan
XueGang Wang
Yi Zheng
Science in China Series F: Information Sciences, 2008, 51 : 1585 - 1593

← 1 2 3 4 →