Post-filtering with multichannel power spectral estimation using joint diagonalization in multi-speaker environments

被引:0
|
作者
Dam, Hai Quang [1 ]
Nordholm, Sven [1 ]
Dam, Hai Huyen [1 ]
Low, Siow Yong [1 ]
机构
[1] Univ Western Australia, Western Australian Telecommun Res Inst, Nedlands, WA 6009, Australia
基金
澳大利亚研究理事会;
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper addresses the problem of extracting a desired speech signal from the received signals by a microphone array in multi-speaker environments. A new beamformer structure is proposed, which combines a fixed beamformer with a post-filtering technique. In the first stage, a fixed beamformer is designed to spatially extract the desired speech signal by suppressing other undesired speech signals. In the second stage, a post-filter employs a power spectral estimator using the joint diagonalization is proposed to increase the suppression capability. Evaluations using recordings from a real room environment show that the proposed beamformer offers a good interference suppression level whilst maintaining a low distortion level of the desired source.
引用
收藏
页码:938 / +
页数:2
相关论文
共 36 条
  • [21] JOINT ESTIMATION OF LATE REVERBERANT AND SPEECH POWER SPECTRAL DENSITIES IN NOISY ENVIRONMENTS USING FROBENIUS NORM
    Schwartz, Ofer
    Gannot, Sharon
    Habets, Emanuel A. P.
    2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 1123 - 1127
  • [22] Localization-Driven Speech Enhancement in Noisy Multi-Speaker Hospital Environments Using Deep Learning and Meta Learning
    Barhoush, Mahdi
    Hallawa, Ahmed
    Peine, Arne
    Martin, Lukas
    Schmeink, Anke
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 670 - 683
  • [23] Speech Enhancement Using Multi-channel Post-Filtering with Modified Signal Presence Probability in Reverberant Environment
    Wang Xiaofei
    Guo Yanmeng
    Fu Qiang
    Yan Yonghong
    CHINESE JOURNAL OF ELECTRONICS, 2016, 25 (03) : 512 - 519
  • [24] Speech Enhancement Using Multi-channel Post-Filtering with Modified Signal Presence Probability in Reverberant Environment
    WANG Xiaofei
    GUO Yanmeng
    FU Qiang
    YAN Yonghong
    Chinese Journal of Electronics, 2016, 25 (03) : 512 - 519
  • [25] Learning soft mask with DNN and DNN-SVM for multi-speaker DOA estimation using an acoustic vector sensor
    Wang, Disong
    Zou, Yuexian
    Wang, Wenwu
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2018, 355 (04): : 1692 - 1709
  • [26] Deep Learning Based Audio-Visual Multi-Speaker DOA Estimation Using Permutation-Free Loss Function
    Wang, Qing
    Chen, Hang
    Jiang, Ya
    Wang, Zhe
    Wang, Yuyang
    Du, Jun
    Lee, Chin-Hui
    2022 13TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2022, : 250 - 254
  • [27] JOINT MAXIMUM LIKELIHOOD ESTIMATION OF LATE REVERBERANT AND SPEECH POWER SPECTRAL DENSITY IN NOISY ENVIRONMENTS
    Schwartz, Ofer
    Gannot, Sharon
    Habets, Emanueel A. P.
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 151 - 155
  • [28] Recognition of Dysarthric Speech Using Voice Parameters for Speaker Adaptation and Multi-taper Spectral Estimation
    Bhat, Chitralekha
    Vachhani, Bhavik
    Kopparapu, Sunil
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 228 - 232
  • [29] Deep Learning Post-Filtering Using Multi-Head Attention and Multiresolution Feature Fusion for Image and Intra-Video Quality Enhancement
    Schiopu, Ionut
    Munteanu, Adrian
    SENSORS, 2022, 22 (04)
  • [30] 2-D DOAs estimation in impulsive noise environments using joint diagonalization fractional lower-order spatio-temporal matrices
    TieQi Xia
    Qun Wan
    XueGang Wang
    Yi Zheng
    Science in China Series F: Information Sciences, 2008, 51 : 1585 - 1593