A Hybrid Reverberation Model and Its Application to Joint Speech Dereverberation and Separation

Cited by: 0
Authors
Liu, Tongzheng [1]
Lu, Zhihua [1]
da Costa, Joao Paulo J. [2]
Fei, Tai [3]
Affiliations
[1] Ningbo Univ, Coll Informat Sci & Engn, Ningbo 315211, Peoples R China
[2] Hamm Lippstadt Univ Appl Sci HSHL, Dept Lippstadt 2, D-59063 Hamm, Germany
[3] HELLA GmbH & Co KGaA, D-59552 Lippstadt, Germany
Funding
National Natural Science Foundation of China
Keywords
Reverberation model; dereverberation; speech separation; blind source separation; multichannel nonnegative matrix factorization; microphone array; BLIND SOURCE SEPARATION; NONNEGATIVE MATRIX FACTORIZATION; INDEPENDENT VECTOR EXTRACTION; NOISE-REDUCTION; ALGORITHMS; CANCELLATION; ENHANCEMENT; MIXTURES;
DOI
10.1109/TASLP.2023.3301227
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
This article proposes a hybrid reverberation model by integrating two conventional models, namely, the multichannel linear prediction (MCLP) model and the spatial coherence model. The late reverberation is divided into two components. One component is modeled using an MCLP model, and the other is modeled using the spatial coherence model. In contrast with the conventional models, the proposed hybrid model increases modeling capacity, especially in the case of long reverberation time. In order to optimally estimate model parameters, joint speech dereverberation and separation is taken into account. The hybrid reverberation model is then used in conjunction with the multichannel nonnegative matrix factorization (MNMF). The method called Hybrid-FastMNMF is proposed by treating the reverberation component modeled by the spatial coherence model as a noise source and estimating its parameters similarly to speech sources. Furthermore, prior knowledge of the spatial coherence matrix is employed to whiten the observations, resulting in another method called Hybrid-FastMNMF-W. Experimental findings demonstrate the proposed methods' superior performance in terms of joint speech dereverberation and separation, and they further justify the efficiency of the proposed hybrid reverberation model.
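The whitening step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state which spatial coherence matrix the authors use, so the code below assumes the textbook model of an ideal spherically isotropic (diffuse) sound field, where the coherence between two microphones at distance d is sin(2πfd/c)/(2πfd/c), and it whitens with a Cholesky factor. The function names `diffuse_coherence` and `whiten`, the diagonal loading `reg`, and the array geometry are all hypothetical and are not taken from the paper.

```python
import numpy as np


def diffuse_coherence(mic_positions, freq_hz, c=343.0, reg=1e-3):
    """Coherence matrix of an ideal diffuse field (assumed model, not the paper's)."""
    pos = np.asarray(mic_positions, dtype=float)
    # Pairwise microphone distances in metres, shape (M, M).
    dist = np.linalg.norm(pos[:, None, :] - pos[None, :, :], axis=-1)
    # np.sinc(x) = sin(pi*x)/(pi*x), so this is sin(2*pi*f*d/c) / (2*pi*f*d/c).
    gamma = np.sinc(2.0 * freq_hz * dist / c)
    # Diagonal loading (an assumption) keeps the matrix well conditioned.
    return gamma + reg * np.eye(len(pos))


def whiten(x, gamma):
    """Map observations x of shape (M, T) so a component with covariance gamma becomes white."""
    chol = np.linalg.cholesky(gamma)  # gamma = chol @ chol^H
    return np.linalg.solve(chol, x)   # chol^{-1} @ x


# Hypothetical 3-microphone linear array with 5 cm spacing, one bin at 1 kHz.
mics = [[0.0, 0.0, 0.0], [0.05, 0.0, 0.0], [0.10, 0.0, 0.0]]
gamma = diffuse_coherence(mics, freq_hz=1000.0)

# Sanity check: chol^{-1} @ gamma @ chol^{-H} should be the identity.
chol = np.linalg.cholesky(gamma)
whitened_cov = np.linalg.solve(chol, np.linalg.solve(chol, gamma).conj().T)

# Whitening a block of synthetic observations (random data, for shape only).
rng = np.random.default_rng(0)
x = rng.standard_normal((3, 100)) + 1j * rng.standard_normal((3, 100))
x_white = whiten(x, gamma)
```

After this transform, a component whose spatial covariance is proportional to `gamma` has an identity-shaped covariance, which is the general idea behind using prior knowledge of the coherence matrix to whiten the observations. How Hybrid-FastMNMF-W integrates this step with its parameter updates is not described in the abstract.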
Pages: 3000 - 3014
Page count: 15
Related papers
50 in total
  • [31] Joint Noise Reduction and Dereverberation of Speech Using Hybrid TF-GSC and Adaptive MMSE Estimator
    Dashtbozorg, Behdad
    Abutalebi, Hamid Reza
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1335 - 1338
  • [32] Single channel speech dereverberation and separation using RPCA and SNMF
    Ullah, Rizwan
    Islam, Md Shohidul
    Hossain, Md. Imran
    Wahab, Fazal E.
    Ye, Zhongfu
    APPLIED ACOUSTICS, 2020, 167
  • [33] A Semi-blind Source Separation Approach for Speech Dereverberation
    Wang, Ziteng
    Na, Yueyue
    Liu, Zhang
    Li, Yun
    Tian, Biao
    Fu, Qiang
    INTERSPEECH 2020, 2020, : 3925 - 3929
  • [34] Joint source-channel modeling and estimation for speech dereverberation
    Juang, Biing-Hwang
    Nakatani, Tomohiro
    2007 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, 2007, : 2990 - 2993
  • [35] JOINT SEPARATION AND DEREVERBERATION OF REVERBERANT MIXTURES WITH MULTICHANNEL VARIATIONAL AUTOENCODER
    Inoue, Shota
    Kameoka, Hirokazu
    Li, Li
    Seki, Shogo
    Makino, Shoji
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 96 - 100
  • [36] An EM Algorithm for Joint Dual-Speaker Separation and Dereverberation
    Cohen, Nili
    Hazan, Gershon
    Schwartz, Boaz
    Gannot, Sharon
    2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [37] TOWARDS MULTI-MICROPHONE SPEECH DEREVERBERATION USING SPECTRAL ENHANCEMENT AND STATISTICAL REVERBERATION MODELS
    Habets, Emanuel A. P.
    2008 42ND ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, VOLS 1-4, 2008, : 806 - 810
  • [38] Joint Blind Source Separation and Dereverberation for Automatic Speech Recognition using Delayed-Subsource MNMF with Localization Prior
    Fras, Mieszko
    Witkowski, Marcin
    Kowalczyk, Konrad
    INTERSPEECH 2023, 2023, : 3734 - 3738
  • [39] Joint System for Speech Separation from Speaking and Non-speaking Background, and De-reverberation: Application on Real-World Recordings
    Wiem, Belhedi
    Anouar, Ben Messaoud Mohamed
    Aicha, Bouzid
    2017 3RD INTERNATIONAL CONFERENCE ON FRONTIERS OF SIGNAL PROCESSING (ICFSP), 2017, : 30 - 34
  • [40] Joint Training for Simultaneous Speech Denoising and Dereverberation with Deep Embedding Representations
    Fan, Cunhang
    Tao, Jianhua
    Liu, Bin
    Yi, Jiangyan
    Wen, Zhengqi
    INTERSPEECH 2020, 2020, : 4536 - 4540