A Hybrid Reverberation Model and Its Application to Joint Speech Dereverberation and Separation

被引：0

作者：

Liu, Tongzheng ^{[1
]}

Lu, Zhihua ^{[1
]}

da Costa, Joao Paulo J. ^{[2
]}

Fei, Tai ^{[3
]}

机构：

[1] Ningbo Univ, Coll Informat Sci & Engn, Ningbo 315211, Peoples R China

[2] Hamm Lippstadt Univ Appl Sci HSHL, Dept Lippstadt 2, D-59063 Hamm, Germany

[3] HELLA GmbH & Co KGaA, D-59552 Lippstadt, Germany

来源：

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2023年 / 31卷

基金：

中国国家自然科学基金;

关键词：

Reverberation model; dereverberation; speech separation; blind source separation; multichannel nonnegative matrix factorization; microphone array; BLIND SOURCE SEPARATION; NONNEGATIVE MATRIX FACTORIZATION; INDEPENDENT VECTOR EXTRACTION; NOISE-REDUCTION; ALGORITHMS; CANCELLATION; ENHANCEMENT; MIXTURES;

D O I：

10.1109/TASLP.2023.3301227

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This article proposes a hybrid reverberation model by integrating two conventional models, namely, the multichannel linear prediction (MCLP) model and the spatial coherence model. The late reverberation is divided into two components. One component is modeled using an MCLP model, and the other is modeled using the spatial coherence model. In contrast with the conventional models, the proposed hybrid model increases modeling capacity, especially in the case of long reverberation time. In order to optimally estimate model parameters, joint speech dereverberation and separation is taken into account. The hybrid reverberation model is then used in conjunction with the multichannel nonnegative matrix factorization (MNMF). The method called Hybrid-FastMNMF is proposed by treating the reverberation component modeled by the spatial coherence model as a noise source and estimating its parameters similarly to speech sources. Furthermore, prior knowledge of the spatial coherence matrix is employed to whiten the observations, resulting in another method called Hybrid-FastMNMF-W. Experimental findings demonstrate the proposed methods' superior performance in terms of joint speech dereverberation and separation, and they further justify the efficiency of the proposed hybrid reverberation model.

引用

页码：3000 / 3014

页数：15

共 50 条

[31] Joint Noise Reduction and Dereverberation of Speech Using Hybrid TF-GSC and Adaptive MMSE Estimator
Dashtbozorg, Behdad
Abutalebi, Hamid Reza
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1335 - 1338
[32] Single channel speech dereverberation and separation using RPCA and SNMF
Ullah, Rizwan
Islam, Md Shohidul
Hossain, Md. Imran
Wahab, Fazal E.
Ye, Zhongfu
APPLIED ACOUSTICS, 2020, 167
[33] A Semi-blind Source Separation Approach for Speech Dereverberation
Wang, Ziteng
Na, Yueyue
Liu, Zhang
Li, Yun
Tian, Biao
Fu, Qiang
INTERSPEECH 2020, 2020, : 3925 - 3929
[34] Joint source-channel modeling and estimation for speech dereverberation
Juang, Biing-Hwang
Nakatani, Tomohiro
2007 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, 2007, : 2990 - 2993
[35] JOINT SEPARATION AND DEREVERBERATION OF REVERBERANT MIXTURES WITH MULTICHANNEL VARIATIONAL AUTOENCODER
Inoue, Shota
Kameoka, Hirokazu
Li, Li
Seki, Shogo
Makino, Shoji
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 96 - 100
[36] An EM Algorithm for Joint Dual-Speaker Separation and Dereverberation
Cohen, Nili
Hazan, Gershon
Schwartz, Boaz
Gannot, Sharon
2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
[37] TOWARDS MULTI-MICROPHONE SPEECH DEREVERBERATION USING SPECTRAL ENHANCEMENT AND STATISTICAL REVERBERATION MODELS
Habets, Emanuel A. P.
2008 42ND ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, VOLS 1-4, 2008, : 806 - 810
[38] Joint Blind Source Separation and Dereverberation for Automatic Speech Recognition using Delayed-Subsource MNMF with Localization Prior
Fras, Mieszko
Witkowski, Marcin
Kowalczyk, Konrad
INTERSPEECH 2023, 2023, : 3734 - 3738
[39] Joint System for Speech Separation from Speaking and Non-speaking Background, and De-reverberation: Application on Real-World Recordings
Wiem, Belhedi
Anouar, Ben Messaoud Mohamed
Aicha, Bouzid
2017 3RD INTERNATIONAL CONFERENCE ON FRONTIERS OF SIGNAL PROCESSING (ICFSP), 2017, : 30 - 34
[40] Joint Training for Simultaneous Speech Denoising and Dereverberation with Deep Embedding Representations
Fan, Cunhang
Tao, Jianhua
Liu, Bin
Yi, Jiangyan
Wen, Zhengqi
INTERSPEECH 2020, 2020, : 4536 - 4540

← 1 2 3 4 5 →