Multichannel Equalization in the KLT and Frequency Domains With Application to Speech Dereverberation

Cited by: 6

Authors:

Rashobh, Rajan S. [1]

Khong, Andy W. H. [1]

Liu, Di [1]

Affiliations:

[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore

Source:

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2014 / Vol. 22 / No. 03

Funding:

National Research Foundation, Singapore

Keywords:

Acoustic microphone array; multichannel equalization; speech dereverberation; IDENTIFICATION; REVERBERANT; ALGORITHMS; SYSTEMS; SIGNALS;

DOI:

10.1109/TASLP.2013.2297013

Chinese Library Classification (CLC) number:

O42 [Acoustics]

Subject classification code:

070206; 082403

Abstract:

Equalization of acoustic channels usually involves inversion of acoustic impulse responses (AIRs), and generally employs multichannel techniques. In this paper, we propose three equalization algorithms, one in the Karhunen-Loeve transform (KLT) domain and the other two in the frequency domain. Our proposed algorithm in the KLT domain provides a platform to achieve equalization in conjunction with denoising. Existing multiple-input/output inverse theorem (MINT)-based non-adaptive algorithms require the inversion of a matrix whose dimension is proportional to the AIR length, which is computationally expensive. To overcome this limitation, we propose a frequency-domain algorithm that is computationally very efficient and thus can be employed for the equalization of high-order AIRs in practical applications. In addition, the frequency-domain method is more robust to AIR estimation errors. To achieve further reduction in complexity without significant performance degradation, we then propose a modified version of the frequency-domain algorithm.
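
Illustrative note: the abstract's remark about MINT-based equalization can be made concrete with a minimal NumPy sketch of the classical time-domain MINT baseline. This is not any of the three algorithms proposed in the paper. The function names (`convolution_matrix`, `mint_equalizers`), the random 8-tap channels, and the choice of equalizer length are all assumptions made for illustration. The sketch stacks one convolution matrix per channel and solves for the equalizers in the least-squares sense, which shows why the size of the system grows with the AIR length.

```python
import numpy as np

def convolution_matrix(h, n_cols):
    """Build C of shape (len(h) + n_cols - 1, n_cols) so that C @ g == np.convolve(h, g)."""
    L = len(h)
    C = np.zeros((L + n_cols - 1, n_cols))
    for j in range(n_cols):
        C[j:j + L, j] = h  # each column is h shifted down by j samples
    return C

def mint_equalizers(airs, eq_len, delay=0):
    """Least-squares MINT: find g_m such that sum_m (h_m * g_m) approximates a delayed impulse."""
    # Stack the per-channel convolution matrices side by side: H = [H_1 ... H_M].
    H = np.hstack([convolution_matrix(h, eq_len) for h in airs])
    d = np.zeros(H.shape[0])
    d[delay] = 1.0  # target response: unit impulse at the chosen delay
    g, *_ = np.linalg.lstsq(H, d, rcond=None)
    return g.reshape(len(airs), eq_len)

# Hypothetical example: M = 2 random channels of length L = 8.
rng = np.random.default_rng(0)
airs = rng.standard_normal((2, 8))
eq_len = 7  # (L - 1) / (M - 1), the shortest length that makes H square here
g = mint_equalizers(airs, eq_len)

# Sum of the channel/equalizer convolutions; ideally a unit impulse.
equalized = sum(np.convolve(h, gm) for h, gm in zip(airs, g))
```

In this toy setting the stacked matrix is only 14 x 14. For room responses with thousands of taps the same matrix has thousands of rows and columns, which is the computational cost the abstract refers to.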


Pages: 634-646

Number of pages: 13

50 items in total

[31] ROBUST SPARSITY-PROMOTING ACOUSTIC MULTI-CHANNEL EQUALIZATION FOR SPEECH DEREVERBERATION
Kodrasi, Ina
Jukic, Ante
Doclo, Simon
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 166 - 170
[32] MULTICHANNEL SPEECH DEREVERBERATION AND SEPARATION WITH OPTIMIZED COMBINATION OF LINEAR AND NON-LINEAR FILTERING
Togami, Masahito
Kawaguchi, Yohei
Takeda, Ryu
Obuchi, Yasunari
Nukaga, Nobuo
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4057 - 4060
[33] Joint Multichannel Blind Speech Separation and Dereverberation: A Real-Time Algorithmic Implementation
Rotili, Rudy
De Simone, Claudio
Perelli, Alessandro
Cifani, Simone
Squartini, Stefano
ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, 2010, 93 : 85 - 93
[34] SpatialNet: Extensively Learning Spatial Information for Multichannel Joint Speech Separation, Denoising and Dereverberation
Quan, Changsheng
Li, Xiaofei
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1310 - 1323
[35] Enhanced Multichannel Histogram Equalization for Speech Recognition in noisy acoustic conditions
Principi, Emanuele
Rotili, Rudy
Squartini, Stefano
NEURAL NETS WIRN11, 2011, 234 : 149 - 161
[36] An Assessment of the Improvement Potential of Time-Frequency Masking for Speech Dereverberation
Zheng, Chenxi
Falk, Tiago H.
Chan, Wai-Yip
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 212 - +
[37] Blind deconvolution using Bayesian methods with application to the dereverberation of speech
Daly, MJ
Reilly, JP
2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SENSOR ARRAY AND MULTICHANNEL SIGNAL PROCESSING SIGNAL PROCESSING THEORY AND METHODS, 2004, : 1009 - 1012
[38] A Time-Varying Forgetting Factor-Based QRRLS Algorithm for Multichannel Speech Dereverberation
Tang, Xinyu
Xu, Yang
Chen, Rilin
Zhou, Yi
2020 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT 2020), 2020,
[39] Time-Frequency Masking in the Complex Domain for Speech Dereverberation and Denoising
Williamson, Donald S.
Wang, DeLiang
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (07) : 1492 - 1501
[40] Frequency-domain Dereverberation on Speech Signal using Surround Retinex
Zhang, Mingming
Li, Weifeng
Wang, Longbiao
Wei, Jianguo
Wu, Zhiyong
Liao, Qingmin
2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,
