Multichannel Equalization in the KLT and Frequency Domains With Application to Speech Dereverberation

Cited by: 6
Authors
Rashobh, Rajan S. [1 ]
Khong, Andy W. H. [1 ]
Liu, Di [1 ]
Affiliation
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
Funding
National Research Foundation, Singapore;
Keywords
Acoustic microphone array; multichannel equalization; speech dereverberation; IDENTIFICATION; REVERBERANT; ALGORITHMS; SYSTEMS; SIGNALS;
DOI
10.1109/TASLP.2013.2297013
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
Equalization of acoustic channels usually involves inverting acoustic impulse responses (AIRs) and generally employs multichannel techniques. In this paper, we propose three equalization algorithms: one in the Karhunen-Loeve transform (KLT) domain and two in the frequency domain. The proposed KLT-domain algorithm provides a platform for achieving equalization in conjunction with denoising. Existing non-adaptive algorithms based on the multiple-input/output inverse theorem (MINT) require the inversion of a matrix whose dimension is proportional to the AIR length, which is computationally expensive. To overcome this limitation, we propose a frequency-domain algorithm that is computationally efficient and can therefore be employed to equalize high-order AIRs in practical applications. In addition, the frequency-domain method is more robust to AIR estimation errors. To reduce the complexity further without significant performance degradation, we then propose a modified version of the frequency-domain algorithm.
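For orientation, the MINT baseline that the abstract contrasts against can be sketched as a least-squares problem. This is a minimal illustration of the generic time-domain MINT formulation, not the KLT-domain or frequency-domain algorithms proposed in the paper. The function names (`convolution_matrix`, `mint_equalizers`) and the two short example AIRs are assumptions made up for this sketch.

```python
import numpy as np

def convolution_matrix(h, n_cols):
    """Return the matrix C such that C @ g equals np.convolve(h, g) for len(g) == n_cols."""
    h = np.asarray(h, dtype=float)
    C = np.zeros((len(h) + n_cols - 1, n_cols))
    for j in range(n_cols):
        C[j:j + len(h), j] = h  # column j holds h shifted down by j samples
    return C

def mint_equalizers(airs, eq_len, delay=0):
    """Solve [H_1 ... H_M] g = d in the least-squares sense (hypothetical helper).

    airs   : list of M equal-length impulse responses
    eq_len : length of each equalizing filter
    delay  : index of the single nonzero tap in the target response d
    """
    H = np.hstack([convolution_matrix(h, eq_len) for h in airs])
    d = np.zeros(H.shape[0])
    d[delay] = 1.0
    # The matrix size grows with the AIR length, which is the cost the abstract refers to.
    g, *_ = np.linalg.lstsq(H, d, rcond=None)
    return g.reshape(len(airs), eq_len)

# Two illustrative 3-tap AIRs with no common zeros (assumed values, not from the paper).
airs = [np.array([1.0, 0.5, 0.2]), np.array([1.0, -0.4, 0.1])]
g = mint_equalizers(airs, eq_len=2)

# Sum of the channel-equalizer convolutions; ideally a unit impulse.
equalized = sum(np.convolve(h, gm) for h, gm in zip(airs, g))
```

With two channels of length L, choosing `eq_len = L - 1` makes the stacked matrix square, so an exact inverse exists whenever the channels share no common zeros.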
Pages: 634 - 646
Page count: 13
Related papers
50 in total
  • [31] ROBUST SPARSITY-PROMOTING ACOUSTIC MULTI-CHANNEL EQUALIZATION FOR SPEECH DEREVERBERATION
    Kodrasi, Ina
    Jukic, Ante
    Doclo, Simon
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 166 - 170
  • [32] MULTICHANNEL SPEECH DEREVERBERATION AND SEPARATION WITH OPTIMIZED COMBINATION OF LINEAR AND NON-LINEAR FILTERING
    Togami, Masahito
    Kawaguchi, Yohei
    Takeda, Ryu
    Obuchi, Yasunari
    Nukaga, Nobuo
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4057 - 4060
  • [33] Joint Multichannel Blind Speech Separation and Dereverberation: A Real-Time Algorithmic Implementation
    Rotili, Rudy
    De Simone, Claudio
    Perelli, Alessandro
    Cifani, Simone
    Squartini, Stefano
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, 2010, 93 : 85 - 93
  • [34] SpatialNet: Extensively Learning Spatial Information for Multichannel Joint Speech Separation, Denoising and Dereverberation
    Quan, Changsheng
    Li, Xiaofei
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1310 - 1323
  • [35] Enhanced Multichannel Histogram Equalization for Speech Recognition in noisy acoustic conditions
    Principi, Emanuele
    Rotili, Rudy
    Squartini, Stefano
    NEURAL NETS WIRN11, 2011, 234 : 149 - 161
  • [36] An Assessment of the Improvement Potential of Time-Frequency Masking for Speech Dereverberation
    Zheng, Chenxi
    Falk, Tiago H.
    Chan, Wai-Yip
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 212 - +
  • [37] Blind deconvolution using Bayesian methods with application to the dereverberation of speech
    Daly, MJ
    Reilly, JP
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SENSOR ARRAY AND MULTICHANNEL SIGNAL PROCESSING SIGNAL PROCESSING THEORY AND METHODS, 2004, : 1009 - 1012
  • [38] A Time-Varying Forgetting Factor-Based QRRLS Algorithm for Multichannel Speech Dereverberation
    Tang, Xinyu
    Xu, Yang
    Chen, Rilin
    Zhou, Yi
    2020 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT 2020), 2020,
  • [39] Time-Frequency Masking in the Complex Domain for Speech Dereverberation and Denoising
    Williamson, Donald S.
    Wang, DeLiang
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (07) : 1492 - 1501
  • [40] Frequency-domain Dereverberation on Speech Signal using Surround Retinex
    Zhang, Mingming
    Li, Weifeng
    Wang, Longbiao
    Wei, Jianguo
    Wu, Zhiyong
    Liao, Qingmin
    2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,