Multichannel Equalization in the KLT and Frequency Domains With Application to Speech Dereverberation

Cited by: 6
Authors
Rashobh, Rajan S. [1 ]
Khong, Andy W. H. [1 ]
Liu, Di [1 ]
Affiliations
[1] Nanyang Technological University, School of Electrical and Electronic Engineering, Singapore 639798, Singapore
Funding
National Research Foundation, Singapore;
Keywords
Acoustic microphone array; multichannel equalization; speech dereverberation; IDENTIFICATION; REVERBERANT; ALGORITHMS; SYSTEMS; SIGNALS;
DOI
10.1109/TASLP.2013.2297013
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
Equalization of acoustic channels usually involves inverting acoustic impulse responses (AIRs) and generally employs multichannel techniques. In this paper, we propose three equalization algorithms: one in the Karhunen-Loève transform (KLT) domain and two in the frequency domain. The proposed KLT-domain algorithm provides a platform for achieving equalization in conjunction with denoising. Existing non-adaptive algorithms based on the multiple-input/output inverse theorem (MINT) require the inversion of a matrix whose dimension is proportional to the AIR length, which is computationally expensive. To overcome this limitation, we propose a frequency-domain algorithm that is computationally efficient and can therefore be employed to equalize high-order AIRs in practical applications. In addition, the frequency-domain method is more robust to AIR estimation errors. To reduce complexity further without significant performance degradation, we then propose a modified version of the frequency-domain algorithm.
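For readers unfamiliar with the MINT baseline mentioned in the abstract, the following is a minimal sketch of classical time-domain MINT equalization, not of the KLT-domain or frequency-domain algorithms proposed in the paper. It assumes the AIRs are known exactly, noise-free, and share no common zeros; the function names (`convolution_matrix`, `mint_inverse_filters`, `equalized_response`) and the random test channels are hypothetical and chosen only for illustration. The matrix `H` built here grows with the AIR length, which is the cost the abstract refers to.

```python
import numpy as np

def convolution_matrix(h, n_cols):
    """Toeplitz matrix C such that C @ g equals np.convolve(h, g) for len(g) == n_cols."""
    n_rows = len(h) + n_cols - 1
    C = np.zeros((n_rows, n_cols))
    for j in range(n_cols):
        C[j:j + len(h), j] = h
    return C

def mint_inverse_filters(airs, filter_len, delay=0):
    """Solve H g = d in the least-squares sense, where d is a (delayed) unit impulse."""
    # Stack one convolution matrix per channel side by side: H = [H_1 ... H_M].
    H = np.hstack([convolution_matrix(h, filter_len) for h in airs])
    d = np.zeros(H.shape[0])
    d[delay] = 1.0
    g, *_ = np.linalg.lstsq(H, d, rcond=None)
    # Row m of the result holds the inverse filter for channel m.
    return g.reshape(len(airs), filter_len)

def equalized_response(airs, filters):
    """Sum over channels of each AIR convolved with its inverse filter."""
    return sum(np.convolve(h, g) for h, g in zip(airs, filters))

# Illustrative setup (assumption): two random channels of length 8.
rng = np.random.default_rng(0)
L, M = 8, 2
airs = rng.standard_normal((M, L))

# With M = 2, a filter length of L - 1 makes H square (14 x 14 here).
filters = mint_inverse_filters(airs, filter_len=L - 1)
eir = equalized_response(airs, filters)
print(np.round(eir, 6))
```

Under these assumptions the equalized impulse response should be close to a unit impulse. For room AIRs with thousands of taps, `H` becomes correspondingly large, which illustrates why a cheaper alternative is attractive.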
Pages: 634 - 646
Number of pages: 13
Related papers
50 items in total
  • [41] Multichannel speech separation and localization by frequency assignment
    Ihara, Takehiro
    Handa, Masaki
    Nagai, Takayuki
    Kurematsu, Akira
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 2007, 90 (01): : 59 - 70
  • [42] Alpha-Stable Autoregressive Fast Multichannel Nonnegative Matrix Factorization for Joint Speech Enhancement and Dereverberation
    Fontaine, Mathieu
    Sekiguchi, Kouhei
    Nugraha, Aditya Arie
    Bando, Yoshiaki
    Yoshii, Kazuyoshi
    INTERSPEECH 2021, 2021, : 661 - 665
  • [43] A Hybrid Reverberation Model and Its Application to Joint Speech Dereverberation and Separation
    Liu, Tongzheng
    Lu, Zhihua
    da Costa, Joao Paulo J.
    Fei, Tai
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 3000 - 3014
  • [44] Multichannel Linear Prediction-Based Speech Dereverberation Considering Sparse and Low-Rank Priors
    Wang, Taihui
    Yang, Feiran
    Yang, Jun
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1724 - 1735
  • [45] Kronecker Product Multichannel Linear Filtering for Adaptive Weighted Prediction Error-Based Speech Dereverberation
    Huang, Gongping
    Benesty, Jacob
    Cohen, Israel
    Chen, Jingdong
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 1277 - 1289
  • [46] Speech Enhancement by Denoising and Dereverberation Using a Generalized Sidelobe Canceller-Based Multichannel Wiener Filter
    Bai, Mingsian R.
    Kung, Fan-Jie
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2022, 70 (03): : 140 - 155
  • [47] Speech Dereverberation Based on Generative Adversarial Network with Additive Frequency Domain Decomposition
    Quan H.
    Wang T.
    Zheng Z.
    Gongcheng Kexue Yu Jishu/Advanced Engineering Sciences, 2022, 54 (02): : 180 - 187
  • [48] Multichannel Online Blind Speech Dereverberation with Marginalization of Static Observation Parameters in a Rao-Blackwellized Particle Filter
    Evers, Christine
    Hopgood, James R.
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2011, 63 (03): : 315 - 332
  • [50] Complex-Valued Time-Frequency Self-Attention for Speech Dereverberation
    Kothapally, Vinay
    Hansen, John H. L.
    INTERSPEECH 2022, 2022, : 2543 - 2547