Multichannel Linear Prediction-Based Speech Dereverberation Considering Sparse and Low-Rank Priors

被引:1
|
作者
Wang, Taihui [1 ,2 ]
Yang, Feiran [3 ]
Yang, Jun [1 ,2 ]
机构
[1] Chinese Acad Sci, Key Lab Noise & Vibrat Res, Inst Acoust, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Chinese Acad Sci, State Key Lab Acoust, Inst Acoust, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Speech processing; Reverberation; Microphones; Time-frequency analysis; Shape; Indexes; Cost function; Speech dereverberation; multichannel linear prediction; weighted prediction error; complex generalized Gaussian; nonnegative matrix factorization; TIME; REVERBERATION; MASKING; DOMAIN; SYSTEM; NOISE;
D O I
10.1109/TASLP.2024.3369535
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This article addresses the multi-channel linear prediction (MCLP)-based speech dereverberation problem by jointly considering the sparsity and low-rank priors of speech spectrograms. We utilize the complex generalized Gaussian (CGG) distribution as the source model and the generalized nonnegative matrix factorization (NMF) as the spectral model. The difference between the presented model and existing ones for MCLP is twofold. First, we adopt the CGG distribution with a time-frequency-variant scale parameter instead of that with a time-frequency-invariant scale parameter. Second, the time-frequency-varying scale parameter is approximated by NMF in a low-rank manner. Based on the maximum-likelihood criterion, speech dereverberation is formulated as an optimization problem that minimizes the prediction error weighted by the reciprocal of sparse and low-rank parameters. A convergence-guaranteed algorithm is derived to estimate the parameters using the majorization-minimization technology. The WPE, NMF-based WPE and CGG-based WPE can be treated as special cases of the proposed method with different shape and domain parameters. As a byproduct, the proposed method provides a simple and elegant way to derive the CGG-based WPE algorithm. A series of experiments show the superiority of the proposed method over WPE, NMF-based WPE and CGG-based WPE methods.
引用
收藏
页码:1724 / 1735
页数:12
相关论文
共 50 条
  • [41] Speech Dereverberation Based on Variance-Normalized Delayed Linear Prediction
    Nakatani, Tomohiro
    Yoshioka, Takuya
    Kinoshita, Keisuke
    Miyoshi, Masato
    Juang, Biing-Hwang
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (07): : 1717 - 1731
  • [42] SPEECH DEREVERBERATION BASED ON LINEAR PREDICTION: AN ACOUSTIC VECTOR SENSOR APPROACH
    Shujau, M.
    Ritz, C. H.
    Burnett, I. S.
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 639 - 643
  • [43] Noncoherent Sparse Subarrays for DOA Estimation Based on Low-Rank and Sparse Recovery
    Leite, Wesley S.
    De Lamare, Rodrigo C.
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 1751 - 1755
  • [44] Speech Enhancement Based on Constrained Low-rank Sparse Matrix Decomposition Integrated with Temporal Continuity Regularisation
    Sun, Chengli
    Yuan, Conglin
    ARCHIVES OF ACOUSTICS, 2019, 44 (04) : 681 - 692
  • [45] ROBUST IMAGE HASHING BASED ON LOW-RANK AND SPARSE DECOMPOSITION
    Li, Yue Nan
    Wang, Ping
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 2154 - 2158
  • [46] Low-Rank and Eigenface Based Sparse Representation for Face Recognition
    Hou, Yi-Fu
    Sun, Zhan-Li
    Chong, Yan-Wen
    Zheng, Chun-Hou
    PLOS ONE, 2014, 9 (10):
  • [47] Sparse photoacoustic microscopy based on low-rank matrix approximation
    刘婷
    孙明健
    冯乃章
    王明华
    陈德应
    沈毅
    ChineseOpticsLetters, 2016, 14 (09) : 66 - 70
  • [48] Sparse photoacoustic microscopy based on low-rank matrix approximation
    Liu, Ting
    Sun, Mingjian
    Feng, Naizhang
    Wang, Minghua
    Chen, Deying
    Shen, Yi
    CHINESE OPTICS LETTERS, 2016, 14 (09)
  • [49] Truncated γ norm-based low-rank and sparse decomposition
    Zhenzhen Yang
    Yongpeng Yang
    Lu Fan
    Bing-Kun Bao
    Multimedia Tools and Applications, 2022, 81 : 38279 - 38295
  • [50] Background Subtraction Based on Low-Rank and Structured Sparse Decomposition
    Liu, Xin
    Zhao, Guoying
    Yao, Jiawen
    Qi, Chun
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (08) : 2502 - 2514