Optimization of learned dictionary for sparse coding in speech processing

被引:11
|
作者
He, Yongjun [1 ]
Sun, Guanglu [1 ]
Han, Jiqing [2 ]
机构
[1] Harbin Univ Sci & Technol, Sch Comp Sci & Technol, Harbin 150080, Peoples R China
[2] Harbin Inst Technol, Harbin 150001, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Sparse coding; Speech denoising; Speech recognition; Dictionary optimization; K-SVD; OVERCOMPLETE DICTIONARIES; REPRESENTATION; ALGORITHM; CLASSIFICATION; REGRESSION; SEPARATION; EQUATIONS; SIGNALS; SYSTEMS;
D O I
10.1016/j.neucom.2015.03.061
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a promising technique, sparse coding has been widely used for the analysis, representation, compression, denoising and separation of speech. This technique needs a good dictionary which contains atoms to represent speech signals. Although many methods have been proposed to learn such a dictionary, there are still two problems. First, unimportant atoms bring a heavy computational load to sparse decomposition and reconstruction, which prevents sparse coding from real-time application. Second, in speech denoising and separation, harmful atoms have no or ignorable contributions to reducing the sparsity degree but increase the source confusion, resulting in severe distortions. To solve these two problems, we first analyze the inherent assumptions of sparse coding and show that distortion can be caused if the assumptions do not hold true. Next, we propose two methods to optimize a given dictionary by removing unimportant atoms and harmful atoms, respectively. Experiments show that the proposed methods can further improve the performance of dictionaries. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:471 / 482
页数:12
相关论文
共 50 条
  • [1] Dictionary evaluation and optimization for sparse coding based speech processing
    He, Yongjun
    Chen, Deyun
    Sun, Guanglu
    Han, Jiqing
    INFORMATION SCIENCES, 2015, 310 : 77 - 96
  • [2] SPEECH ENHANCEMENT WITH SPARSE CODING IN LEARNED DICTIONARIES
    Sigg, Christian D.
    Dikk, Tomas
    Buhmann, Joachim M.
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4758 - 4761
  • [3] Language Recognition via Sparse Coding over Learned Dictionary
    Singh, Om Prakash
    Sinha, Rohit
    2017 4TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2017, : 494 - 497
  • [4] Sparse coding with adaptive dictionary learning for underdetermined blind speech separation
    Xu, Tao
    Wang, Wenwu
    Dai, Wei
    SPEECH COMMUNICATION, 2013, 55 (03) : 432 - 450
  • [5] LEARNED CONVOLUTIONAL SPARSE CODING
    Sreter, Hillel
    Giryes, Raja
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2191 - 2195
  • [6] Speech enhancement based on discriminative joint sparse dictionary alternate optimization
    Jia H.
    Wang W.
    Wang Y.
    Pei J.
    Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2019, 46 (03): : 74 - 81
  • [7] Rain Detection and Removal via Shrinkage-based Sparse Coding and Learned Rain Dictionary
    Son, Chang-Hwan
    Zhang, Xiao-Ping
    JOURNAL OF IMAGING SCIENCE AND TECHNOLOGY, 2020, 64 (03)
  • [8] Parametric Dictionary Design for Sparse Coding
    Yaghoobi, Mehrdad
    Daudet, Laurent
    Davies, Mike E.
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2009, 57 (12) : 4800 - 4810
  • [9] Submodular Dictionary Learning for Sparse Coding
    Jiang, Zhuolin
    Zhang, Guangxiao
    Davis, Larry S.
    2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 3418 - 3425
  • [10] SPARSE CODING FOR SPEECH RECOGNITION
    Sivaram, G. S. V. S.
    Nemala, Sridhar Krishna
    Elhilali, Mounya
    Trac D. Tran
    Hermansky, Hynek
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4346 - 4349