Optimization of learned dictionary for sparse coding in speech processing

被引:11
|
作者
He, Yongjun [1 ]
Sun, Guanglu [1 ]
Han, Jiqing [2 ]
机构
[1] Harbin Univ Sci & Technol, Sch Comp Sci & Technol, Harbin 150080, Peoples R China
[2] Harbin Inst Technol, Harbin 150001, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Sparse coding; Speech denoising; Speech recognition; Dictionary optimization; K-SVD; OVERCOMPLETE DICTIONARIES; REPRESENTATION; ALGORITHM; CLASSIFICATION; REGRESSION; SEPARATION; EQUATIONS; SIGNALS; SYSTEMS;
D O I
10.1016/j.neucom.2015.03.061
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a promising technique, sparse coding has been widely used for the analysis, representation, compression, denoising and separation of speech. This technique needs a good dictionary which contains atoms to represent speech signals. Although many methods have been proposed to learn such a dictionary, there are still two problems. First, unimportant atoms bring a heavy computational load to sparse decomposition and reconstruction, which prevents sparse coding from real-time application. Second, in speech denoising and separation, harmful atoms have no or ignorable contributions to reducing the sparsity degree but increase the source confusion, resulting in severe distortions. To solve these two problems, we first analyze the inherent assumptions of sparse coding and show that distortion can be caused if the assumptions do not hold true. Next, we propose two methods to optimize a given dictionary by removing unimportant atoms and harmful atoms, respectively. Experiments show that the proposed methods can further improve the performance of dictionaries. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:471 / 482
页数:12
相关论文
共 50 条
  • [31] Sparse coding and dictionary learning for electron hologram denoising
    Anada, Satoshi
    Nomura, Yuki
    Hirayama, Tsukasa
    Yamamoto, Kazuo
    ULTRAMICROSCOPY, 2019, 206
  • [32] Kernel Regularized Nonlinear Dictionary Learning for Sparse Coding
    Liu, Huaping
    Liu, He
    Sun, Fuchun
    Fang, Bin
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2019, 49 (04): : 766 - 775
  • [33] Adaptive sparse coding on PCA dictionary for image denoising
    Liu, Qian
    Zhang, Caiming
    Guo, Qiang
    Xu, Hui
    Zhou, Yuanfeng
    VISUAL COMPUTER, 2016, 32 (04): : 535 - 549
  • [34] A Convergent Incoherent Dictionary Learning Algorithm for Sparse Coding
    Bao, Chenglong
    Quan, Yuhui
    Ji, Hui
    COMPUTER VISION - ECCV 2014, PT VI, 2014, 8694 : 302 - 316
  • [35] Layered convolutional dictionary learning for sparse coding itemsets
    Mansha, Sameen
    Hoang Thanh Lam
    Yin, Hongzhi
    Kamiran, Faisal
    Ali, Mohsen
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2019, 22 (05): : 2225 - 2239
  • [36] Weak correlation dictionary construction method for sparse coding
    Long H.
    Zhuo L.
    Qu P.
    Zhang J.
    Journal of Shanghai Jiaotong University (Science), 2017, 22 (1) : 77 - 81
  • [37] Sparse Coding and Dictionary Learning with Linear Dynamical Systems
    Huang, Wenbing
    Sun, Fuchun
    Cao, Lele
    Zhao, Deli
    Liu, Huaping
    Harandi, Mehrtash
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3938 - 3947
  • [38] Weak Correlation Dictionary Construction Method for Sparse Coding
    龙海霞
    卓力
    屈盼玲
    张菁
    Journal of Shanghai Jiaotong University(Science), 2017, 22 (01) : 77 - 81
  • [39] Fast Dictionary Learning for Sparse Representations of Speech Signals
    Jafari, Maria G.
    Plumbley, Mark D.
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2011, 5 (05) : 1025 - 1031
  • [40] Speech separation using an adaptive sparse dictionary algorithm
    Jafari, Maria G.
    Plumbley, Mark D.
    Davies, Mike E.
    2008 HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS, 2008, : 26 - +