Optimization of learned dictionary for sparse coding in speech processing

被引:11
|
作者
He, Yongjun [1 ]
Sun, Guanglu [1 ]
Han, Jiqing [2 ]
机构
[1] Harbin Univ Sci & Technol, Sch Comp Sci & Technol, Harbin 150080, Peoples R China
[2] Harbin Inst Technol, Harbin 150001, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Sparse coding; Speech denoising; Speech recognition; Dictionary optimization; K-SVD; OVERCOMPLETE DICTIONARIES; REPRESENTATION; ALGORITHM; CLASSIFICATION; REGRESSION; SEPARATION; EQUATIONS; SIGNALS; SYSTEMS;
D O I
10.1016/j.neucom.2015.03.061
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a promising technique, sparse coding has been widely used for the analysis, representation, compression, denoising and separation of speech. This technique needs a good dictionary which contains atoms to represent speech signals. Although many methods have been proposed to learn such a dictionary, there are still two problems. First, unimportant atoms bring a heavy computational load to sparse decomposition and reconstruction, which prevents sparse coding from real-time application. Second, in speech denoising and separation, harmful atoms have no or ignorable contributions to reducing the sparsity degree but increase the source confusion, resulting in severe distortions. To solve these two problems, we first analyze the inherent assumptions of sparse coding and show that distortion can be caused if the assumptions do not hold true. Next, we propose two methods to optimize a given dictionary by removing unimportant atoms and harmful atoms, respectively. Experiments show that the proposed methods can further improve the performance of dictionaries. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:471 / 482
页数:12
相关论文
共 50 条
  • [21] Improved Speaker Verification using Block Sparse Coding over Joint Speaker-Channel Learned Dictionary
    Sreeram, Ganji
    Haris, B. C.
    Sinha, Rohit
    TENCON 2015 - 2015 IEEE REGION 10 CONFERENCE, 2015,
  • [22] Greedy double sparse dictionary learning for sparse representation of speech signals
    Abrol, V.
    Sharma, P.
    Sao, A. K.
    SPEECH COMMUNICATION, 2016, 85 : 71 - 82
  • [23] COVARIATE-DEPENDENT DICTIONARY LEARNING AND SPARSE CODING
    Zhou, Mingyuan
    Yang, Hongxia
    Sapiro, Guillermo
    Dunson, David
    Carin, Lawrence
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5824 - 5827
  • [24] Joint Multiple Dictionary Learning for Tensor Sparse Coding
    Fu, Yifan
    Gao, Junbin
    Sun, Yanfeng
    Hong, Xia
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 2957 - 2964
  • [25] Dictionary Learning for Sparse Coding: Algorithms and Convergence Analysis
    Bao, Chenglong
    Ji, Hui
    Quan, Yuhui
    Shen, Zuowei
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (07) : 1356 - 1369
  • [26] SPARSE CODING AND DICTIONARY LEARNING BASED ON THE MDL PRINCIPLE
    Ramirez, Ignacio
    Sapiro, Guillermo
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 2160 - 2163
  • [27] Adaptive sparse coding on PCA dictionary for image denoising
    Qian Liu
    Caiming Zhang
    Qiang Guo
    Hui Xu
    Yuanfeng Zhou
    The Visual Computer, 2016, 32 : 535 - 549
  • [28] Super-resolution based on sparse dictionary coding
    Li, Min
    Cheng, Jian
    Le, Xiang
    Luo, Huan-Min
    Ruan Jian Xue Bao/Journal of Software, 2012, 23 (05): : 1315 - 1324
  • [29] Layered convolutional dictionary learning for sparse coding itemsets
    Sameen Mansha
    Hoang Thanh Lam
    Hongzhi Yin
    Faisal Kamiran
    Mohsen Ali
    World Wide Web, 2019, 22 : 2225 - 2239
  • [30] Multi-Attributed Dictionary Learning for Sparse Coding
    Chiang, Chen-Kuo
    Su, Te-Feng
    Yen, Chih
    Lai, Shang-Hong
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 1137 - 1144