Optimization of learned dictionary for sparse coding in speech processing

Cited by: 11
Authors
He, Yongjun [1]
Sun, Guanglu [1]
Han, Jiqing [2]
Affiliations
[1] Harbin Univ Sci & Technol, Sch Comp Sci & Technol, Harbin 150080, Peoples R China
[2] Harbin Inst Technol, Harbin 150001, Peoples R China
Funding
National Natural Science Foundation of China; China Postdoctoral Science Foundation
Keywords
Sparse coding; Speech denoising; Speech recognition; Dictionary optimization; K-SVD; OVERCOMPLETE DICTIONARIES; REPRESENTATION; ALGORITHM; CLASSIFICATION; REGRESSION; SEPARATION; EQUATIONS; SIGNALS; SYSTEMS;
DOI
10.1016/j.neucom.2015.03.061
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Sparse coding is a promising technique that has been widely used for the analysis, representation, compression, denoising and separation of speech. It requires a good dictionary whose atoms can represent speech signals. Although many methods have been proposed to learn such a dictionary, two problems remain. First, unimportant atoms impose a heavy computational load on sparse decomposition and reconstruction, which prevents sparse coding from being used in real-time applications. Second, in speech denoising and separation, harmful atoms contribute little or nothing to reducing the sparsity degree but increase source confusion, resulting in severe distortion. To solve these two problems, we first analyze the inherent assumptions of sparse coding and show that distortion can arise when these assumptions do not hold. Next, we propose two methods to optimize a given dictionary by removing unimportant atoms and harmful atoms, respectively. Experiments show that the proposed methods can further improve the performance of dictionaries. © 2015 Elsevier B.V. All rights reserved.
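As a rough illustration of what removing unimportant atoms can mean, the sketch below codes synthetic signals with a plain orthogonal matching pursuit and keeps only the most used atoms. It is not the authors' method: the usage criterion (total absolute coefficient), the function names `omp` and `prune_by_usage`, and the random data are all assumptions made for this example.

```python
import numpy as np

def omp(D, x, k):
    """Greedy orthogonal matching pursuit: code x with at most k columns (atoms) of D."""
    residual = x.copy()
    support = []
    coef = np.zeros(D.shape[1])
    for _ in range(k):
        # Pick the atom most correlated with the current residual.
        j = int(np.argmax(np.abs(D.T @ residual)))
        if j in support:
            break
        support.append(j)
        # Re-fit all selected atoms jointly by least squares.
        sol, *_ = np.linalg.lstsq(D[:, support], x, rcond=None)
        residual = x - D[:, support] @ sol
    if support:
        coef[support] = sol
    return coef

def prune_by_usage(D, X, k, keep):
    """Assumed criterion: keep the `keep` atoms with the largest total |coefficient| over X."""
    A = np.column_stack([omp(D, x, k) for x in X.T])
    usage = np.abs(A).sum(axis=1)
    kept = np.sort(np.argsort(usage)[::-1][:keep])
    return D[:, kept], kept

# Synthetic data (hypothetical): 40 unit-norm atoms, signals built from the first 10 only.
rng = np.random.default_rng(0)
D = rng.standard_normal((16, 40))
D /= np.linalg.norm(D, axis=0)
codes = np.zeros((40, 200))
for n in range(200):
    idx = rng.choice(10, size=2, replace=False)
    codes[idx, n] = rng.standard_normal(2)
X = D @ codes

D_small, kept = prune_by_usage(D, X, k=2, keep=10)
```

A smaller dictionary reduces the cost of each correlation step `D.T @ residual`, which is the kind of saving the abstract refers to; how the paper actually ranks atoms is not stated in this record.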
Pages: 471-482
Number of pages: 12