SPEECH ENHANCEMENT WITH SPARSE CODING IN LEARNED DICTIONARIES

被引:39
|
作者
Sigg, Christian D. [1 ]
Dikk, Tomas [1 ]
Buhmann, Joachim M. [1 ]
机构
[1] Swiss Fed Inst Technol, Dept Comp Sci, Zurich, Switzerland
关键词
Speech Enhancement; Dictionary Learning; Sparse Coding; Source Separation;
D O I
10.1109/ICASSP.2010.5495157
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The enhancement of speech degraded by non-stationary interferers is a highly relevant and difficult task of many signal processing applications. We present a monaural speech enhancement method based on sparse coding of noisy speech signals in a composite dictionary, consisting of the concatenation of a speech and interferer dictionary, both being possibly over-complete. The speech dictionary is learned off-line on a training corpus, while an environment specific interferer dictionary is learned on-line during speech pauses. Our approach optimizes the trade-off between source distortion and source confusion, and thus achieves significant improvements on objective quality measures like cepstral distance, in the speaker dependent and independent case, in several real-world environments and at low signal-to-noise ratios. Our enhancement method outperforms state-of-the-art methods like multi-band spectral subtraction and approaches based on vector quantization.
引用
收藏
页码:4758 / 4761
页数:4
相关论文
共 50 条
  • [1] SPARSE STEREO IMAGE CODING WITH LEARNED DICTIONARIES
    Palaz, Dimitri
    Tosic, Ivana
    Frossard, Pascal
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011, : 133 - 136
  • [2] Sparse image coding using learned overcomplete dictionaries
    Murray, JF
    Kreutz-Delgado, K
    MACHINE LEARNING FOR SIGNAL PROCESSING XIV, 2004, : 579 - 588
  • [3] Robust Speaker Verification With Joint Sparse Coding Over Learned Dictionaries
    Haris, B. C.
    Sinha, Rohit
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2015, 10 (10) : 2143 - 2157
  • [4] Learned dictionaries for sparse representation based unit selection speech synthesis
    Sharma, Pulkit
    Abrol, Vinayak
    Sao, Anil Kumar
    2016 TWENTY SECOND NATIONAL CONFERENCE ON COMMUNICATION (NCC), 2016,
  • [5] Optimization of learned dictionary for sparse coding in speech processing
    He, Yongjun
    Sun, Guanglu
    Han, Jiqing
    NEUROCOMPUTING, 2016, 173 : 471 - 482
  • [6] Sparse coding over redundant dictionaries for fast adaptation of speech recognition system
    Shahnawazuddin, S.
    Sinha, Rohit
    COMPUTER SPEECH AND LANGUAGE, 2017, 43 : 1 - 17
  • [7] Spectrum enhancement with sparse coding for robust speech recognition
    He, Yongjun
    Sun, Guanglu
    Han, Jiqing
    DIGITAL SIGNAL PROCESSING, 2015, 43 : 59 - 70
  • [8] Performance Evaluation of CS Based Speech Enhancement using Adaptive and Sparse Dictionaries
    Sridhar, K., V
    Kumar, T. Kishore
    2019 4TH INTERNATIONAL CONFERENCE AND WORKSHOPS ON RECENT ADVANCES AND INNOVATIONS IN ENGINEERING (ICRAIE): THRIVING TECHNOLOGIES, 2019,
  • [9] Sparse Coding with Sparse Dictionaries for Credit Risk Classification
    Mei, Xueyan
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC), VOL 1, 2016, : 23 - 26
  • [10] Speech enhancement via sparse coding with ideal binary mask
    Sun, Juan
    Tang, Yibin
    Jiang, Aimin
    Xu, Ning
    Zhou, Lin
    2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 537 - 540