NOISE-ROBUST DIGIT RECOGNITION WITH EXEMPLAR-BASED SPARSE REPRESENTATIONS OF VARIABLE LENGTH

被引:0
|
作者
Yilmaz, Emre [1 ]
Gemmeke, Jort F. [1 ]
Van Compernolle, Dirk [1 ]
Van Hamme, Hugo [1 ]
机构
[1] Katholieke Univ Leuven, Dept ESAT, Louvain, Belgium
关键词
Exemplar-based recognition; noise robustness; non-negative sparse coding; multiple dictionaries; CONTINUOUS SPEECH RECOGNITION; SEPARATION;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper introduces an exemplar-based noise-robust digit recognition system in which noisy speech is modeled as a sparse linear combination of clean speech and noise exemplars. Exemplars are rigid long speech units of different lengths, i.e. no warping mechanism is used for exemplar matching to avoid poor time alignments that would otherwise be provoked by the noise and the natural duration distribution of each unit in the training data is preserved. Speech and noise separation is performed by applying non-negative sparse coding using a separate exemplar dictionary for each labeled unit (in this case half-digits) rather than a single dictionary of all units. This approach does not only provide better classification of speech units but also models the temporal structure of speech and noise more accurately. The system performance is evaluated on the AURORA-2 database. The results show that the proposed system performs significantly better than a comparable system using a single dictionary at positive SNR levels.
引用
收藏
页数:4
相关论文
共 50 条
  • [41] Action recognition using exemplar-based embedding
    Weinland, Daniel
    Boyer, Edmond
    2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 3033 - 3039
  • [42] Superpixel-Based Noise-Robust Sparse Unmixing of Hyperspectral Image
    Li, Chang
    Sui, Chenhong
    Song, Rencheng
    Cheng, Juan
    Liu, Yu
    Chen, Xun
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [43] Noise-robust speech recognition based on difference of power spectrum
    Xu, JF
    Wei, G
    ELECTRONICS LETTERS, 2000, 36 (14) : 1247 - 1248
  • [44] Noise-Robust Speech Recognition Based on RBF Neural Network
    Hou, Xuemei
    HIGH PERFORMANCE STRUCTURES AND MATERIALS ENGINEERING, PTS 1 AND 2, 2011, 217-218 : 413 - 418
  • [45] Noise-Robust Speaker Recognition Based on Morphological Component Analysis
    He, Yongjun
    Chen, Chen
    Han, Jiqing
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3001 - 3005
  • [46] Robust Visual Tracking Using Exemplar-Based Detectors
    Gao, Changxin
    Chen, Feifei
    Yu, Jin-Gang
    Huang, Rui
    Sang, Nong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2017, 27 (02) : 300 - 312
  • [47] Exemplar-Based Sparse Representation Features: From TIMIT to LVCSR
    Sainath, Tara N.
    Ramabhadran, Bhuvana
    Picheny, Michael
    Nahamoo, David
    Kanevsky, Dimitri
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (08): : 2598 - 2613
  • [48] Multipitch Estimation of Piano Music by Exemplar-Based Sparse Representation
    Lee, Cheng-Te
    Yang, Yi-Hsuan
    Chen, Homer H.
    IEEE TRANSACTIONS ON MULTIMEDIA, 2012, 14 (03) : 608 - 618
  • [49] NOISE-ROBUST EXEMPLAR MATCHING FOR RESCORING QUERY-BY-EXAMPLE SEARCH
    Yilmaz, Emre
    van Hout, Julien
    Franco, Horacio
    2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 1 - 7
  • [50] Sparse Kernel Cepstral Coefficients (SKCC): Inner-product based Features for Noise-Robust Speech Recognition
    Fazel, Amin
    Chakrabartty, Shantanu
    2011 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2011, : 2401 - 2404