NOISE-ROBUST DIGIT RECOGNITION WITH EXEMPLAR-BASED SPARSE REPRESENTATIONS OF VARIABLE LENGTH

被引：0

作者：

Yilmaz, Emre ^{[1
]}

Gemmeke, Jort F. ^{[1
]}

Van Compernolle, Dirk ^{[1
]}

Van Hamme, Hugo ^{[1
]}

机构：

[1] Katholieke Univ Leuven, Dept ESAT, Louvain, Belgium

来源：

2012 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP) | 2012年

关键词：

Exemplar-based recognition; noise robustness; non-negative sparse coding; multiple dictionaries; CONTINUOUS SPEECH RECOGNITION; SEPARATION;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper introduces an exemplar-based noise-robust digit recognition system in which noisy speech is modeled as a sparse linear combination of clean speech and noise exemplars. Exemplars are rigid long speech units of different lengths, i.e. no warping mechanism is used for exemplar matching to avoid poor time alignments that would otherwise be provoked by the noise and the natural duration distribution of each unit in the training data is preserved. Speech and noise separation is performed by applying non-negative sparse coding using a separate exemplar dictionary for each labeled unit (in this case half-digits) rather than a single dictionary of all units. This approach does not only provide better classification of speech units but also models the temporal structure of speech and noise more accurately. The system performance is evaluated on the AURORA-2 database. The results show that the proposed system performs significantly better than a comparable system using a single dictionary at positive SNR levels.

引用

页数：4

共 50 条

[41] Action recognition using exemplar-based embedding
Weinland, Daniel
Boyer, Edmond
2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 3033 - 3039
[42] Superpixel-Based Noise-Robust Sparse Unmixing of Hyperspectral Image
Li, Chang
Sui, Chenhong
Song, Rencheng
Cheng, Juan
Liu, Yu
Chen, Xun
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[43] Noise-robust speech recognition based on difference of power spectrum
Xu, JF
Wei, G
ELECTRONICS LETTERS, 2000, 36 (14) : 1247 - 1248
[44] Noise-Robust Speech Recognition Based on RBF Neural Network
Hou, Xuemei
HIGH PERFORMANCE STRUCTURES AND MATERIALS ENGINEERING, PTS 1 AND 2, 2011, 217-218 : 413 - 418
[45] Noise-Robust Speaker Recognition Based on Morphological Component Analysis
He, Yongjun
Chen, Chen
Han, Jiqing
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3001 - 3005
[46] Robust Visual Tracking Using Exemplar-Based Detectors
Gao, Changxin
Chen, Feifei
Yu, Jin-Gang
Huang, Rui
Sang, Nong
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2017, 27 (02) : 300 - 312
[47] Exemplar-Based Sparse Representation Features: From TIMIT to LVCSR
Sainath, Tara N.
Ramabhadran, Bhuvana
Picheny, Michael
Nahamoo, David
Kanevsky, Dimitri
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (08): : 2598 - 2613
[48] Multipitch Estimation of Piano Music by Exemplar-Based Sparse Representation
Lee, Cheng-Te
Yang, Yi-Hsuan
Chen, Homer H.
IEEE TRANSACTIONS ON MULTIMEDIA, 2012, 14 (03) : 608 - 618
[49] NOISE-ROBUST EXEMPLAR MATCHING FOR RESCORING QUERY-BY-EXAMPLE SEARCH
Yilmaz, Emre
van Hout, Julien
Franco, Horacio
2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 1 - 7
[50] Sparse Kernel Cepstral Coefficients (SKCC): Inner-product based Features for Noise-Robust Speech Recognition
Fazel, Amin
Chakrabartty, Shantanu
2011 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2011, : 2401 - 2404

← 1 2 3 4 5 →