NOISE-ROBUST DIGIT RECOGNITION WITH EXEMPLAR-BASED SPARSE REPRESENTATIONS OF VARIABLE LENGTH

Cited by: 0

Authors:

Yilmaz, Emre [1]

Gemmeke, Jort F. [1]

Van Compernolle, Dirk [1]

Van Hamme, Hugo [1]

Affiliations:

[1] Katholieke Univ Leuven, Dept ESAT, Louvain, Belgium

Source:

2012 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP) | 2012

Keywords:

Exemplar-based recognition; noise robustness; non-negative sparse coding; multiple dictionaries; CONTINUOUS SPEECH RECOGNITION; SEPARATION;

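DOI:

Not available

Chinese Library Classification number:

TP [Automation technology, computer technology];

Subject classification code:

0812 ;

Abstract: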

This paper introduces an exemplar-based noise-robust digit recognition system in which noisy speech is modeled as a sparse linear combination of clean speech and noise exemplars. Exemplars are rigid, long speech units of different lengths: no warping mechanism is used for exemplar matching, which avoids the poor time alignments that the noise would otherwise provoke and preserves the natural duration distribution of each unit in the training data. Speech and noise separation is performed by applying non-negative sparse coding using a separate exemplar dictionary for each labeled unit (in this case half-digits) rather than a single dictionary of all units. This approach not only provides better classification of speech units but also models the temporal structure of speech and noise more accurately. The system performance is evaluated on the AURORA-2 database. The results show that the proposed system performs significantly better than a comparable system using a single dictionary at positive SNR levels.
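
The idea in the abstract, approximating a noisy observation as a non-negative sparse combination of speech and noise exemplars with one dictionary per labeled unit, can be illustrated with a small sketch. Everything below is an assumption made for illustration and is not taken from the paper: the toy feature values, the function names `nonneg_sparse_code` and `kl_divergence`, the sparsity weight, the multiplicative update for a generalized Kullback-Leibler cost, and the rule of picking the unit whose dictionary reconstructs the observation best.

```python
import numpy as np

def nonneg_sparse_code(y, A, sparsity=0.1, n_iter=300, eps=1e-12):
    """Find x >= 0 with y ~= A @ x using multiplicative updates that
    decrease generalized KL divergence plus an L1 penalty on x."""
    x = np.ones(A.shape[1])
    denom = A.sum(axis=0) + sparsity  # A^T 1 + lambda
    for _ in range(n_iter):
        x *= (A.T @ (y / (A @ x + eps))) / denom
    return x

def kl_divergence(y, y_hat, eps=1e-12):
    """Generalized KL divergence between observation and reconstruction."""
    return float(np.sum(y * np.log((y + eps) / (y_hat + eps)) - y + y_hat))

# Hypothetical toy exemplars: each column is one flattened, non-negative
# feature vector. Real exemplars would span many stacked frames.
speech = {
    "one": np.array([[4, 3, .1, .1, .1, .1], [3, 4, .1, .1, .1, .1]]).T,
    "two": np.array([[.1, .1, 4, 3, .1, .1], [.1, .1, 3, 4, .1, .1]]).T,
}
noise = np.array([[.1, .1, .1, .1, 2, 2]]).T

# Noisy observation: an exemplar of unit "two" plus scaled noise.
y = speech["two"][:, 0] + 0.5 * noise[:, 0]

# One dictionary per unit: that unit's speech exemplars plus noise exemplars.
scores = {}
for label, S in speech.items():
    A = np.hstack([S, noise])
    x = nonneg_sparse_code(y, A)
    scores[label] = kl_divergence(y, A @ x)

# Assumed decision rule: lowest reconstruction divergence wins.
best = min(scores, key=scores.get)
print(best, scores)
```

In this toy setup, the dictionary built from the matching unit can explain both the speech and the noise part of the observation, so its divergence is much lower than that of the mismatched dictionary.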


Pages: 4

50 items in total

[21] SUPERVISED SPEECH DEREVERBERATION IN NOISY ENVIRONMENTS USING EXEMPLAR-BASED SPARSE REPRESENTATIONS
Baby, Deepak
Van Hamme, Hugo
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 156 - 160
[22] Sparse coding of the modulation spectrum for noise-robust automatic speech recognition
Sara Ahmadi
Seyed Mohammad Ahadi
Bert Cranen
Lou Boves
EURASIP Journal on Audio, Speech, and Music Processing, 2014
[23] Sparse modeling of neural network posterior probabilities for exemplar-based speech recognition
Dighe, Pranay
Asaei, Afsaneh
Bourlard, Herve
SPEECH COMMUNICATION, 2016, 76 : 230 - 244
[24] Exemplar-based logo and trademark recognition
Farajzadeh, Nacer
MACHINE VISION AND APPLICATIONS, 2015, 26 (06) : 791 - 805
[25] Exemplar-based facial expression recognition
Farajzadeh, Nacer
Hashemzadeh, Mandi
INFORMATION SCIENCES, 2018, 460 : 318 - 330
[26] Joint Denoising and Dereverberation Using Exemplar-Based Sparse Representations and Decaying Norm Constraint
Baby, Deepak
Van Hamme, Hugo
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (10) : 2024 - 2035
[27] Sparse coding of the modulation spectrum for noise-robust automatic speech recognition
Ahmadi, Sara
Ahadi, Seyed Mohammad
Cranen, Bert
Boves, Lou
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2014, : 1 - 20
[28] Exemplar-based logo and trademark recognition
Nacer Farajzadeh
Machine Vision and Applications, 2015, 26 : 791 - 805
[29] Exemplar-Based Processing for Speech Recognition
Sainath, Tara N.
Ramabhadran, Bhuvana
Nahamoo, David
Kanevsky, Dimitri
Van Compernolle, Dirk
Demuynck, Kris
Gemmeke, Jort Florent
Bellegarda, Jerome R.
Sundaram, Shiva
IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) : 98 - 113
[30] Improving Sparse Representations in Exemplar-Based Voice Conversion with a Phoneme-Selective Objective Function
Ding, Shaojin
Zhao, Guanlong
Liberatore, Christopher
Gutierrez-Osuna, Ricardo
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 476 - 480
