NOISE-ROBUST DIGIT RECOGNITION WITH EXEMPLAR-BASED SPARSE REPRESENTATIONS OF VARIABLE LENGTH

被引:0
|
作者
Yilmaz, Emre [1 ]
Gemmeke, Jort F. [1 ]
Van Compernolle, Dirk [1 ]
Van Hamme, Hugo [1 ]
机构
[1] Katholieke Univ Leuven, Dept ESAT, Louvain, Belgium
关键词
Exemplar-based recognition; noise robustness; non-negative sparse coding; multiple dictionaries; CONTINUOUS SPEECH RECOGNITION; SEPARATION;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper introduces an exemplar-based noise-robust digit recognition system in which noisy speech is modeled as a sparse linear combination of clean speech and noise exemplars. Exemplars are rigid long speech units of different lengths, i.e. no warping mechanism is used for exemplar matching to avoid poor time alignments that would otherwise be provoked by the noise and the natural duration distribution of each unit in the training data is preserved. Speech and noise separation is performed by applying non-negative sparse coding using a separate exemplar dictionary for each labeled unit (in this case half-digits) rather than a single dictionary of all units. This approach does not only provide better classification of speech units but also models the temporal structure of speech and noise more accurately. The system performance is evaluated on the AURORA-2 database. The results show that the proposed system performs significantly better than a comparable system using a single dictionary at positive SNR levels.
引用
收藏
页数:4
相关论文
共 50 条
  • [21] SUPERVISED SPEECH DEREVERBERATION IN NOISY ENVIRONMENTS USING EXEMPLAR-BASED SPARSE REPRESENTATIONS
    Baby, Deepak
    Van Hamme, Hugo
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 156 - 160
  • [22] Sparse coding of the modulation spectrum for noise-robust automatic speech recognition
    Sara Ahmadi
    Seyed Mohammad Ahadi
    Bert Cranen
    Lou Boves
    EURASIP Journal on Audio, Speech, and Music Processing, 2014
  • [23] Sparse modeling of neural network posterior probabilities for exemplar-based speech recognition
    Dighe, Pranay
    Asaei, Afsaneh
    Bourlard, Herve
    SPEECH COMMUNICATION, 2016, 76 : 230 - 244
  • [24] Exemplar-based logo and trademark recognition
    Farajzadeh, Nacer
    MACHINE VISION AND APPLICATIONS, 2015, 26 (06) : 791 - 805
  • [25] Exemplar-based facial expression recognition
    Farajzadeh, Nacer
    Hashemzadeh, Mandi
    INFORMATION SCIENCES, 2018, 460 : 318 - 330
  • [26] Joint Denoising and Dereverberation Using Exemplar-Based Sparse Representations and Decaying Norm Constraint
    Baby, Deepak
    Van Hamme, Hugo
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (10) : 2024 - 2035
  • [27] Sparse coding of the modulation spectrum for noise-robust automatic speech recognition
    Ahmadi, Sara
    Ahadi, Seyed Mohammad
    Cranen, Bert
    Boves, Lou
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2014, : 1 - 20
  • [28] Exemplar-based logo and trademark recognition
    Nacer Farajzadeh
    Machine Vision and Applications, 2015, 26 : 791 - 805
  • [29] Exemplar-Based Processing for Speech Recognition
    Sainath, Tara N.
    Ramabhadran, Bhuvana
    Nahamoo, David
    Kanevsky, Dimitri
    Van Compernolle, Dirk
    Demuynck, Kris
    Gemmeke, Jort Florent
    Bellegarda, Jerome R.
    Sundaram, Shiva
    IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) : 98 - 113
  • [30] Improving Sparse Representations in Exemplar-Based Voice Conversion with a Phoneme-Selective Objective Function
    Ding, Shaojin
    Zhao, Guanlong
    Liberatore, Christopher
    Gutierrez-Osuna, Ricardo
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 476 - 480