Selective Gammatone Filterbank Feature for Robust Sound Event Recognition

被引:0
|
作者
Leng, Yi Ren [1 ]
Huy Dat Tran [1 ]
Kitaoka, Norihide [2 ]
Li, Haizhou [1 ]
机构
[1] ASTAR, Inst Infocomm Res, Human Language Technol Dept, Singapore 138632, Singapore
[2] Nagoya Univ, Nagoya, Aichi 4648601, Japan
关键词
gammatone filterbank; Hidden Markov Model; robust recognition; sound event recognition;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper introduces a novel feature based on the raw output of the garnmatone filterbank. Channel selection is used to enhance robustness over a range of signal-to-noise ratios (SNR) of additive noise. The recognition accuracy of the proposed feature is tested on a sound event database using a Hidden Markov Model (HMM) recogniser. A comparison with a series of similar features and the conventional Mel-Frequency Cepstral Coefficients (MFCC) shows that the proposed feature offers significant improvement in low SNR conditions.
引用
收藏
页码:2246 / +
页数:2
相关论文
共 50 条
  • [31] Image Feature Representation of the Subband Power Distribution for Robust Sound Event Classification
    Dennis, Jonathan
    Tran, Huy Dat
    Chng, Eng Siong
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (02): : 367 - 377
  • [32] Acoustic Feature Extraction for Robust Event Recognition on Cleaning Robot Platform
    Park, Sang-wook
    Rho, Jin-sang
    Shin, Min-kyu
    Han, David K.
    Ko, Hanseok
    2014 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2014, : 147 - 148
  • [33] On the Effects of Filterbank Design and Energy Computation on Robust Speech Recognition
    Dimitriadis, Dimitrios
    Maragos, Petros
    Potamianos, Alexandros
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (06): : 1504 - 1516
  • [34] Filterbank Learning for Deep Neural Network Based Polyphonic Sound Event Detection
    Cakir, Emre
    Ozan, Ezgi Can
    Virtanen, Tuomas
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 3399 - 3406
  • [35] MMSE estimation of log-filterbank energies for robust speech recognition
    Stark, Anthony
    Paliwal, Kuldip
    SPEECH COMMUNICATION, 2011, 53 (03) : 403 - 416
  • [36] An 800 nW Switched-Capacitor Feature Extraction Filterbank for Sound Classification
    Villamizar, Daniel Augusto
    Muratore, Dante Gabriel
    Wieser, James B.
    Murmann, Boris
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2021, 68 (04) : 1578 - 1588
  • [37] Pseudo-color cochleagram image feature and sequential feature selection for robust acoustic event recognition
    Sharan, Roneel V.
    Moir, Tom J.
    APPLIED ACOUSTICS, 2018, 140 : 198 - 204
  • [38] GENERALIZED GAUSSIAN DISTRIBUTION KULLBACK-LEIBLER KERNEL FOR ROBUST SOUND EVENT RECOGNITION
    Tran Huy Dat
    Terence, Ng Wen Zheng
    Dennis, Jonathan William
    Ren, Leng Yi
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [39] Parameter Tuning-Free Missing-Feature Reconstruction for Robust Sound Recognition
    Liu, Qi
    Wu, Jibin
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2021, 15 (01) : 78 - 89
  • [40] Missing Feature Kernel and Nonparametric Window Subband Power Distribution for Robust Sound Event Classification
    Tran Huy Dat
    Dennis, Jonathan William
    Terence, Ng Wen Zheng
    SPEECH AND COMPUTER (SPECOM 2015), 2015, 9319 : 277 - 284