Selective Gammatone Filterbank Feature for Robust Sound Event Recognition

Cited by: 0

Authors:

Leng, Yi Ren [1]

Huy Dat Tran [1]

Kitaoka, Norihide [2]

Li, Haizhou [1]

Affiliations:

[1] ASTAR, Inst Infocomm Res, Human Language Technol Dept, Singapore 138632, Singapore

[2] Nagoya Univ, Nagoya, Aichi 4648601, Japan

Source:

11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4 | 2010

Keywords:

gammatone filterbank; Hidden Markov Model; robust recognition; sound event recognition

DOI:

Not available

Chinese Library Classification (CLC) number:

TM [Electrical Engineering]; TN [Electronics and Communication Technology];

Discipline classification code:

0808 ; 0809 ;

Abstract:

This paper introduces a novel feature based on the raw output of the gammatone filterbank. Channel selection is used to enhance robustness over a range of signal-to-noise ratios (SNR) of additive noise. The recognition accuracy of the proposed feature is tested on a sound event database using a Hidden Markov Model (HMM) recogniser. A comparison with a series of similar features and the conventional Mel-Frequency Cepstral Coefficients (MFCC) shows that the proposed feature offers significant improvement in low SNR conditions.


Pages: 2246 / +

Page count: 2
