Enhanced Local Feature Approach for Overlapping Sound Event Recognition

Cited by: 0
Authors
Dennis, Jonathan [1]
Huy Dat Tran [1]
Affiliations
[1] ASTAR, Inst Infocomm Res, 1 Fusionopolis Way, Singapore 138632, Singapore
Keywords
SPEECH RECOGNITION; NOISE
DOI
Not available
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
In this paper, we propose a feature-based approach to the challenging task of recognising overlapping sound events from single-channel audio. Our approach builds on our previous work on Local Spectrogram Features (LSFs), in which we combined a local spectral representation of the spectrogram with the Generalised Hough Transform (GHT) voting system for recognition. Here we propose to take the output of the GHT and use it as a feature for classification, and we demonstrate that this approach improves upon the previous knowledge-based scoring system. Experiments are carried out on a challenging set of five overlapping sound events, with the addition of non-stationary background noise and volume change. The results show that the proposed system achieves detection rates of 99% and 91% in clean and 0 dB noise conditions respectively, a strong improvement over our previous work.
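The idea of using the GHT output as a classification feature can be illustrated with a minimal sketch. This is not the authors' implementation: the codebook contents, the choice of peak vote count as the feature, and the nearest-centroid classifier are all assumptions made for illustration, and the names `CODEBOOK`, `ght_accumulate`, `hough_feature`, and `nearest_centroid` are hypothetical.

```python
# Hypothetical codebook: codeword id -> list of (event class, frame offset from
# the event onset). In practice this would be learned from training data.
CODEBOOK = {
    0: [("bell", 0), ("horn", 2)],
    1: [("bell", 3)],
    2: [("horn", 0)],
}
CLASSES = ["bell", "horn"]


def ght_accumulate(observations, n_frames):
    """Cast onset-time votes; observations is a list of (codeword id, frame)."""
    acc = {c: [0.0] * n_frames for c in CLASSES}
    for codeword, frame in observations:
        for event, offset in CODEBOOK.get(codeword, []):
            onset = frame - offset  # onset hypothesis implied by this local feature
            if 0 <= onset < n_frames:
                acc[event][onset] += 1.0
    return acc


def hough_feature(acc):
    """Assumed feature: the peak vote count of each class accumulator."""
    return [max(acc[c]) for c in CLASSES]


def nearest_centroid(feature, centroids):
    """Stand-in classifier: label of the closest centroid (squared Euclidean)."""
    def dist(label):
        return sum((a - b) ** 2 for a, b in zip(feature, centroids[label]))
    return min(centroids, key=dist)
```

In this sketch, votes from an overlapping mixture form a peak in more than one class accumulator, so a classifier trained on accumulator-derived features can assign an overlap label instead of relying on hand-set score thresholds.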
Pages: 4