Enhanced Local Feature Approach for Overlapping Sound Event Recognition

被引:0
|
作者
Dennis, Jonathan [1 ]
Huy Dat Tran [1 ]
机构
[1] ASTAR, Inst Infocomm Res, 1 Fusionopolis Way, Singapore 138632, Singapore
关键词
SPEECH RECOGNITION; NOISE;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper, we propose a feature-based approach to address the challenging task of recognising overlapping sound events from single channel audio. Our approach is based on our previous work on Local Spectrogram Features (LSFs), where we combined a local spectral representation of the spectrogram with the Generalised Hough Transform (GHT) voting system for recognition. Here we propose to take the output from the GHT and use it as a feature for classification, and demonstrate that such an approach can improve upon the previous knowledge based scoring system. Experiments are carried out on a challenging set of five overlapping sound events, with the addition of non-stationary background noise and volume change. The results show that the proposed system can achieve a detection rate of 99% and 91% in clean and 0dB noise conditions respectively, which is a strong improvement over our previous work.
引用
收藏
页数:4
相关论文
共 50 条
  • [41] Animal Sound Recognition Based on Double Feature of Spectrogram
    LI Ying
    HUANG Hongkeng
    WU Zhibin
    Chinese Journal of Electronics, 2019, 28 (04) : 667 - 673
  • [42] Cochleagram Image Feature for Improved Robustness in Sound Recognition
    Sharan, Roneel V.
    Moir, Tom J.
    2015 IEEE INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2015, : 441 - 444
  • [43] An auditory feature detection circuit for sound pattern recognition
    Schoeneich, Stefan
    Kostarakos, Konstantinos
    Hedwig, Berthold
    SCIENCE ADVANCES, 2015, 1 (08):
  • [44] Animal Sound Recognition Based on Double Feature of Spectrogram
    Li Ying
    Huang Hongkeng
    Wu Zhibin
    CHINESE JOURNAL OF ELECTRONICS, 2019, 28 (04) : 667 - 673
  • [45] Exploiting Temporal Feature Integration for Generalized Sound Recognition
    Stavros Ntalampiras
    Ilyas Potamitis
    Nikos Fakotakis
    EURASIP Journal on Advances in Signal Processing, 2009
  • [46] Exploiting Temporal Feature Integration for Generalized Sound Recognition
    Ntalampiras, Stavros
    Potamitis, Ilyas
    Fakotakis, Nikos
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2009,
  • [47] Spectrogram Image Feature for Sound Event Classification in Mismatched Conditions
    Dennis, Jonathan
    Tran, Huy Dat
    Li, Haizhou
    IEEE SIGNAL PROCESSING LETTERS, 2011, 18 (02) : 130 - 133
  • [48] An Approach for Local Feature Evaluation
    Choi, Yukyung
    Kweon, In So
    2015 12TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAI), 2015, : 382 - 383
  • [49] Enhanced Gradient-Based Local Feature Descriptors by Saliency Map for Egocentric Action Recognition
    Zuo, Zheming
    Wei, Bo
    Chao, Fei
    Qu, Yanpeng
    Peng, Yonghong
    Yang, Longzhi
    APPLIED SYSTEM INNOVATION, 2019, 2 (01) : 1 - 14
  • [50] Methodological improvement on local Gabor face recognition based on feature selection and enhanced Borda count
    Perez, Claudio A.
    Cament, Leonardo A.
    Castillo, Luis E.
    PATTERN RECOGNITION, 2011, 44 (04) : 951 - 963