Enhanced Local Feature Approach for Overlapping Sound Event Recognition

Authors
Dennis, Jonathan [1]
Huy Dat Tran [1]
Affiliations
[1] ASTAR, Inst Infocomm Res, 1 Fusionopolis Way, Singapore 138632, Singapore
Keywords
SPEECH RECOGNITION; NOISE
DOI
Not available
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
In this paper, we propose a feature-based approach to address the challenging task of recognising overlapping sound events from single-channel audio. Our approach builds on our previous work on Local Spectrogram Features (LSFs), where we combined a local spectral representation of the spectrogram with the Generalised Hough Transform (GHT) voting system for recognition. Here we propose to take the output from the GHT and use it as a feature for classification, and demonstrate that such an approach can improve upon the previous knowledge-based scoring system. Experiments are carried out on a challenging set of five overlapping sound events, with the addition of non-stationary background noise and volume change. The results show that the proposed system can achieve detection rates of 99% and 91% in clean and 0 dB noise conditions, respectively, which is a strong improvement over our previous work.
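The idea in the abstract of treating the GHT voting output as a feature can be illustrated with a toy sketch. This is not the authors' implementation: the codebook layout, the function names `ght_vote` and `hough_features`, and the choice of the per-class vote peak as the feature are all assumptions made for illustration only.

```python
from collections import defaultdict

def ght_vote(keypoints, codebook, n_frames):
    """Accumulate Hough-style votes for each class's reference frame.

    keypoints: list of (frame, codeword_id) pairs found in a test spectrogram.
    codebook:  dict codeword_id -> list of (class_label, offset), where offset
               is the keypoint frame minus the event reference frame
               (hypothetical layout, assumed to come from training).
    """
    acc = defaultdict(lambda: [0.0] * n_frames)
    for frame, codeword in keypoints:
        for label, offset in codebook.get(codeword, []):
            ref = frame - offset          # frame this keypoint votes for
            if 0 <= ref < n_frames:
                acc[label][ref] += 1.0
    return dict(acc)

def hough_features(acc, classes):
    """Fixed-length feature vector: the peak vote count for each class.

    Assumption: the peak height stands in for the GHT output; the paper
    may use a richer description of the accumulator.
    """
    return [max(acc.get(label, [0.0])) for label in classes]

# Toy data (entirely made up): two codewords, three detected keypoints.
codebook = {0: [("bell", 2)], 1: [("bell", 5), ("horn", 1)]}
keypoints = [(12, 0), (15, 1), (30, 1)]
acc = ght_vote(keypoints, codebook, n_frames=40)
features = hough_features(acc, ["bell", "horn", "phone"])
print(features)
```

A vector like `features` could then be passed to any standard classifier in place of a hand-tuned score threshold, which is the contrast with knowledge-based scoring that the abstract draws.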
Pages: 4
Related papers
  • [1] Overlapping Sound Event Recognition using Local Spectrogram Features with the Generalised Hough Transform
    Dennis, Jonathan
    Huy Dat Tran
    Chng, Eng Siong
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012: 2263 - 2266
  • [2] Overlapping sound event recognition using local spectrogram features and the generalised hough transform
    Dennis, J.
    Tran, H. D.
    Chng, E. S.
    PATTERN RECOGNITION LETTERS, 2013, 34 (09) : 1085 - 1093
  • [3] Selective Gammatone Envelope Feature for Robust Sound Event Recognition
    Leng, Yi Ren
    Huy Dat Tran
    Kitaoka, Norihide
    Li, Haizhou
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2012, E95D (05) : 1229 - 1237
  • [4] Selective Gammatone Filterbank Feature for Robust Sound Event Recognition
    Leng, Yi Ren
    Huy Dat Tran
    Kitaoka, Norihide
    Li, Haizhou
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010: 2246 - +
  • [5] Overlapping Local Phase Feature (OLPF) for Robust Face Recognition in Surveillance
    Liu, Qiang
    Ngan, King Ngi
    ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS (ACIVS 2012), 2012, 7517 : 246 - 257
  • [6] An Enhanced Temporal Feature Integration Method for Environmental Sound Recognition
    Bountourakis, Vasileios
    Vrysis, Lazaros
    Konstantoudakis, Konstantinos
    Vryzas, Nikolaos
    ACOUSTICS, 2019, 1 (02) : 410 - 422
  • [7] Feature validity maintaining approach based on local feature recognition
    Chen, Zheng-Ming
    Gao, Shu-Ming
    Peng, Qun-Sheng
    Ruan Jian Xue Bao/Journal of Software, 2002, 13 (04) : 552 - 560
  • [8] Acoustic feature based unsupervised approach of heart sound event detection
    Das, Sangita
    Pal, Saurabh
    Mitra, Madhuchhanda
    COMPUTERS IN BIOLOGY AND MEDICINE, 2020, 126
  • [9] An Oxygen Desaturation Event Recognition Algorithm Based on Local Feature Extraction
    Wang, Hanqing
    Li, Min
    Cao, Jinge
    Huang, Longping
    Zhao, Yihan
    2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2016: 1679 - 1684
  • [10] Improving sound event detection through enhanced feature extraction and attention mechanisms
    Zhang, Dongping
    Wu, Siyi
    Lu, Zhanhong
    Zhang, Zhehao
    Hu, Haimiao
    Yu, Jiabin
    FRONTIERS OF COMPUTER SCIENCE, 2025, 19 (10)