Enhanced Local Feature Approach for Overlapping Sound Event Recognition

被引:0
|
作者
Dennis, Jonathan [1 ]
Huy Dat Tran [1 ]
机构
[1] ASTAR, Inst Infocomm Res, 1 Fusionopolis Way, Singapore 138632, Singapore
关键词
SPEECH RECOGNITION; NOISE;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper, we propose a feature-based approach to address the challenging task of recognising overlapping sound events from single channel audio. Our approach is based on our previous work on Local Spectrogram Features (LSFs), where we combined a local spectral representation of the spectrogram with the Generalised Hough Transform (GHT) voting system for recognition. Here we propose to take the output from the GHT and use it as a feature for classification, and demonstrate that such an approach can improve upon the previous knowledge based scoring system. Experiments are carried out on a challenging set of five overlapping sound events, with the addition of non-stationary background noise and volume change. The results show that the proposed system can achieve a detection rate of 99% and 91% in clean and 0dB noise conditions respectively, which is a strong improvement over our previous work.
引用
收藏
页数:4
相关论文
共 50 条
  • [21] Enhanced global and local face feature extraction for effective recognition of facial emotions
    Retnamony, Jeen Retna Kumar
    Muniasamy, Sundaram
    Stanley, Berakhah Florence
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (05):
  • [22] An Offline Fuzzy Based Approach for Iris Recognition with Enhanced Feature Detection
    Kodituwakku, S. R.
    Fazeen, M. I. M.
    ADVANCES TECHNIQUES IN COMPUTING SCIENCES AND SOFTWARE ENGINEERING, 2010, : 39 - 44
  • [23] AReN: A Deep Learning Approach for Sound Event Recognition Using a Brain Inspired Representation
    Greco, Antonio
    Petkov, Nicolai
    Saggese, Alessia
    Vento, Mario
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2020, 15 : 3610 - 3624
  • [24] Sound-Event Recognition with a Companion Humanoid
    Janvier, Maxime
    Alameda-Pineda, Xavier
    Girin, Laurent
    Horaud, Radu
    2012 12TH IEEE-RAS INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS), 2012, : 104 - 111
  • [25] Sound Event Recognition With Probabilistic Distance SVMs
    Huy Dat Tran
    Li, Haizhou
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (06): : 1556 - 1568
  • [26] Sound recognition: A connectionist approach
    Harb, H
    Chen, LM
    SEVENTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOL 2, PROCEEDINGS, 2003, : 611 - 614
  • [27] A robust approach based on local feature extraction for age invariant face recognition
    Rajesh Kumar Tripathi
    Anand Singh Jalal
    Multimedia Tools and Applications, 2022, 81 : 21223 - 21240
  • [28] Monogenic Binary Coding: An Efficient Local Feature Extraction Approach to Face Recognition
    Yang, Meng
    Zhang, Lei
    Shiu, Simon Chi-Keung
    Zhang, David
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2012, 7 (06) : 1738 - 1751
  • [29] Slow Feature Analysis for Mitotic Event Recognition
    Chu, Jinghui
    Liang, Hailan
    Tong, Zheng
    Lu, Wei
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2017, 11 (03): : 1670 - 1683
  • [30] A robust approach based on local feature extraction for age invariant face recognition
    Tripathi, Rajesh Kumar
    Jalal, Anand Singh
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (15) : 21223 - 21240