AUTOMATIC ACOUSTIC SIREN DETECTION IN TRAFFIC NOISE BY PART-BASED MODELS

Cited by: 0
Authors
Schroeder, Jens [1]
Goetze, Stefan [1]
Gruetzmacher, Volker [2]
Anemueller, Joern [1, 3]
Affiliations
[1] Hearing Speech & Audio Technol, Fraunhofer IDMT, D-26129 Oldenburg, Germany
[2] Adam Opel AG, D-65423 Russelsheim, Germany
[3] Carl von Ossietzky Univ Oldenburg, Dept Phys, D-26111 Oldenburg, Germany
Source
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2013
Keywords
acoustic event detection (AED); part-based model (PBM); siren detection;
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Subject classification code
070206; 082403;
Abstract
State-of-the-art classifiers such as hidden Markov models (HMMs) in combination with mel-frequency cepstral coefficients (MFCCs) are flexible in time but rigid in the spectral dimension. In contrast, part-based models (PBMs), originally proposed in computer vision, consist of parts in a fully deformable configuration. The present contribution proposes to employ PBMs in the spectro-temporal domain for detection of emergency siren sounds in traffic noise, resulting in a classifier that is robust to shifts in frequency induced, e.g., by Doppler-shift effects. Two improvements over standard machine learning techniques for PBM estimation are proposed: (i) spectro-temporal part ("appearance") extraction is initialized by interest point detection instead of random initialization, and (ii) a discriminative training approach is implemented in addition to standard generative training. Evaluation with self-recorded police sirens and traffic noise gathered online demonstrates that PBMs are successful in acoustic siren detection. One hand-labeled and two machine-learned PBMs are compared to standard HMMs employing mel-spectrograms and MFCCs in clean and multi-condition (multiple-SNR) training settings. Results show that in clean-condition training, hand-labeled PBMs and HMMs outperform machine-learned PBMs already for test data with moderate additive noise. In multi-condition training, the machine-learned PBMs outperform HMMs on most SNRs, achieving high accuracies and being nearly optimal up to 5 dB SNR. Thus, our simulation results show that PBMs are a promising approach for acoustic event detection (AED).
Pages: 493-497
Number of pages: 5
Related papers
50 in total
  • [21] PART-BASED HUMAN DETECTION ON RIEMANNIAN MANIFOLDS
    Tosato, D.
    Farenzena, M.
    Cristani, M.
    Murino, V.
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 3469 - 3472
  • [22] FUSING GENERIC OBJECTNESS AND DEFORMABLE PART-BASED MODELS FOR WEAKLY SUPERVISED OBJECT DETECTION
    Tang, Yuxing
    Wang, Xiaofang
    Dellandrea, Emmanuel
    Masnou, Simon
    Chen, Liming
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 4072 - 4076
  • [23] Acoustic Features for Deep Learning-Based Models for Emergency Siren Detection: an Evaluation Study
    Cantarini, Michela
    Brocanelli, Anna
    Gabrielli, Leonardo
    Squartini, Stefano
    PROCEEDINGS OF THE 12TH INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS (ISPA 2021), 2021, : 47 - 53
  • [24] A Compositional Approach to Learning Part-based Models of Objects
    Mottaghi, Roozbeh
    Ranganathan, Ananth
    Yuille, Alan
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCV WORKSHOPS), 2011,
  • [25] RigMesh: Automatic Rigging for Part-Based Shape Modeling and Deformation
    Borosan, Peter
    Jin, Ming
    DeCarlo, Doug
    Gingold, Yotam
    Nealen, Andrew
    ACM TRANSACTIONS ON GRAPHICS, 2012, 31 (06):
  • [26] STATISTICAL PART-BASED MODELS FOR OBJECT CATEGORY RECOGNITION
    Xia, Xiao-Zhen
    Zhang, Shu-Wu
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-6, 2009, : 1846 - 1850
  • [27] OBJECT TRACKING WITH PART-BASED DISCRIMINATIVE CONTEXT MODELS
    Zhu, Guibo
    Wang, Jinqiao
    Lu, Hanqing
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 4932 - 4936
  • [28] A Part-Based Probabilistic Model for Object Detection with Occlusion
    Zhang, Chunhui
    Zhang, Jun
    Zhao, Heng
    Liang, Jimin
    PLOS ONE, 2014, 9 (01):
  • [29] Deformable Part-Based Model Transfer for Object Detection
    Ruan, Zhiwei
    Wang, Guijin
    Lin, Xinggang
    Xue, Jing-Hao
    Jiang, Yong
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (05) : 1394 - 1397
  • [30] Two-stage Part-Based Pedestrian Detection
    Mogelmose, Andreas
    Prioletti, Antonio
    Trivedi, Mohan M.
    Broggi, Alberto
    Moeslund, Thomas B.
    2012 15TH INTERNATIONAL IEEE CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2012, : 67 - 71