AUTOMATIC ACOUSTIC SIREN DETECTION IN TRAFFIC NOISE BY PART-BASED MODELS

被引:0
|
作者
Schroeder, Jens [1 ]
Goetze, Stefan [1 ]
Gruetzmacher, Volker [2 ]
Anemueller, Joern [1 ,3 ]
机构
[1] Hearing Speech & Audio Technol, Fraunhofer IDMT, D-26129 Oldenburg, Germany
[2] Adam Opel AG, D-65423 Russelsheim, Germany
[3] Carl von Ossietzky Univ Oldenburg, Dept Phys, D-26111 Oldenburg, Germany
来源
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2013年
关键词
acoustic event detection (AED); part-based model (PBM); siren detection;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
State-of-the-art classifiers like hidden Markov models (HMMs) in combination with mel-frequency cepstral coefficients (MFCCs) are flexible in time but rigid in the spectral dimension. In contrast, part-based models (PBMs) originally proposed in computer vision consist of parts in a fully deformable configuration. The present contribution proposes to employ PBMs in the spectro-temporal domain for detection of emergency siren sounds in traffic noise, resulting in a classifier that is robust to shifts in frequency induced, e.g., by Doppler-shift effects. Two improvements over standard machine learning techniques for PBM estimation are proposed: (i) Spectro-temporal part ("appearance") extraction is initialized by interest point detection instead of random initialization and (ii) a discriminative training approach in addition to standard generative training is implemented. Evaluation with self-recorded police sirens and traffic noise gathered on-line demonstrates that PBMs are successful in acoustic siren detection. One hand-labeled and two machine learned PBMs are compared to standard HMMs employing mel-spectrograms and MFCCs in clean and multi condition (multiple SNR) training settings. Results show that in clean condition training, hand-labeled PBMs and HMMs outperform machine-learned PBMs already for test data with moderate additive noise. In multi condition training, the machine learned PBMs outperform HMMs on most SNRs, achieving high accuracies and being nearly optimal up to 5 dB SNR. Thus, our simulation results show that PBMs are a promising approach for acoustic event detection (AED).
引用
收藏
页码:493 / 497
页数:5
相关论文
共 50 条
  • [1] Vehicle Detection with a Part-based Model for Complex Traffic Conditions
    Li, Ye
    Tian, Bin
    Li, Bo
    Xiong, Gang
    Zhu, Fenghua
    Wang, Kunfeng
    2013 IEEE INTERNATIONAL CONFERENCE ON VEHICULAR ELECTRONICS AND SAFETY (ICVES), 2013, : 110 - 113
  • [2] URBAN STRUCTURE DETECTION WITH DEFORMABLE PART-BASED MODELS
    Randrianarivo, Hicham
    Le Saux, Bertrand
    Ferecatu, Marin
    2013 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2013, : 200 - 203
  • [3] Part-based statistical models for object classification and detection
    Bernstein, EJ
    Amit, Y
    2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2005, : 734 - 740
  • [4] Object Detection with Discriminatively Trained Part-Based Models
    Forsyth, David
    COMPUTER, 2014, 47 (02) : 6 - 7
  • [5] Object Detection with Discriminatively Trained Part-Based Models
    Felzenszwalb, Pedro F.
    Girshick, Ross B.
    McAllester, David
    Ramanan, Deva
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (09) : 1627 - 1645
  • [6] Classify vehicles in traffic scene images with deformable part-based models
    Bai, Shuang
    Liu, Zhenyao
    Yao, Chang
    MACHINE VISION AND APPLICATIONS, 2018, 29 (03) : 393 - 403
  • [7] Classify vehicles in traffic scene images with deformable part-based models
    Shuang Bai
    Zhenyao Liu
    Chang Yao
    Machine Vision and Applications, 2018, 29 : 393 - 403
  • [8] Improved Object Detection and Pose Using Part-Based Models
    Jiang, Fangyuan
    Enqvist, Olof
    Kahl, Fredrik
    Astrom, Kalle
    IMAGE ANALYSIS, SCIA 2013: 18TH SCANDINAVIAN CONFERENCE, 2013, 7944 : 396 - 407
  • [9] Part-based local shape models for colon polyp detection
    Bhotika, Rahul
    Mendonca, Paulo R. S.
    Sirohey, Saad A.
    Turner, Wesley D.
    Lee, Ying-lin
    McCoy, Julie M.
    Brown, Rebecca E. B.
    Miller, James V.
    MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION - MICCAI 2006, PT 2, 2006, 4191 : 479 - 486
  • [10] Improvements of 3D object detection with part-based models
    Lu, Wen-Hao
    Li, Ya-Li
    Wang, Sheng-Jin
    Ding, Xiao-Qing
    Zidonghua Xuebao/Acta Automatica Sinica, 2012, 38 (04): : 497 - 506