AUTOMATIC ACOUSTIC SIREN DETECTION IN TRAFFIC NOISE BY PART-BASED MODELS

被引:0
|
作者
Schroeder, Jens [1 ]
Goetze, Stefan [1 ]
Gruetzmacher, Volker [2 ]
Anemueller, Joern [1 ,3 ]
机构
[1] Hearing Speech & Audio Technol, Fraunhofer IDMT, D-26129 Oldenburg, Germany
[2] Adam Opel AG, D-65423 Russelsheim, Germany
[3] Carl von Ossietzky Univ Oldenburg, Dept Phys, D-26111 Oldenburg, Germany
来源
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2013年
关键词
acoustic event detection (AED); part-based model (PBM); siren detection;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
State-of-the-art classifiers like hidden Markov models (HMMs) in combination with mel-frequency cepstral coefficients (MFCCs) are flexible in time but rigid in the spectral dimension. In contrast, part-based models (PBMs) originally proposed in computer vision consist of parts in a fully deformable configuration. The present contribution proposes to employ PBMs in the spectro-temporal domain for detection of emergency siren sounds in traffic noise, resulting in a classifier that is robust to shifts in frequency induced, e.g., by Doppler-shift effects. Two improvements over standard machine learning techniques for PBM estimation are proposed: (i) Spectro-temporal part ("appearance") extraction is initialized by interest point detection instead of random initialization and (ii) a discriminative training approach in addition to standard generative training is implemented. Evaluation with self-recorded police sirens and traffic noise gathered on-line demonstrates that PBMs are successful in acoustic siren detection. One hand-labeled and two machine learned PBMs are compared to standard HMMs employing mel-spectrograms and MFCCs in clean and multi condition (multiple SNR) training settings. Results show that in clean condition training, hand-labeled PBMs and HMMs outperform machine-learned PBMs already for test data with moderate additive noise. In multi condition training, the machine learned PBMs outperform HMMs on most SNRs, achieving high accuracies and being nearly optimal up to 5 dB SNR. Thus, our simulation results show that PBMs are a promising approach for acoustic event detection (AED).
引用
收藏
页码:493 / 497
页数:5
相关论文
共 50 条
  • [41] Deep & Deformable: Convolutional Mixtures of Deformable Part-based Models
    Songsri-in, Kritaphat
    Trigeorgis, George
    Zafeiriou, Stefanos
    PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, : 218 - 225
  • [42] Part-Based Object Detection Using Cascades of Boosted Classifiers
    Xia, Xiaozhen
    Yang, Wuyi
    Li, Heping
    Zhang, Shuwu
    COMPUTER VISION - ACCV 2009, PT II, 2010, 5995 : 556 - +
  • [43] Spatial priors for part-based recognition using statistical models
    Crandall, D
    Felzenszwalb, P
    Huttenlocher, D
    2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Vol 1, Proceedings, 2005, : 10 - 17
  • [44] A Part-Based Gaussian Reweighted Approach for Occluded Vehicle Detection
    Huang, Yu
    Zhou, Zhiheng
    Wang, Tianlei
    Cao, Qian
    Huang, Junchu
    Chen, Zirong
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (05) : 1097 - 1101
  • [45] Part-based Deep Network for Pedestrian Detection in Surveillance Videos
    Chen, Qi
    Jiang, Wenhui
    Zhao, Yanyun
    Zhao, Zhicheng
    2015 VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2015,
  • [46] Part-based adaptive detection of workpieces using differential evolution
    Liu, Wei
    Wang, Peng
    Qiao, Hong
    SIGNAL PROCESSING, 2012, 92 (02) : 301 - 307
  • [47] Multiple Instance Feature for Robust Part-based Object Detection
    Lin, Zhe
    Hua, Gang
    Davis, Larry S.
    CVPR: 2009 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-4, 2009, : 405 - +
  • [48] Carried Baggage Detection and Classification Using Part-Based Model
    Wahyono
    Jo, Kang-Hyun
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, ICIC 2015, PT III, 2015, 9227 : 289 - 296
  • [49] Improved part-based human detection using depth information
    Yamashita, Takayoshi
    Ikemura, Sho
    Fujiyoshi, Hironobu
    Iwahori, Yuji
    IEEJ Transactions on Industry Applications, 2011, 131 (04): : 475 - 481
  • [50] 3D Part-Based Sparse Tracker with Automatic Synchronization and Registration
    Bibi, Adel
    Zhang, Tianzhu
    Ghanem, Bernard
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1439 - 1448