AUTOMATIC ACOUSTIC SIREN DETECTION IN TRAFFIC NOISE BY PART-BASED MODELS

被引:0
|
作者
Schroeder, Jens [1 ]
Goetze, Stefan [1 ]
Gruetzmacher, Volker [2 ]
Anemueller, Joern [1 ,3 ]
机构
[1] Hearing Speech & Audio Technol, Fraunhofer IDMT, D-26129 Oldenburg, Germany
[2] Adam Opel AG, D-65423 Russelsheim, Germany
[3] Carl von Ossietzky Univ Oldenburg, Dept Phys, D-26111 Oldenburg, Germany
来源
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2013年
关键词
acoustic event detection (AED); part-based model (PBM); siren detection;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
State-of-the-art classifiers like hidden Markov models (HMMs) in combination with mel-frequency cepstral coefficients (MFCCs) are flexible in time but rigid in the spectral dimension. In contrast, part-based models (PBMs) originally proposed in computer vision consist of parts in a fully deformable configuration. The present contribution proposes to employ PBMs in the spectro-temporal domain for detection of emergency siren sounds in traffic noise, resulting in a classifier that is robust to shifts in frequency induced, e.g., by Doppler-shift effects. Two improvements over standard machine learning techniques for PBM estimation are proposed: (i) Spectro-temporal part ("appearance") extraction is initialized by interest point detection instead of random initialization and (ii) a discriminative training approach in addition to standard generative training is implemented. Evaluation with self-recorded police sirens and traffic noise gathered on-line demonstrates that PBMs are successful in acoustic siren detection. One hand-labeled and two machine learned PBMs are compared to standard HMMs employing mel-spectrograms and MFCCs in clean and multi condition (multiple SNR) training settings. Results show that in clean condition training, hand-labeled PBMs and HMMs outperform machine-learned PBMs already for test data with moderate additive noise. In multi condition training, the machine learned PBMs outperform HMMs on most SNRs, achieving high accuracies and being nearly optimal up to 5 dB SNR. Thus, our simulation results show that PBMs are a promising approach for acoustic event detection (AED).
引用
收藏
页码:493 / 497
页数:5
相关论文
共 50 条
  • [31] Fall Detection with Part-Based Approach for Indoor Environment
    Fathima, A.
    Vaidehi, V.
    Selvaraj, K.
    INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2014, 10 (04) : 51 - 69
  • [32] Online Learning and Detection with Part-based Circulant Structure
    Akin, Osman
    Mikolajczyk, Krystian
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 4229 - 4233
  • [33] PEDESTRIAN DETECTION VIA PART-BASED TOPOLOGY MODEL
    Gao, Wen
    Chen, Xiaogang
    Ye, Qixiang
    Jiao, Jianbin
    2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 445 - 448
  • [34] Part-based deformable object detection with a single sketch
    Das Bhattacharjee, Sreyasee
    Mittal, Anurag
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2015, 139 : 73 - 87
  • [35] Estimation of the quality of an urban acoustic environment based on traffic noise evaluation models
    Di, Hui
    Liu, Xingpeng
    Zhang, Jiquan
    Tong, Zhijun
    Ji, Meichen
    Li, Fengxu
    Feng, Tianji
    Ma, Qing
    APPLIED ACOUSTICS, 2018, 141 : 115 - 124
  • [36] Weakly Supervised Learning of Deformable Part-Based Models for Object Detection via Region Proposals
    Tang, Yuxing
    Wang, Xiaofang
    Dellandrea, Emmanuel
    Chen, Liming
    IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (02) : 393 - 407
  • [37] Learning Semantic Part-Based Models from Google Images
    Modolo, Davide
    Ferrari, Vittorio
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (06) : 1502 - 1509
  • [38] Qualitative part-based models in content-based image retrieval
    Bilodeau, Guillaume-Alexandre
    Bergevin, Robert
    MACHINE VISION AND APPLICATIONS, 2007, 18 (05) : 275 - 287
  • [39] Hierarchical online domain adaptation of deformable part-based models
    Xu, Jiaolong
    Vazquez, David
    Mikolajczyk, Krystian
    Lopez, Antonio M.
    2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2016, : 5536 - 5541
  • [40] Qualitative part-based models in content-based image retrieval
    Guillaume-Alexandre Bilodeau
    Robert Bergevin
    Machine Vision and Applications, 2007, 18 : 275 - 287