A MULTI-CHANNEL FUSION FRAMEWORK FOR AUDIO EVENT DETECTION

Cited by: 0
Authors
Phan, Huy [1, 2]
Maass, Marco [1 ]
Hertel, Lars [1 ]
Mazur, Radoslaw [1 ]
Mertins, Alfred [1 ]
Affiliations
[1] Med Univ Lubeck, Inst Signal Proc, D-23538 Lubeck, Germany
[2] Med Univ Lubeck, Grad Sch Comp Med & Life Sci, D-23538 Lubeck, Germany
Keywords
Acoustic event detection; classification; multi-channel fusion; regression forests
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
We propose a simple yet efficient multi-channel fusion framework for joint acoustic event detection and classification. On each individual channel, the joint problem is posed as a regression problem that estimates event onset and offset positions. As an intermediate result, we also obtain posterior probabilities that measure the confidence that an event onset or offset is present at a given temporal position. This makes fusion straightforward: the posterior probabilities of the different channels are accumulated, and the detection hypotheses are then determined from the summed posterior probabilities. Although the proposed fusion framework is simple and natural, it significantly outperforms all single-channel baseline systems on the ITC-Irst database. We also show that adding channels one by one to the fusion system yields performance improvements, and that the fusion system always performs better than its individual-channel counterparts.
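The fusion step described in the abstract (accumulating per-channel posterior probabilities, then deciding from the sum) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `fuse_posteriors`, `pick_peak`, and `detect_event` are hypothetical, and the decision rule (take the peak of each summed curve and accept it if it reaches a threshold) is an assumption made for illustration, since the abstract does not say how the hypotheses are derived from the summed posteriors.

```python
from typing import List, Optional, Sequence, Tuple


def fuse_posteriors(channels: Sequence[Sequence[float]]) -> List[float]:
    """Sum per-channel posterior curves position by position."""
    if not channels:
        raise ValueError("at least one channel is required")
    length = len(channels[0])
    if any(len(curve) != length for curve in channels):
        raise ValueError("all channels must cover the same temporal positions")
    return [sum(values) for values in zip(*channels)]


def pick_peak(curve: Sequence[float]) -> int:
    """Return the index of the largest value (first one on ties)."""
    return max(range(len(curve)), key=curve.__getitem__)


def detect_event(
    onset_channels: Sequence[Sequence[float]],
    offset_channels: Sequence[Sequence[float]],
    threshold: float,
) -> Optional[Tuple[int, int]]:
    """Fuse channels and return (onset, offset) indices, or None.

    Assumption: a hypothesis is accepted only if both summed peaks reach
    `threshold` and the offset lies after the onset.
    """
    onset_fused = fuse_posteriors(onset_channels)
    offset_fused = fuse_posteriors(offset_channels)
    onset = pick_peak(onset_fused)
    offset = pick_peak(offset_fused)
    if onset_fused[onset] < threshold or offset_fused[offset] < threshold:
        return None
    if offset <= onset:
        return None
    return onset, offset
```

With two channels whose onset curves both peak at position 1 and whose offset curves both peak at position 3, the summed curves peak at the same positions, so `detect_event` returns `(1, 3)` for a threshold the summed peaks exceed and `None` for a threshold they do not reach.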
Pages: 5