Sound Event Localization and Detection Based on Deep Learning

Authors
Zhao, Dada [1, 2]
Ding, Kai [2]
Qi, Xiaogang [1]
Chen, Yu [2]
Feng, Hailin [1]
Affiliations
[1] Xidian Univ, Sch Math & Stat, Xian 710071, Peoples R China
[2] Sci & Technol Near Surface Detect Lab, Wuxi 214035, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Location awareness; Feature extraction; Neural networks; Convolutional neural networks; Reverberation; Prediction algorithms; Training; sound event localization and detection (SELD); deep learning; convolutional recursive neural network (CRNN); channel attention mechanism; DATA AUGMENTATION; NEURAL-NETWORKS; SPECTRUM;
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Acoustic source localization (ASL) and sound event detection (SED) are two widely pursued, independent research fields. In recent years, sound event localization and detection (SELD) has become a very active research topic because it provides a more complete spatial and temporal representation of the sound field. This paper presents a deep learning-based algorithm for localizing and detecting multiple overlapping sound events in three-dimensional space. The log-Mel spectrum and the generalized cross-correlation spectrum are concatenated along the channel dimension as input features. A trained neural network classifies and regresses these features in parallel to obtain the sound recognition and localization results, respectively. A channel attention mechanism is also introduced into the network to selectively enhance features containing essential information and suppress uninformative ones. Finally, a thorough comparison confirms the efficiency and effectiveness of the proposed SELD algorithm. Field experiments show that the proposed algorithm is robust to reverberation and varying environments and achieves higher recognition and localization accuracy than the baseline method.
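The feature construction described in the abstract can be illustrated with a short NumPy sketch. This is not the authors' implementation: the function names, the sample rate, the FFT size, the number of Mel bands, the use of the PHAT weighting for the generalized cross-correlation, and the squeeze-and-excitation form of the channel attention are all assumptions made for illustration. In the paper the attention sits inside the network; here it is applied directly to the stacked features only to show the mechanism.

```python
import numpy as np

def stft(x, n_fft=512, hop=256):
    """Hann-windowed STFT; returns a (frames, n_fft // 2 + 1) complex array."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack([x[i * hop:i * hop + n_fft] * window for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1)

def mel_filterbank(n_mels, n_fft, sr):
    """Triangular Mel filters, shape (n_mels, n_fft // 2 + 1)."""
    hz_to_mel = lambda f: 2595.0 * np.log10(1.0 + f / 700.0)
    mel_to_hz = lambda m: 700.0 * (10.0 ** (m / 2595.0) - 1.0)
    hz_pts = mel_to_hz(np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_mels + 2))
    freqs = np.linspace(0.0, sr / 2.0, n_fft // 2 + 1)
    fb = np.zeros((n_mels, n_fft // 2 + 1))
    for m in range(n_mels):
        lo, ctr, hi = hz_pts[m], hz_pts[m + 1], hz_pts[m + 2]
        rising = (freqs - lo) / (ctr - lo)
        falling = (hi - freqs) / (hi - ctr)
        fb[m] = np.clip(np.minimum(rising, falling), 0.0, None)
    return fb

def gcc_phat(spec_a, spec_b, n_lags):
    """Per-frame GCC with PHAT weighting (assumed), keeping n_lags centered lags."""
    cross = spec_a * np.conj(spec_b)
    cross = cross / (np.abs(cross) + 1e-8)   # keep phase only
    cc = np.fft.irfft(cross, axis=1)         # (frames, n_fft)
    half = n_lags // 2
    # Negative lags live at the end of the IFFT output; move them to the front.
    return np.concatenate([cc[:, -half:], cc[:, :n_lags - half]], axis=1)

def seld_features(audio, sr=24000, n_fft=512, hop=256, n_mels=64):
    """audio: (n_mics, n_samples) -> (n_mics + n_pairs, frames, n_mels)."""
    specs = [stft(ch, n_fft, hop) for ch in audio]
    fb = mel_filterbank(n_mels, n_fft, sr)
    logmel = [np.log(np.abs(s) ** 2 @ fb.T + 1e-8) for s in specs]
    n = len(specs)
    # One GCC map per microphone pair, with as many lags as Mel bands so that
    # both feature types share a shape and can be stacked as channels.
    gcc = [gcc_phat(specs[i], specs[j], n_mels)
           for i in range(n) for j in range(i + 1, n)]
    return np.stack(logmel + gcc)

def channel_attention(feat, w1, w2):
    """Squeeze-and-excitation-style gating over the channel axis (assumed form)."""
    squeezed = feat.mean(axis=(1, 2))              # global average pool, (C,)
    hidden = np.maximum(w1 @ squeezed, 0.0)        # bottleneck + ReLU
    gate = 1.0 / (1.0 + np.exp(-(w2 @ hidden)))    # sigmoid weights in (0, 1)
    return feat * gate[:, None, None], gate

# Example with synthetic four-microphone noise (hypothetical data).
rng = np.random.default_rng(0)
audio = rng.standard_normal((4, 24000))
features = seld_features(audio)
c = features.shape[0]
w1 = rng.standard_normal((c // 2, c)) * 0.1        # untrained, random weights
w2 = rng.standard_normal((c, c // 2)) * 0.1
weighted, gate = channel_attention(features, w1, w2)
print(features.shape, gate.shape)
```

With four microphones this gives four log-Mel channels and six GCC channels. In a trained network the gating weights would be learned, so that informative channels are scaled up and uninformative ones are scaled down.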
Pages: 294-301
Page count: 8
Related papers
50 in total
  • [11] Sound Event Detection and Localization with Distance Estimation
    Krause, Daniel Aleksander
    Politis, Archontis
    Mesaros, Annamaria
    32ND EUROPEAN SIGNAL PROCESSING CONFERENCE, EUSIPCO 2024, 2024, : 286 - 290
  • [12] Sound learning–based event detection for acoustic surveillance sensors
    Jeong-Sik Park
    Seok-Hoon Kim
    Multimedia Tools and Applications, 2020, 79 : 16127 - 16139
  • [13] A Deep Learning Based Sound Event Location and Detection Algorithm Using Convolutional Recurrent Neural Network
    Zhu, Hongxiang
    Yan, Jun
    2022 INTERNATIONAL CONFERENCE ON COMPUTER, INFORMATION AND TELECOMMUNICATION SYSTEMS, CITS, 2022, : 25 - 30
  • [14] Active Learning for Sound Event Detection
    Shuyang Zhao
    Heittola, Toni
    Virtanen, Tuomas
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 2895 - 2905
  • [15] Noise Robust Sound Event Detection Using Deep Learning and Audio Enhancement
    Wan, Tongtang
    Zhou, Yi
    Ma, Yongbao
    Liu, Hongqing
    2019 IEEE 19TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT 2019), 2019,
  • [16] Soccer Video Event Detection Based on Deep Learning
    Yu, Junqing
    Lei, Aiping
    Hu, Yangliu
    MULTIMEDIA MODELING, MMM 2019, PT II, 2019, 11296 : 377 - 389
  • [17] SOUND EVENT DETECTION BASED ON CURRICULUM LEARNING CONSIDERING LEARNING DIFFICULTY OF EVENTS
    Tonami, Noriyuki
    Imoto, Keisuke
    Okamoto, Yuki
    Fukumori, Takahiro
    Yamashita, Yoichi
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 875 - 879
  • [18] Deep Learning-Based Dereverberation for Sound Source Localization with Beamforming
    Zhai, Qingbo
    Ning, Fangli
    Hou, Hongjie
    Wei, Juan
    Su, Zhaojing
    JOURNAL OF THEORETICAL AND COMPUTATIONAL ACOUSTICS, 2024, 32 (01):
  • [19] CRATI: Contrastive representation-based multimodal sound event localization and detection
    Wu, Shichao
    Wang, Yongru
    Jiang, Yushan
    Zhang, Qianyi
    Liu, Jingtai
    KNOWLEDGE-BASED SYSTEMS, 2024, 305
  • [20] Polyphonic sound event localization and detection based on Multiple Attention Fusion ResNet
    Zhang S.
    Zhang Y.
    Liao Y.
    Pang K.
    Wan Z.
    Zhou S.
    Mathematical Biosciences and Engineering, 2024, 21 (02) : 2004 - 2023