Dataset for polyphonic sound event detection tasks in urban soundscapes: The synthetic polyphonic ambient sound source (SPASS) dataset

被引:1
|
作者
Viveros-Munoz, Rhoddy [1 ]
Huijse, Pablo [2 ,3 ]
Vargas, Victor [1 ]
Espejo, Diego [1 ]
Poblete, Victor [1 ]
Arenas, Jorge P. [1 ]
Vernier, Matthieu [2 ]
Vergara, Diego [1 ]
Suarez, Enrique [1 ]
机构
[1] Univ Austral Chile, Inst Acust, Gen Lagos 2086, Valdivia, Chile
[2] Univ Austral Chile, Inst Informat, Gen Lagos 2086, Valdivia, Chile
[3] Millennium Inst Astrophys, Nuncio Monsenor Sotero Sanz 100, Santiago, Chile
来源
DATA IN BRIEF | 2023年 / 50卷
关键词
Deep learning; Polyphonic sound event detection; Soundscape; Acoustic virtual reality;
D O I
10.1016/j.dib.2023.109552
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This paper presents the Synthetic Polyphonic Ambient Sound Source (SPASS) dataset, a publicly available synthetic polyphonic audio dataset. SPASS was designed to train deep neural networks effectively for polyphonic sound event detection (PSED) in urban soundscapes. SPASS contains synthetic recordings from five virtual environments: park, square, street, market, and waterfront. The data collection process consisted of the curation of different monophonic sound sources following a hierarchical class taxonomy, the configuration of the virtual environments with the RAVEN software library, the generation of all stimuli, and the processing of this data to create synthetic recordings of polyphonic sound events with their associated metadata. The dataset contains 50 0 0 audio clips per environment, i.e., 25,0 0 0 stimuli of 10 s each, virtually recorded at a sampling rate of 44.1 kHz. This effort is part of the project "Integrated System for the Analysis of Environmental Sound Sources: FuSA System" in the city of Valdivia, Chile, which aims to develop a system for detecting and classifying environmental sound sources through deep Artificial Neural Network (ANN) models. (c) 2023 The Authors. Published by Elsevier Inc. This is an open access article under the CC BY license ( http://creativecommons.org/licenses/by/4.0/ )
引用
收藏
页数:8
相关论文
共 50 条
  • [21] EVALUATION OF POST-PROCESSING ALGORITHMS FOR POLYPHONIC SOUND EVENT DETECTION
    Cances, Leo
    Guyot, Patrice
    Pellegrini, Thomas
    2019 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2019, : 318 - 322
  • [22] BLSTM-HMM HYBRID SYSTEM COMBINED WITH SOUND ACTIVITY DETECTION NETWORK FOR POLYPHONIC SOUND EVENT DETECTION
    Hayashi, Tomoki
    Watanabe, Shinji
    Toda, Tomoki
    Hori, Takaaki
    Le Roux, Jonathan
    Takeda, Kazuya
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 766 - 770
  • [23] Polyphonic Sound Event Detection Using Multi Label Deep Neural Networks
    Cakir, Emre
    Heittola, Toni
    Huttunen, Heikki
    Virtanen, Tuomas
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [24] Filterbank Learning for Deep Neural Network Based Polyphonic Sound Event Detection
    Cakir, Emre
    Ozan, Ezgi Can
    Virtanen, Tuomas
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 3399 - 3406
  • [25] POLYPHONIC SOUND EVENT DETECTION USING TRANSPOSED CONVOLUTIONAL RECURRENT NEURAL NETWORK
    Chatterjee, Chandra Churh
    Mulimani, Manjunath
    Koolagudi, Shashidhar G.
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 661 - 665
  • [26] RECURRENT NEURAL NETWORKS FOR POLYPHONIC SOUND EVENT DETECTION IN REAL LIFE RECORDINGS
    Parascandolo, Giambattista
    Huttunen, Heikki
    Virtanen, Tuomas
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 6440 - 6444
  • [27] Polyphonic sound event localization and detection using channel-wise FusionNet
    Spoorthy, V.
    Kooolagudi, Shashidhar G.
    APPLIED INTELLIGENCE, 2024, 54 (06) : 5015 - 5026
  • [28] A FIRST ATTEMPT AT POLYPHONIC SOUND EVENT DETECTION USING CONNECTIONIST TEMPORAL CLASSIFICATION
    Wang, Yun
    Metze, Florian
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2986 - 2990
  • [29] Polyphonic sound event localization and detection based on Multiple Attention Fusion ResNet
    Zhang S.
    Zhang Y.
    Liao Y.
    Pang K.
    Wan Z.
    Zhou S.
    Mathematical Biosciences and Engineering, 2024, 21 (02) : 2004 - 2023
  • [30] SOUND EVENT DETECTION AND SEPARATION: A BENCHMARK ON DESED SYNTHETIC SOUNDSCAPES
    Turpault, Nicolas
    Serizel, Romain
    Wisdom, Scott
    Erdogan, Hakan
    Hershey, John R.
    Fonseca, Eduardo
    Seetharaman, Prem
    Salamon, Justin
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 840 - 844