Dataset for polyphonic sound event detection tasks in urban soundscapes: The synthetic polyphonic ambient sound source (SPASS) dataset

被引:1
|
作者
Viveros-Munoz, Rhoddy [1 ]
Huijse, Pablo [2 ,3 ]
Vargas, Victor [1 ]
Espejo, Diego [1 ]
Poblete, Victor [1 ]
Arenas, Jorge P. [1 ]
Vernier, Matthieu [2 ]
Vergara, Diego [1 ]
Suarez, Enrique [1 ]
机构
[1] Univ Austral Chile, Inst Acust, Gen Lagos 2086, Valdivia, Chile
[2] Univ Austral Chile, Inst Informat, Gen Lagos 2086, Valdivia, Chile
[3] Millennium Inst Astrophys, Nuncio Monsenor Sotero Sanz 100, Santiago, Chile
来源
DATA IN BRIEF | 2023年 / 50卷
关键词
Deep learning; Polyphonic sound event detection; Soundscape; Acoustic virtual reality;
D O I
10.1016/j.dib.2023.109552
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This paper presents the Synthetic Polyphonic Ambient Sound Source (SPASS) dataset, a publicly available synthetic polyphonic audio dataset. SPASS was designed to train deep neural networks effectively for polyphonic sound event detection (PSED) in urban soundscapes. SPASS contains synthetic recordings from five virtual environments: park, square, street, market, and waterfront. The data collection process consisted of the curation of different monophonic sound sources following a hierarchical class taxonomy, the configuration of the virtual environments with the RAVEN software library, the generation of all stimuli, and the processing of this data to create synthetic recordings of polyphonic sound events with their associated metadata. The dataset contains 50 0 0 audio clips per environment, i.e., 25,0 0 0 stimuli of 10 s each, virtually recorded at a sampling rate of 44.1 kHz. This effort is part of the project "Integrated System for the Analysis of Environmental Sound Sources: FuSA System" in the city of Valdivia, Chile, which aims to develop a system for detecting and classifying environmental sound sources through deep Artificial Neural Network (ANN) models. (c) 2023 The Authors. Published by Elsevier Inc. This is an open access article under the CC BY license ( http://creativecommons.org/licenses/by/4.0/ )
引用
收藏
页数:8
相关论文
共 50 条
  • [1] The SPASS dataset: A new synthetic polyphonic dataset with spatiotemporal labels of sound sources
    Viveros-Munoz, Rhoddy
    Huijse, Pablo
    Vargas, Victor
    Espejo, Diego
    Poblete, Victor
    Arenas, Jorge P.
    Vernier, Matthieu
    Vergara, Diego
    Suarez, Enrique
    APPLIED ACOUSTICS, 2023, 214
  • [2] Metrics for Polyphonic Sound Event Detection
    Mesaros, Annamaria
    Heittola, Toni
    Virtanen, Tuomas
    APPLIED SCIENCES-BASEL, 2016, 6 (06):
  • [3] Event Specific Attention for Polyphonic Sound Event Detection
    Sundar, Harshavardhan
    Sun, Ming
    Wang, Chao
    INTERSPEECH 2021, 2021, : 566 - 570
  • [4] Augmented Strategy For Polyphonic Sound Event Detection
    Wang, Bolun
    Fu, Zhong-Hua
    Wu, Hao
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1496 - 1500
  • [5] A Comprehensive Review of Polyphonic Sound Event Detection
    Chan, T. K.
    Chin, Cheng Siong
    IEEE ACCESS, 2020, 8 : 103339 - 103373
  • [6] A Capsule based Approach for Polyphonic Sound Event Detection
    Liu, Yaming
    Tang, Jian
    Song, Yan
    Dai, Lirong
    2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 1853 - 1857
  • [7] A survey of Deep Learning for Polyphonic Sound event detection
    Dang, An
    Vu, Toan H.
    Wang, Jia-Ching
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON ORANGE TECHNOLOGIES (ICOT), 2017, : 75 - 78
  • [8] USING SEQUENTIAL INFORMATION IN POLYPHONIC SOUND EVENT DETECTION
    Huang, Guangpu
    Heittola, Toni
    Virtanen, Tuomas
    2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 291 - 295
  • [9] PEER COLLABORATIVE LEARNING FOR POLYPHONIC SOUND EVENT DETECTION
    Endo, Hayato
    Nishizaki, Hiromitsu
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 826 - 830
  • [10] POLYPHONIC SOUND EVENT AND SOUND ACTIVITY DETECTION: A MULTI-TASK APPROACH
    Pankajakshan, Arjun
    Bear, Helen L.
    Benetos, Emmanouil
    2019 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2019, : 323 - 327