Dataset for polyphonic sound event detection tasks in urban soundscapes: The synthetic polyphonic ambient sound source (SPASS) dataset

被引:1
|
作者
Viveros-Munoz, Rhoddy [1 ]
Huijse, Pablo [2 ,3 ]
Vargas, Victor [1 ]
Espejo, Diego [1 ]
Poblete, Victor [1 ]
Arenas, Jorge P. [1 ]
Vernier, Matthieu [2 ]
Vergara, Diego [1 ]
Suarez, Enrique [1 ]
机构
[1] Univ Austral Chile, Inst Acust, Gen Lagos 2086, Valdivia, Chile
[2] Univ Austral Chile, Inst Informat, Gen Lagos 2086, Valdivia, Chile
[3] Millennium Inst Astrophys, Nuncio Monsenor Sotero Sanz 100, Santiago, Chile
来源
DATA IN BRIEF | 2023年 / 50卷
关键词
Deep learning; Polyphonic sound event detection; Soundscape; Acoustic virtual reality;
D O I
10.1016/j.dib.2023.109552
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This paper presents the Synthetic Polyphonic Ambient Sound Source (SPASS) dataset, a publicly available synthetic polyphonic audio dataset. SPASS was designed to train deep neural networks effectively for polyphonic sound event detection (PSED) in urban soundscapes. SPASS contains synthetic recordings from five virtual environments: park, square, street, market, and waterfront. The data collection process consisted of the curation of different monophonic sound sources following a hierarchical class taxonomy, the configuration of the virtual environments with the RAVEN software library, the generation of all stimuli, and the processing of this data to create synthetic recordings of polyphonic sound events with their associated metadata. The dataset contains 50 0 0 audio clips per environment, i.e., 25,0 0 0 stimuli of 10 s each, virtually recorded at a sampling rate of 44.1 kHz. This effort is part of the project "Integrated System for the Analysis of Environmental Sound Sources: FuSA System" in the city of Valdivia, Chile, which aims to develop a system for detecting and classifying environmental sound sources through deep Artificial Neural Network (ANN) models. (c) 2023 The Authors. Published by Elsevier Inc. This is an open access article under the CC BY license ( http://creativecommons.org/licenses/by/4.0/ )
引用
收藏
页数:8
相关论文
共 50 条
  • [31] POLYPHONIC SOUND EVENT DETECTION USING CONVOLUTIONAL BIDIRECTIONAL LSTM AND SYNTHETIC DATA-BASED TRANSFER LEARNING
    Jung, Seokwon
    Park, Jungbae
    Lee, Sangwan
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 885 - 889
  • [32] A TRACK-WISE ENSEMBLE EVENT INDEPENDENT NETWORK FOR POLYPHONIC SOUND EVENT LOCALIZATION AND DETECTION
    Hu, Jinbo
    Cao, Yin
    Wu, Ming
    Kong, Qiuqiang
    Yang, Feiran
    Plumbley, Mark D.
    Yang, Jun
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 9196 - 9200
  • [33] Robust polyphonic sound event detection by using multi frame size denoising autoencoder
    Zhou, Jianchao
    Chen, Xiaoou
    Yang, Deshun
    2018 IEEE 20TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2018,
  • [34] Comparative Assessment of Data Augmentation for Semi-Supervised Polyphonic Sound Event Detection
    Delphin-Poulat, Lionel
    Nicol, Rozenn
    Plapous, Cyril
    Peron, Katell
    PROCEEDINGS OF THE 2020 27TH CONFERENCE OF OPEN INNOVATIONS ASSOCIATION (FRUCT), 2020, : 46 - 53
  • [35] Convolutional Neural Networks with Multi-task Loss for Polyphonic Sound Event Detection
    Liu, Huang
    Wang, Xiu
    Guan, Fa-Qian
    Hu, Jin-Sen
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE2018), 2018,
  • [36] A Survey of Polyphonic Sound Event Detection Based on Non-negative Matrix Factorization
    Manh-Quan Bui
    Viet-Hang Duong
    Mathulaprangsan, Seksan
    Bach-Tung Pham
    Lee, Wei-Jing
    Wang, Jia-Ching
    2016 INTERNATIONAL COMPUTER SYMPOSIUM (ICS), 2016, : 351 - 354
  • [37] Polyphonic Sound Event Detection Using Modified Recurrent Temporal Pyramid Neural Network
    Venkatesh, Spoorthy
    Koolagudi, Shashidhar G.
    COMPUTER VISION AND IMAGE PROCESSING, CVIP 2023, PT I, 2024, 2009 : 554 - 564
  • [38] Adaptive Memory-Controlled Self-Attention for Polyphonic Sound Event Detection
    Wang, Mei
    Yao, Yu
    Qiu, Hongbin
    Song, Xiyu
    SYMMETRY-BASEL, 2022, 14 (02):
  • [39] ARCHEO: A Dataset for Sound Event Detection in Areas of Touristic Interest
    Psallidas, Theodoros
    Mitsou, Alexander
    Pikramenos, George
    Spyrou, Evaggelos
    Giannakopoulos, Theodore
    2020 15TH INTERNATIONAL WORKSHOP ON SEMANTIC AND SOCIAL MEDIA ADAPTATION AND PERSONALIZATION (SMAP 2020), 2020, : 39 - 44
  • [40] Polyphonic Sound Event Detection Using Temporal-Frequency Attention and Feature Space Attention
    Jin, Ye
    Wang, Mei
    Luo, Liyan
    Zhao, Dinghao
    Liu, Zhanqi
    SENSORS, 2022, 22 (18)