Dataset for polyphonic sound event detection tasks in urban soundscapes: The synthetic polyphonic ambient sound source (SPASS) dataset

Cited by: 1
Authors
Viveros-Munoz, Rhoddy [1]
Huijse, Pablo [2, 3]
Vargas, Victor [1]
Espejo, Diego [1]
Poblete, Victor [1]
Arenas, Jorge P. [1]
Vernier, Matthieu [2]
Vergara, Diego [1]
Suarez, Enrique [1]
Affiliations
[1] Univ Austral Chile, Inst Acust, Gen Lagos 2086, Valdivia, Chile
[2] Univ Austral Chile, Inst Informat, Gen Lagos 2086, Valdivia, Chile
[3] Millennium Inst Astrophys, Nuncio Monsenor Sotero Sanz 100, Santiago, Chile
Source
DATA IN BRIEF | 2023 / Vol. 50
Keywords
Deep learning; Polyphonic sound event detection; Soundscape; Acoustic virtual reality;
DOI
10.1016/j.dib.2023.109552
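Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification code
07; 0710; 09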
Abstract
This paper presents the Synthetic Polyphonic Ambient Sound Source (SPASS) dataset, a publicly available synthetic polyphonic audio dataset. SPASS was designed to train deep neural networks effectively for polyphonic sound event detection (PSED) in urban soundscapes. SPASS contains synthetic recordings from five virtual environments: park, square, street, market, and waterfront. The data collection process consisted of the curation of different monophonic sound sources following a hierarchical class taxonomy, the configuration of the virtual environments with the RAVEN software library, the generation of all stimuli, and the processing of this data to create synthetic recordings of polyphonic sound events with their associated metadata. The dataset contains 5000 audio clips per environment, i.e., 25,000 stimuli of 10 s each, virtually recorded at a sampling rate of 44.1 kHz. This effort is part of the project "Integrated System for the Analysis of Environmental Sound Sources: FuSA System" in the city of Valdivia, Chile, which aims to develop a system for detecting and classifying environmental sound sources through deep Artificial Neural Network (ANN) models. © 2023 The Authors. Published by Elsevier Inc. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
Pages: 8