Dataset for polyphonic sound event detection tasks in urban soundscapes: The synthetic polyphonic ambient sound source (SPASS) dataset

被引：1

作者：

Viveros-Munoz, Rhoddy ^{[1
]}

Huijse, Pablo ^{[2
,3
]}

Vargas, Victor ^{[1
]}

Espejo, Diego ^{[1
]}

Poblete, Victor ^{[1
]}

Arenas, Jorge P. ^{[1
]}

Vernier, Matthieu ^{[2
]}

Vergara, Diego ^{[1
]}

Suarez, Enrique ^{[1
]}

机构：

[1] Univ Austral Chile, Inst Acust, Gen Lagos 2086, Valdivia, Chile

[2] Univ Austral Chile, Inst Informat, Gen Lagos 2086, Valdivia, Chile

[3] Millennium Inst Astrophys, Nuncio Monsenor Sotero Sanz 100, Santiago, Chile

来源：

DATA IN BRIEF | 2023年 / 50卷

关键词：

Deep learning; Polyphonic sound event detection; Soundscape; Acoustic virtual reality;

D O I：

10.1016/j.dib.2023.109552

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

This paper presents the Synthetic Polyphonic Ambient Sound Source (SPASS) dataset, a publicly available synthetic polyphonic audio dataset. SPASS was designed to train deep neural networks effectively for polyphonic sound event detection (PSED) in urban soundscapes. SPASS contains synthetic recordings from five virtual environments: park, square, street, market, and waterfront. The data collection process consisted of the curation of different monophonic sound sources following a hierarchical class taxonomy, the configuration of the virtual environments with the RAVEN software library, the generation of all stimuli, and the processing of this data to create synthetic recordings of polyphonic sound events with their associated metadata. The dataset contains 50 0 0 audio clips per environment, i.e., 25,0 0 0 stimuli of 10 s each, virtually recorded at a sampling rate of 44.1 kHz. This effort is part of the project "Integrated System for the Analysis of Environmental Sound Sources: FuSA System" in the city of Valdivia, Chile, which aims to develop a system for detecting and classifying environmental sound sources through deep Artificial Neural Network (ANN) models. (c) 2023 The Authors. Published by Elsevier Inc. This is an open access article under the CC BY license ( http://creativecommons.org/licenses/by/4.0/ )

引用

页数：8

共 50 条

[41] Specialized Decision Surface and Disentangled Feature for Weakly-Supervised Polyphonic Sound Event Detection
Lin, Liwei
Wang, Xiangdong
Liu, Hong
Qian, Yueliang
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 1466 - 1478
[42] CONTRASTIVE LOSS BASED FRAME-WISE FEATURE DISENTANGLEMENT FOR POLYPHONIC SOUND EVENT DETECTION
Guan, Yadong
Han, Jiqing
Song, Hongwei
Song, Wenjie
Zheng, Guibin
Zheng, Tieran
He, Yongjun
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 1021 - 1025
[43] Sound Event Detection by Pseudo-Labeling in Weakly Labeled Dataset
Park, Chungho
Kim, Donghyeon
Ko, Hanseok
SENSORS, 2021, 21 (24)
[44] WEARABLE SELD DATASET: DATASET FOR SOUND EVENT LOCALIZATION AND DETECTION USING WEARABLE DEVICES AROUND HEAD
Nagatomo, Kento
Yasuda, Masahiro
Yatabe, Kohei
Saito, Shoichiro
Oikawa, Yasuhiro
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 156 - 160
[45] SALSA-LITE: A FAST AND EFFECTIVE FEATURE FOR POLYPHONIC SOUND EVENT LOCALIZATION AND DETECTION WITH MICROPHONE ARRAYS
Thi Ngoc Tho Nguyen
Jones, Douglas L.
Watcharasupat, Karn N.
Huy Phan
Gan, Woon-Seng
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 716 - 720
[46] A Transfer Learning Based Feature Extractor for Polyphonic Sound Event Detection Using Connectionist Temporal Classification
Wang, Yun
Metze, Florian
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3097 - 3101
[47] A CRNN System for Sound Event Detection Based on Gastrointestinal Sound Dataset Collected by Wearable Auscultation Devices
Zheng, Xue
Zhang, Chun
Chen, Ping
Zhao, Kang
Jiang, Hanjun
Jiang, Zhiwei
Pan, Huafeng
Wang, Zhihua
Jia, Wen
IEEE ACCESS, 2020, 8 (08): : 157892 - 157905
[48] A BENCHMARK OF STATE-OF-THE-ART SOUND EVENT DETECTION SYSTEMS EVALUATED ON SYNTHETIC SOUNDSCAPES
Ronchini, Francesca
Serizel, Romain
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1031 - 1035
[49] Teacher-Student Framework for Polyphonic Semi-supervised Sound Event Detection: Survey and Empirical Analysis
Diffallah, Zhor
Ykhlef, Hadjer
Bouarfa, Hafida
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2024, 15 (05)
[50] A Method Based on Dual Cross-Modal Attention and Parameter Sharing for Polyphonic Sound Event Localization and Detection
Lee, Sang-Hoon
Hwang, Jung-Wook
Song, Min-Hwan
Park, Hyung-Min
APPLIED SCIENCES-BASEL, 2022, 12 (10):

← 1 2 3 4 5 →