Sound Event Detection: A tutorial

Cited by: 94
Authors
Mesaros, Annamaria [1]
Heittola, Toni [2]
Virtanen, Tuomas [3]
Plumbley, Mark D. [4, 5]
Affiliations
[1] Tampere Univ, Machine Listening Grp, Korkeakoulunkatu 33014, Finland
[2] Tampere Univ, Korkeakoulunkatu 33014, Finland
[3] Tampere Univ, Audio Res Grp, Korkeakoulunkatu 33014, Finland
[4] Univ Surrey, Ctr Vis Speech & Signal Proc, Signal Proc, Guildford GU2 7XH, Surrey, England
[5] Univ Surrey, Sch Comp Sci & Elect Engn, Guildford GU2 7XH, Surrey, England
Funding
Engineering and Physical Sciences Research Council (UK); Academy of Finland; European Research Council
Keywords
AUDIO
DOI
10.1109/MSP.2021.3090678
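Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract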
Imagine standing on a street corner in the city. With your eyes closed you can hear and recognize a succession of sounds: cars passing by, people speaking, their footsteps when they walk by, and the continuous falling of rain. The recognition of all these sounds and interpretation of the perceived scene as a city street soundscape comes naturally to humans. It is, however, the result of years of "training": encountering and learning associations among the vast varieties of sounds in everyday life, the sources producing these sounds, and the names given to them.
Pages: 67-83
Number of pages: 17
Related papers
50 in total
  • [21] DiffSED: Sound Event Detection with Denoising Diffusion
    Bhosale, Swapnil
    Nag, Sauradip
    Kanojia, Diptesh
    Deng, Jiankang
    Zhu, Xiatian
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 2, 2024: 792-800
  • [22] SOUND EVENT DETECTION WITH ADAPTIVE FREQUENCY SELECTION
    Wang, Zhepei
    Casebeer, Jonah
    Clemmitt, Adam
    Tzinis, Efthymios
    Smaragdis, Paris
    2021 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2021: 41-45
  • [23] Sound Event Detection in the DCASE 2017 Challenge
    Mesaros, Annamaria
    Diment, Aleksandr
    Elizalde, Benjamin
    Heittola, Toni
    Vincent, Emmanuel
    Raj, Bhiksha
    Virtanen, Tuomas
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (06): 992-1006
  • [24] Augmented Strategy For Polyphonic Sound Event Detection
    Wang, Bolun
    Fu, Zhong-Hua
    Wu, Hao
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019: 1496-1500
  • [25] Context-dependent sound event detection
    Heittola, Toni
    Mesaros, Annamaria
    Eronen, Antti
    Virtanen, Tuomas
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2013
  • [26] FEW-SHOT SOUND EVENT DETECTION
    Wang, Yu
    Salamon, Justin
    Bryan, Nicholas J.
    Bello, Juan Pablo
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020: 81-85
  • [27] MULTIMODAL EVALUATION METHOD FOR SOUND EVENT DETECTION
    Modaresi, Seyed M. R.
    Osmani, Aomar
    Razzazi, Mohammadreza
    Chibani, Abdelghani
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022: 1026-1030
  • [28] Sound Event Detection and Localization with Distance Estimation
    Krause, Daniel Aleksander
    Politis, Archontis
    Mesaros, Annamaria
    32ND EUROPEAN SIGNAL PROCESSING CONFERENCE, EUSIPCO 2024, 2024: 286-290
  • [29] A FRAMEWORK FOR THE ROBUST EVALUATION OF SOUND EVENT DETECTION
    Bilen, Cagdas
    Ferroni, Giacomo
    Tuveri, Francesco
    Azcarreta, Juan
    Krstulovic, Sacha
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020: 61-65
  • [30] Context-dependent sound event detection
    Toni Heittola
    Annamaria Mesaros
    Antti Eronen
    Tuomas Virtanen
    EURASIP Journal on Audio, Speech, and Music Processing, 2013