Sound event triage: detecting sound events considering priority of classes

被引:0
|
作者
Tonami, Noriyuki [1 ]
Imoto, Keisuke [2 ]
机构
[1] Ritsumeikan Univ, Grad Sch Informat Sci & Engn, Kusatsu, Japan
[2] Doshisha Univ, Fac Sci & Engn, Dept Informat Syst Design, Kyotanabe, Japan
关键词
Sound event triage; Sound event detection; Loss-conditional training; ACOUSTIC EVENT;
D O I
10.1186/s13636-022-00270-7
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We propose a new task for sound event detection (SED): sound event triage (SET). The goal of SET is to detect an arbitrary number of high-priority event classes while allowing misdetections of low-priority event classes where the priority is given for each event class. In conventional methods of SED for targeting a specific sound event class, it is only possible to give priority to a single event class. Moreover, the level of priority is not adjustable, i.e, the conventional methods can use only types of target event class such as one-hot vector, as inputs. To flexibly control much information on the target event, the proposed SET exploits not only types of target sound but also the extent to which each target sound is detected with priority. To implement the detection of events with priority, we propose class-weighted training, in which loss functions and the network are stochastically weighted by the priority parameter of each class. As this is the first paper on SET, we particularly introduce an implementation of single target SET, which is a subtask of SET. The results of the experiments using the URBAN-SED dataset show that the proposed method of single target SET outperforms the conventional SED method by 8.70, 6.66, and 6.09 percentage points for "air_conditioner," "car_horn," and "street_music," respectively, in terms of the intersection-based F-score. For the average score of classes, the proposed methods increase the intersection-based F-score by up to 3.37 percentage points compared with the conventional SED and other target-class-conditioned models.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Sound event triage: detecting sound events considering priority of classes
    Noriyuki Tonami
    Keisuke Imoto
    EURASIP Journal on Audio, Speech, and Music Processing, 2023
  • [2] Categorization of sound events for automatic sound event classification
    Elizalde, Benjamin
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2023, 153 (03):
  • [3] SOUND EVENT DETECTION BASED ON CURRICULUM LEARNING CONSIDERING LEARNING DIFFICULTY OF EVENTS
    Tonami, Noriyuki
    Imoto, Keisuke
    Okamoto, Yuki
    Fukumori, Takahiro
    Yamashita, Yoichi
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 875 - 879
  • [4] SOUND CLASSES AND WRITING OF SOUND CLASSES IN PHONOMETRY
    BERGSVEI.S
    BIBLIOTHECA PHONETICA, 1968, (05): : 260 - &
  • [5] SOUND EVENT DETECTION BY MULTITASK LEARNING OF SOUND EVENTS AND SCENES WITH SOFT SCENE LABELS
    Imoto, Keisuke
    Tonami, Noriyuki
    Koizumi, Yuma
    Yasuda, Masahiro
    Yamanishi, Ryosuke
    Yamashita, Yoichi
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 621 - 625
  • [6] Cosine-similarity penalty to discriminate sound classes in weakly-supervised sound event detection
    Pellegrini, Thomas
    Cances, Leo
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [7] Sound and event
    Ortega Saenz, Fernanda
    REVISTA MUSICAL CHILENA, 2017, 71 (228) : 120 - 124
  • [8] Eavesdropping by the eye: detecting sound events and the culture of acoustic intelligence
    Bijsterveld, Karin
    SOUND STUDIES, 2023, 9 (02) : 233 - 252
  • [9] SnoreNet: Detecting Snore Events from Raw Sound Recordings
    Sun, Jingpeng
    Hu, Xiyuan
    Zhao, Yingying
    Sun, Shuchen
    Chen, Chen
    Peng, Silong
    2019 41ST ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2019, : 4977 - 4981
  • [10] Audiotory Movie Summarization by Detecting Scene Changes and Sound Events
    Lu, Tong
    Weng, Yangbing
    Wang, Gongyou
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 756 - 760