Sound event triage: detecting sound events considering priority of classes

Times cited: 0
Authors
Tonami, Noriyuki [1]
Imoto, Keisuke [2]
Affiliations
[1] Ritsumeikan Univ, Grad Sch Informat Sci & Engn, Kusatsu, Japan
[2] Doshisha Univ, Fac Sci & Engn, Dept Informat Syst Design, Kyotanabe, Japan
Keywords
Sound event triage; Sound event detection; Loss-conditional training; Acoustic event
DOI
10.1186/s13636-022-00270-7
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
We propose a new task for sound event detection (SED): sound event triage (SET). The goal of SET is to detect an arbitrary number of high-priority event classes while tolerating misdetections of low-priority event classes, where a priority is assigned to each event class. Conventional SED methods that target a specific sound event class can give priority to only a single event class. Moreover, the level of priority is not adjustable, i.e., the conventional methods can take only the type of the target event class, such as a one-hot vector, as input. To control the information on the target event more flexibly, the proposed SET exploits not only the types of target sounds but also the extent to which each target sound is detected with priority. To implement the detection of events with priority, we propose class-weighted training, in which the loss functions and the network are stochastically weighted by the priority parameter of each class. As this is the first paper on SET, we introduce, in particular, an implementation of single-target SET, which is a subtask of SET. Experiments on the URBAN-SED dataset show that the proposed single-target SET method outperforms the conventional SED method by 8.70, 6.66, and 6.09 percentage points for "air_conditioner," "car_horn," and "street_music," respectively, in terms of the intersection-based F-score. Averaged over classes, the proposed methods increase the intersection-based F-score by up to 3.37 percentage points compared with the conventional SED and other target-class-conditioned models.
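The loss side of the class-weighted training described in the abstract might look roughly like the sketch below. It is only an illustration under stated assumptions, not the authors' implementation. The abstract does not specify the sampling distribution or the loss form, so the uniform range, the binary cross-entropy, and the names `sample_priorities` and `weighted_bce` are all hypothetical. The abstract also says the network itself is weighted by the priority parameter; that part is omitted here.

```python
import math
import random


def sample_priorities(num_classes, target, low=1.0, high=10.0, rng=random):
    """Draw a per-class priority vector for single-target training.

    Assumption: the target class gets a weight sampled uniformly from
    [low, high]; every other class keeps weight 1.0.
    """
    weights = [1.0] * num_classes
    weights[target] = rng.uniform(low, high)
    return weights


def weighted_bce(probs, labels, weights, eps=1e-7):
    """Binary cross-entropy with each class term scaled by its priority.

    probs, labels: lists of frames, each a list of per-class values.
    weights: one priority weight per class.
    Returns the mean over all frame/class terms.
    """
    total = 0.0
    count = 0
    for frame_p, frame_y in zip(probs, labels):
        for p, y, w in zip(frame_p, frame_y, weights):
            p = min(max(p, eps), 1.0 - eps)  # clip to avoid log(0)
            total += -w * (y * math.log(p) + (1 - y) * math.log(1.0 - p))
            count += 1
    return total / count


# Hypothetical usage: 2 frames, 3 classes, class 1 is the prioritized target.
rng = random.Random(0)
weights = sample_priorities(3, target=1, rng=rng)
probs = [[0.9, 0.2, 0.1], [0.8, 0.6, 0.3]]
labels = [[1, 0, 0], [1, 1, 0]]
loss = weighted_bce(probs, labels, weights)
```

Resampling the weights at every training step is one way to read "stochastically weighted": the model sees many priority levels during training, so the priority can be chosen at inference time.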
Pages: 13