Attentive Convolutional Recurrent Neural Network Using Phoneme-Level Acoustic Representation for Rare Sound Event Detection

Cited by: 3
Authors
Upadhyay, Shreya G. [1 ,2 ]
Su, Bo-Hao [1 ,2 ]
Lee, Chi-Chun [1 ,2 ]
Affiliations
[1] Natl Tsing Hua Univ, Dept Elect Engn, Hsinchu, Taiwan
[2] MOST Joint Res Ctr AI Technol & All Vista Healthc, Hsinchu, Taiwan
Source
INTERSPEECH 2020 | 2020
Keywords
sound event detection; convolution recurrent neural network; attention; automatic speech recognition; CLASSIFICATION;
DOI
10.21437/Interspeech.2020-2585
Chinese Library Classification (CLC) number
R36 [Pathology]; R76 [Otorhinolaryngology];
Discipline classification code
100104 ; 100213 ;
Abstract
A well-trained acoustic sound event detection (SED) system captures the patterns of sound to accurately detect events of interest in an auditory scene, which enables applications across multimedia, smart living, and even health monitoring. Because sound event data are scarce and weakly labelled, it is often challenging to train an accurate and robust acoustic event detection model directly, especially for rare events. In this paper, we propose an architecture that takes advantage of automatic speech recognition (ASR) network representations as additional input when training a sound event detector. We use a convolutional bi-directional recurrent neural network (CBRNN) with both spectral and temporal attention as the SED classifier, and combine the ASR feature representations with it during end-to-end CBRNN training. Our experiments on the TUT 2017 rare sound event detection dataset show that including ASR features improves the overall discriminative performance of the end-to-end sound event detection system; the average F-score and error rate of our proposed framework are 97 % and 0.05 %, respectively.
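The abstract reports an F-score and an error rate without saying how they are computed. The sketch below shows the definitions commonly used in sound event detection evaluation, on the assumption (not stated in this record) that the paper follows them. The function names and the counts in the usage lines are hypothetical illustrations, not figures from the paper.

```python
def f_score(tp: int, fp: int, fn: int) -> float:
    """F-score from true positives, false positives and false negatives."""
    denom = 2 * tp + fp + fn
    # With no reference events and no detections there is nothing to score.
    return 2 * tp / denom if denom else 0.0


def error_rate(substitutions: int, deletions: int, insertions: int,
               n_reference: int) -> float:
    """Error rate: total errors divided by the number of reference events.

    This is a ratio, not a percentage, and it can exceed 1.0 when the
    system produces many insertions.
    """
    if n_reference <= 0:
        raise ValueError("n_reference must be positive")
    return (substitutions + deletions + insertions) / n_reference


# Hypothetical counts, chosen only to illustrate the arithmetic.
example_f = f_score(tp=97, fp=3, fn=3)
example_er = error_rate(substitutions=0, deletions=3, insertions=3,
                        n_reference=100)
```

With these made-up counts the F-score is 0.97 and the error rate is 0.06. Under these definitions the error rate is normally quoted as a plain ratio rather than a percentage.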
Pages: 3102 - 3106
Page count: 5
Related papers
50 in total
  • [21] Abnormal Event Detection using Recurrent Neural Network
    Zhou, Xu-gang
    Zhang, Li-qing
    2015 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATIONS (CSA), 2015, : 222 - 226
  • [22] Segmenting Acoustic Signal with Articulatory Movement using Recurrent Neural Network for Phoneme Acquisition
    Kanda, Hisashi
    Ogata, Tetsuya
    Komatani, Kazunori
    Okuno, Hiroshi G.
    2008 IEEE/RSJ INTERNATIONAL CONFERENCE ON ROBOTS AND INTELLIGENT SYSTEMS, VOLS 1-3, CONFERENCE PROCEEDINGS, 2008, : 1712 - 1717
  • [23] Sound Event Detection Based on Bidirectional Temporal Convolutional Network and Gated Recurrent Unit
    Chen Yihan
    Guo Min
    Li Zhiqiang
    20TH INT CONF ON UBIQUITOUS COMP AND COMMUNICAT (IUCC) / 20TH INT CONF ON COMP AND INFORMATION TECHNOLOGY (CIT) / 4TH INT CONF ON DATA SCIENCE AND COMPUTATIONAL INTELLIGENCE (DSCI) / 11TH INT CONF ON SMART COMPUTING, NETWORKING, AND SERV (SMARTCNS), 2021, : 445 - 450
  • [24] Polyphonic Sound Event Detection Based on Residual Convolutional Recurrent Neural Network With Semi-Supervised Loss Function
    Kim, Nam Kyun
    Kim, Hong Kook
    IEEE ACCESS, 2021, 9 (09): : 7564 - 7575
  • [25] Minimally Supervised Sound Event Detection Using a Neural Network
    Agarwal, Aditya
    Quadri, Syed Munawwar
    Murthy, Savitha
    Sitaram, Dinkar
    2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2016, : 2495 - 2500
  • [26] Robust technique for environmental sound classification using convolutional recurrent neural network
    Anam Bansal
    Naresh Kumar Garg
    Multimedia Tools and Applications, 2024, 83 : 54755 - 54772
  • [27] Robust technique for environmental sound classification using convolutional recurrent neural network
    Bansal, Anam
    Garg, Naresh Kumar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (18) : 54755 - 54772
  • [28] Sound event localization and detection using element-wise attention gate and asymmetric convolutional recurrent neural networks
    Yan, Lean
    Guo, Min
    Li, Zhiqiang
    AI COMMUNICATIONS, 2023, 36 (02) : 147 - 157
  • [29] Event Detection and Classification Using Deep Compressed Convolutional Neural Network
    Swapnika, K.
    Vasumathi, D.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (12) : 312 - 322
  • [30] Event Detection and Classification Using Deep Compressed Convolutional Neural Network
    Swapnika, K.
    Vasumathi, D.
    International Journal of Advanced Computer Science and Applications, 2022, 13 (12): : 312 - 322