Environmental sound classification using temporal-frequency attention based convolutional neural network

Authors
Wenjie Mu
Bo Yin
Xianqing Huang
Jiali Xu
Zehua Du
Affiliations
[1] Ocean University of China, College of Information Science and Engineering
[2] Pilot National Laboratory for Marine Science and Technology
Abstract
Environmental sound classification is an important problem in audio recognition. Compared with structured sounds such as speech and music, environmental sounds have a more complicated time–frequency structure. To learn time and frequency features from the Log-Mel spectrogram more effectively, this paper proposes a temporal-frequency attention based convolutional neural network model (TFCNN). First, a motivating experiment is designed to verify the effect of a specific frequency band in the spectrogram on model classification. Second, two new attention mechanisms are proposed: a temporal attention mechanism and a frequency attention mechanism. These mechanisms focus on key frequency bands and semantically related time frames in the spectrogram, reducing the influence of background noise and irrelevant frequency bands. The two mechanisms are then combined so that their feature information is complementary, which captures the critical time–frequency features more accurately. In this way, the representation ability of the network model can be greatly improved. Finally, experiments on two public data sets, UrbanSound8K and ESC-50, demonstrate the effectiveness of the proposed method.
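The abstract does not give the exact form of the two attention mechanisms, so the following is only a minimal NumPy sketch of the general idea: one weight per Mel band, one weight per time frame, both applied to a feature map. The function names, the mean-pooling step, the single linear layers `w_f` and `w_t`, and the multiplicative combination are all assumptions made for illustration, not the TFCNN design from the paper.

```python
import numpy as np


def softmax(x, axis):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    e = np.exp(shifted)
    return e / e.sum(axis=axis, keepdims=True)


def frequency_attention(feat, w_f):
    """Return one weight per Mel band (weights sum to 1).

    feat: feature map of shape (channels, n_mels, n_frames).
    w_f:  hypothetical learned matrix of shape (n_mels, n_mels).
    """
    # Average over channels and time to describe each frequency band.
    band_descriptor = feat.mean(axis=(0, 2))
    return softmax(w_f @ band_descriptor, axis=0)


def temporal_attention(feat, w_t):
    """Return one weight per time frame (weights sum to 1).

    w_t: hypothetical learned matrix of shape (n_frames, n_frames).
    """
    # Average over channels and frequency to describe each time frame.
    frame_descriptor = feat.mean(axis=(0, 1))
    return softmax(w_t @ frame_descriptor, axis=0)


def temporal_frequency_attention(feat, w_f, w_t):
    """Re-weight the feature map along both frequency and time.

    The weights are rescaled by the axis length so that uniform
    attention leaves the input unchanged (an illustrative choice).
    """
    _, n_mels, n_frames = feat.shape
    a_f = frequency_attention(feat, w_f)
    a_t = temporal_attention(feat, w_t)
    scale_f = (a_f * n_mels)[None, :, None]
    scale_t = (a_t * n_frames)[None, None, :]
    return feat * scale_f * scale_t, a_f, a_t
```

In this sketch, bands and frames with larger weights are amplified and the rest are attenuated, which mirrors the stated goal of suppressing background noise and irrelevant frequency bands. In a trained network `w_f` and `w_t` would be learned jointly with the convolutional layers.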
Related papers
50 in total
  • [41] EEG Classification Using Hybrid Convolutional Neural Network with Attention Mechanism
    Ciurea, Alexe
    Manoila, Cristina-Petruta
    Ionescu, Bogdan
    ADVANCES IN DIGITAL HEALTH AND MEDICAL BIOENGINEERING, VOL 1, EHB-2023, 2024, 109 : 783 - 791
  • [42] Audio classification using attention-augmented convolutional neural network
    Wu, Yu
    Mao, Hua
    Yi, Zhang
    KNOWLEDGE-BASED SYSTEMS, 2018, 161 : 90 - 100
  • [44] Convolutional Neural Network-Gated Recurrent Unit Neural Network with Feature Fusion for Environmental Sound Classification
    Zhang, Yu
    Zeng, Jinfang
    Li, Youming
    Chen, Da
    AUTOMATIC CONTROL AND COMPUTER SCIENCES, 2021, 55 (04) : 311 - 318
  • [45] Convolutional Neural Network with SDP-based Attention for Relation Classification
    Li, Ning
    Zhang, Hui
    Chen, Yong
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2018 : 615 - 618
  • [46] Hyperspectral Image Classification Based on Dilated Convolutional Attention Neural Network
    Zhang Xiangdong
    Wang Tengjun
    Zhu Shaojun
    Yang Yun
    ACTA OPTICA SINICA, 2021, 41 (03)
  • [47] Image Classification based on Self-attention Convolutional Neural Network
    Cai, Xiaohong
    Li, Ming
    Cao, Hui
    Ma, Jingang
    Wang, Xiaoyan
    Zhuang, Xuqiang
    SIXTH INTERNATIONAL WORKSHOP ON PATTERN RECOGNITION, 2021, 11913
  • [48] Attention-Based Convolutional Neural Network for Earthquake Event Classification
    Ku, Bonhwa
    Kim, Gwantae
    Ahn, Jae-Kwang
    Lee, Jimin
    Ko, Hanseok
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2021, 18 (12) : 2057 - 2061
  • [49] Indoor emergency situation recognition based on self-attention residual temporal convolutional neural network using sound and activity information
    Lee, Ju-Hwan
    Kim, Hyoung-Gook
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2024, 43 (06) : 637 - 643
  • [50] Environmental Sound Classification Method Based on Compact Bilinear Attention Network
    Dong S.
    Xia Z.
    Cai W.
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2023, 46 (06) : 102 - 107