Environmental sound classification using temporal-frequency attention based convolutional neural network

被引:0
|
作者
Wenjie Mu
Bo Yin
Xianqing Huang
Jiali Xu
Zehua Du
机构
[1] Ocean University of China,College of Information Science and Engineering
[2] Pilot National Laboratory for Marine Science and Technology,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Environmental sound classification is one of the important issues in the audio recognition field. Compared with structured sounds such as speech and music, the time–frequency structure of environmental sounds is more complicated. In order to learn time and frequency features from Log-Mel spectrogram more effectively, a temporal-frequency attention based convolutional neural network model (TFCNN) is proposed in this paper. Firstly, an experiment that is used as motivation in proposed method is designed to verify the effect of a specific frequency band in the spectrogram on model classification. Secondly, two new attention mechanisms, temporal attention mechanism and frequency attention mechanism, are proposed. These mechanisms can focus on key frequency bands and semantic related time frames on the spectrogram to reduce the influence of background noise and irrelevant frequency bands. Then, a feature information complementarity is formed by combining these mechanisms to more accurately capture the critical time–frequency features. In such a way, the representation ability of the network model can be greatly improved. Finally, experiments on two public data sets, UrbanSound 8 K and ESC-50, demonstrate the effectiveness of the proposed method.
引用
收藏
相关论文
共 50 条
  • [11] Robust technique for environmental sound classification using convolutional recurrent neural network
    Anam Bansal
    Naresh Kumar Garg
    Multimedia Tools and Applications, 2024, 83 : 54755 - 54772
  • [12] Robust technique for environmental sound classification using convolutional recurrent neural network
    Bansal, Anam
    Garg, Naresh Kumar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (18) : 54755 - 54772
  • [13] Environment Sound Classification using Multiple Feature Channels and Attention based Deep Convolutional Neural Network
    Sharma, Jivitesh
    Granmo, Ole-Christoffer
    Goodwin, Morten
    INTERSPEECH 2020, 2020, : 1186 - 1190
  • [14] Temporal Self-Attention-Based Residual Network for An Environmental Sound Classification
    Tripathi, Achyut Mani
    Paul, Konark
    INTERSPEECH 2022, 2022, : 1516 - 1520
  • [15] Fast environmental sound classification based on resource adaptive convolutional neural network
    Zheng Fang
    Bo Yin
    Zehua Du
    Xianqing Huang
    Scientific Reports, 12
  • [16] Fast environmental sound classification based on resource adaptive convolutional neural network
    Fang, Zheng
    Yin, Bo
    Du, Zehua
    Huang, Xianqing
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [17] Convolutional neural network based traffic sound classification robust to environmental noise
    Lee, Jaejun
    Kim, Wansoo
    Lee, Kyogu
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2018, 37 (06): : 469 - 474
  • [18] Deep Convolutional Neural Network with Mixup for Environmental Sound Classification
    Zhang, Zhichao
    Xu, Shugong
    Cao, Shan
    Zhang, Shunqing
    PATTERN RECOGNITION AND COMPUTER VISION, PT II, 2018, 11257 : 356 - 367
  • [19] Animal Sound Classification Using A Convolutional Neural Network
    Sasmaz, Emre
    Tek, F. Boray
    2018 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2018, : 625 - 629
  • [20] Environmental sound classification using a regularized deep convolutional neural network with data augmentation
    Mushtaq, Zohaib
    Su, Shun-Feng
    APPLIED ACOUSTICS, 2020, 167