Acoustic event recognition using cochleagram image and convolutional neural networks

被引:42
|
作者
Sharan, Roneel V. [1 ]
Moir, Tom J. [2 ]
机构
[1] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld 4072, Australia
[2] Auckland Univ Technol, Sch Engn, Private Bag 92006, Auckland 1142, New Zealand
关键词
Acoustic event recognition; Cochleagram; Convolutional neural network; Mel-spectrogram; Spectrogram; FEATURES; CLASSIFICATION;
D O I
10.1016/j.apacoust.2018.12.006
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Convolutional neural networks (CNN) have produced encouraging results in image classification tasks and have been increasingly adopted in audio classification applications. However, in using CNN for acoustic event recognition, the first hurdle is finding the best image representation of an audio signal. In this work, we evaluate the performance of four time-frequency representations for use with CNN. Firstly, we consider the conventional spectrogram image. Secondly, we apply moving average to the spectrogram along the frequency domain to obtain what we refer as the smoothed spectrogram. Thirdly, we use the mel-spectrogram which utilizes the mel-filter, as in mel-frequency cepstral coefficients. Finally, we propose the use of a cochleagram image the frequency components of which are based on the frequency selectivity property of the human cochlea. We test the proposed techniques on an acoustic event database containing 50 sound classes. The results show that the proposed cochleagram time-frequency image representation gives the best classification performance when used with CNN. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:62 / 66
页数:5
相关论文
共 50 条
  • [21] Image recognition using convolutional neural networks for classification of honey bee subspecies
    Dario De Nart
    Cecilia Costa
    Gennaro Di Prisco
    Emanuele Carpana
    Apidologie, 2022, 53
  • [22] An Image Representation of Skeletal Data for Action Recognition using Convolutional Neural Networks
    Vernikos, Ioannis
    Mathe, Eirini
    Papadakis, Antonios
    Spyrou, Evaggelos
    Mylonas, Phivos
    12TH ACM INTERNATIONAL CONFERENCE ON PERVASIVE TECHNOLOGIES RELATED TO ASSISTIVE ENVIRONMENTS (PETRA 2019), 2019, : 325 - 326
  • [23] Image Classification Using Biomimetic Pattern Recognition with Convolutional Neural Networks Features
    Zhou, Liangji
    Li, Qingwu
    Huo, Guanying
    Zhou, Yan
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2017, 2017
  • [24] Image recognition using convolutional neural networks for classification of honey bee subspecies
    de Nart, Dario
    Costa, Cecilia
    di Prisco, Gennaro
    Carpana, Emanuele
    APIDOLOGIE, 2022, 53 (01)
  • [25] Personality Recognition Using Convolutional Neural Networks
    Gimenez, Maite
    Paredes, Roberto
    Rosso, Paolo
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, CICLING 2017, PT II, 2018, 10762 : 313 - 323
  • [26] Recognition of flowers using convolutional neural networks
    Alkhonin, Abdulrahman
    Almutairi, Abdulelah
    Alburaidi, Abdulmajeed
    Saudagar, Abdul Khader Jilani
    INTERNATIONAL JOURNAL OF INTELLIGENT ENGINEERING INFORMATICS, 2020, 8 (03) : 186 - 197
  • [27] Image Classification Using Convolutional Neural Networks
    Filippov, S. A.
    AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS, 2024, 58 (SUPPL3) : S143 - S149
  • [28] Weighted pooling for image recognition of deep convolutional neural networks
    Zhu, Xiaoning
    Meng, Qingyue
    Ding, Bojian
    Gu, Lize
    Yang, Yixian
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 4): : S9371 - S9383
  • [29] Deep Learning based on Image Recognition Convolutional Neural Networks
    Alamri, Salah
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (04): : 559 - 566
  • [30] Multiactivation Pooling Method in Convolutional Neural Networks for Image Recognition
    Zhao, Qi
    Lyu, Shuchang
    Zhang, Boxue
    Feng, Wenquan
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2018,