Acoustic event recognition using cochleagram image and convolutional neural networks

被引：42

作者：

Sharan, Roneel V. ^{[1
]}

Moir, Tom J. ^{[2
]}

机构：

[1] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld 4072, Australia

[2] Auckland Univ Technol, Sch Engn, Private Bag 92006, Auckland 1142, New Zealand

来源：

APPLIED ACOUSTICS | 2019年 / 148卷

关键词：

Acoustic event recognition; Cochleagram; Convolutional neural network; Mel-spectrogram; Spectrogram; FEATURES; CLASSIFICATION;

D O I：

10.1016/j.apacoust.2018.12.006

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Convolutional neural networks (CNN) have produced encouraging results in image classification tasks and have been increasingly adopted in audio classification applications. However, in using CNN for acoustic event recognition, the first hurdle is finding the best image representation of an audio signal. In this work, we evaluate the performance of four time-frequency representations for use with CNN. Firstly, we consider the conventional spectrogram image. Secondly, we apply moving average to the spectrogram along the frequency domain to obtain what we refer as the smoothed spectrogram. Thirdly, we use the mel-spectrogram which utilizes the mel-filter, as in mel-frequency cepstral coefficients. Finally, we propose the use of a cochleagram image the frequency components of which are based on the frequency selectivity property of the human cochlea. We test the proposed techniques on an acoustic event database containing 50 sound classes. The results show that the proposed cochleagram time-frequency image representation gives the best classification performance when used with CNN. (C) 2018 Elsevier Ltd. All rights reserved.

引用

页码：62 / 66

页数：5

共 50 条

[31] HOW CONVOLUTIONAL NEURAL NETWORKS CAN ENHANCE IMAGE RECOGNITION
Hijazi, Samer
Kumar, Rishi
Rowen, Chris
ELECTRONICS WORLD, 2016, 122 (1959): : 22 - 24
[32] Effective Training of Convolutional Neural Networks for Insect Image Recognition
Martineau, Maxime
Raveaux, Romain
Chatelain, Clement
Conte, Donatello
Venturini, Gilles
ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, ACIVS 2018, 2018, 11182 : 426 - 437
[33] MBVCNN: Joint Convolutional Neural Networks Method for Image Recognition
Tong, Tong
Mu, Xiaodong
Zhang, Li
Yi, Zhaoxiang
Hu, Pei
MATERIALS SCIENCE, ENERGY TECHNOLOGY, AND POWER ENGINEERING I, 2017, 1839
[34] Convolutional Neural Networks for Food Image Recognition: An Experimental Study
Ng, Yi Sen
Xue, Wanqi
Wang, Wei
Qi, Panpan
MADIMA'19: PROCEEDINGS OF THE 5TH INTERNATIONAL WORKSHOP ON MULTIMEDIA ASSISTED DIETARY MANAGEMENT, 2019, : 33 - 41
[35] Image Recognition of Marine Organisms Based on Convolutional Neural Networks
Hu, Yiyang
Ye, Taotao
Gu, Yening
Ding, Xiuhuan
2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 39 - 43
[36] Reservoir Computing with Untrained Convolutional Neural Networks for Image Recognition
Tong, Zhiqiang
Tanaka, Gouhei
2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1289 - 1294
[37] Weighted pooling for image recognition of deep convolutional neural networks
Xiaoning Zhu
Qingyue Meng
Bojian Ding
Lize Gu
Yixian Yang
Cluster Computing, 2019, 22 : 9371 - 9383
[38] Parallelizing Convolutional Neural Networks for Action Event Recognition in Surveillance Videos
Wang, Qicong
Zhao, Jinhao
Gong, Dingxi
Shen, Yehu
Li, Maozhen
Lei, Yunqi
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2017, 45 (04) : 734 - 759
[39] Exploiting Feature Hierarchies with Convolutional Neural Networks for Cultural Event Recognition
Liu, Mengyi
Liu, Xin
Li, Yan
Chen, Xilin
Hauptmann, Alexander G.
Shan, Shiguang
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOP (ICCVW), 2015, : 274 - 279
[40] Parallelizing Convolutional Neural Networks for Action Event Recognition in Surveillance Videos
Qicong Wang
Jinhao Zhao
Dingxi Gong
Yehu Shen
Maozhen Li
Yunqi Lei
International Journal of Parallel Programming, 2017, 45 : 734 - 759

← 1 2 3 4 5 →