Environmental sound recognition using continuous wavelet transform and convolutional neural networks

被引:0
|
作者
Mondragón F.J. [1 ]
Pérez-Meana H.M. [1 ]
Calderón G. [1 ]
Jiménez J. [1 ]
机构
[1] Escuela Superior de Ingeniería Mecánica y Eléctrica Culhuacan, SEPI, Avenida Santa Ana 1000, San Francisco Culhuacan, Culhuacan CTM V, Coyoacán
来源
Informacion Tecnologica | 2021年 / 32卷 / 02期
关键词
Continuous wavelet transform; Deep neural network; Environmental sound recognition; Spectrogram;
D O I
10.4067/S0718-07642021000200061
中图分类号
学科分类号
摘要
This paper proposes a scheme in which a time-frequency representation is first obtained using the continuous wavelet transform (CWT), which has a logarithmic resolution in the frequency domain, like that of the human ear. The development of these environmental sound classification systems is a topic of extensive research due to its application in several fields of science and engineering. Like other classification schemes, they are based on the extraction of specific parameters that are inserted in the classification stage. The CWT is then inserted into a deep learning neural network to carry out the classification task. The evaluation results obtained using several databases such as ESC-50, TUT Acoustic Scene, and SONAM-50 show that the proposed scheme provides a classification performance that is better than that provided by other previously proposed schemes. © 2021 Centro de Informacion Tecnologica. All rights reserved.
引用
收藏
页码:61 / 78
页数:17
相关论文
共 50 条
  • [21] Automatic digital modulation recognition using wavelet transform and neural networks
    Wu, ZL
    Ren, GH
    Wang, XX
    Zhao, YQ
    ADVANCES IN NEURAL NETWORKS - ISNN 2004, PT 1, 2004, 3173 : 936 - 940
  • [22] Iris recognition using discrete wavelet transform and artificial neural networks
    Alim, OA
    Sharkas, M
    Proceedings of the 46th IEEE International Midwest Symposium on Circuits & Systems, Vols 1-3, 2003, : 337 - 340
  • [23] Recognition of ultrasound images using wavelet transform and artificial neural networks
    Juarez-Landin, C
    Ponomaryov, V
    Sanchez-Ramirez, JL
    IEEE CAMSAP 2005: First International Workshop on Computational Advances in Multi-Sensor Adaptive Processing, 2005, : 173 - 176
  • [24] Environmental Sound Recognition Using Masked Conditional Neural Networks
    Medhat, Fady
    Chesmore, David
    Robinson, John
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2017, 2017, 10604 : 373 - 385
  • [25] Application of continuous wavelet transform and convolutional neural networks in fault diagnosis of PMSM stator windings
    Pietrzak, Przemyslaw
    Wolkiewicz, Marcin
    BULLETIN OF THE POLISH ACADEMY OF SCIENCES-TECHNICAL SCIENCES, 2024, 72 (05)
  • [26] Continuous speech recognition by convolutional neural networks
    Zhang, Qing-Qing
    Liu, Yong
    Pan, Jie-Lin
    Yan, Yong-Hong
    Gongcheng Kexue Xuebao/Chinese Journal of Engineering, 2015, 37 (09): : 1212 - 1217
  • [27] Wavelet Convolutional Neural Networks for Handwritten Digits Recognition
    Ben Chaabane, Chiraz
    Mellouli, Dorra
    Hamdani, Tarek M.
    Alimi, Adel M.
    Abraham, Ajith
    HYBRID INTELLIGENT SYSTEMS, HIS 2017, 2018, 734 : 305 - 310
  • [28] Smartphone-based gait recognition using convolutional neural networks and dual-tree complex wavelet transform
    Sezavar, Ahmadreza
    Atta, Randa
    Ghanbari, Mohammad
    MULTIMEDIA SYSTEMS, 2022, 28 (06) : 2307 - 2317
  • [29] Smartphone-based gait recognition using convolutional neural networks and dual-tree complex wavelet transform
    Ahmadreza Sezavar
    Randa Atta
    Mohammad Ghanbari
    Multimedia Systems, 2022, 28 : 2307 - 2317
  • [30] Environmental sound recognition by multilayered neural networks
    Toyoda, Y
    Huang, J
    Ding, SX
    Liu, Y
    FOURTH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY, PROCEEDINGS, 2004, : 123 - 127