Environmental sound recognition using continuous wavelet transform and convolutional neural networks

Cited by: 0
Authors
Mondragón F.J. [1]
Pérez-Meana H.M. [1]
Calderón G. [1]
Jiménez J. [1]
Affiliation
[1] Escuela Superior de Ingeniería Mecánica y Eléctrica Culhuacan, SEPI, Avenida Santa Ana 1000, San Francisco Culhuacan, Culhuacan CTM V, Coyoacán
Source
Informacion Tecnologica | 2021 / Vol. 32 / No. 02
Keywords
Continuous wavelet transform; Deep neural network; Environmental sound recognition; Spectrogram;
DOI
10.4067/S0718-07642021000200061
Abstract
Environmental sound classification systems are a topic of extensive research because of their applications in several fields of science and engineering. Like other classification schemes, they extract specific parameters from the signal and pass them to a classification stage. This paper proposes a scheme in which a time-frequency representation is first obtained using the continuous wavelet transform (CWT), which has a logarithmic resolution in the frequency domain, like that of the human ear. The CWT output is then fed into a deep learning neural network that carries out the classification task. Evaluation results obtained on several databases, including ESC-50, TUT Acoustic Scene, and SONAM-50, show that the proposed scheme classifies better than other previously proposed schemes. © 2021 Centro de Informacion Tecnologica. All rights reserved.
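The first stage described in the abstract, a CWT-based time-frequency representation with logarithmic frequency resolution, can be illustrated with a minimal sketch. The abstract does not state which wavelet or parameters the authors use, so everything here is an assumption for illustration: the Morlet wavelet, the `w0 = 6.0` setting, the 64 log-spaced frequencies, the synthetic 440 Hz test tone, and the hypothetical function name `morlet_cwt`. The resulting 2-D magnitude array (a scalogram) is the kind of image-like input that a convolutional network could then classify.

```python
import numpy as np

def morlet_cwt(signal, sample_rate, freqs, w0=6.0):
    """Magnitude of a Morlet CWT at the given center frequencies (Hz).

    Illustrative sketch only: wavelet choice and w0 are assumptions,
    not taken from the paper.
    """
    n = len(signal)
    # Transform the signal once; each scale is then a product in the frequency domain.
    sig_fft = np.fft.fft(signal)
    omega = 2 * np.pi * np.fft.fftfreq(n, d=1.0 / sample_rate)  # rad/s
    scalogram = np.empty((len(freqs), n))
    for i, f in enumerate(freqs):
        scale = w0 / (2 * np.pi * f)  # scale whose passband is centered on f
        # Analytic Morlet wavelet spectrum: Gaussian around w0, positive frequencies only.
        psi_hat = np.exp(-0.5 * (scale * omega - w0) ** 2) * (omega > 0)
        scalogram[i] = np.abs(np.fft.ifft(sig_fft * psi_hat))
    return scalogram

# Synthetic example: a 1-second 440 Hz tone sampled at 8 kHz.
sample_rate = 8000
t = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 440.0 * t)

# Log-spaced analysis frequencies give constant relative (logarithmic) resolution.
freqs = np.geomspace(50.0, 4000.0, 64)
scalogram = morlet_cwt(tone, sample_rate, freqs)

# The row with the most energy should sit near the tone's frequency.
peak_freq = freqs[np.argmax(scalogram.mean(axis=1))]
```

Because the frequencies are geometrically spaced, each row covers the same fraction of an octave, which is the logarithmic behavior the abstract compares to human hearing.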
Pages: 61 - 78
Number of pages: 17
Related papers
50 in total
  • [41] Sound Classification Using Convolutional Neural Networks
    Jaiswal, Kaustumbh
    Patel, Dhairya Kalpeshbhai
    2018 SEVENTH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING IN EMERGING MARKETS (CCEM), 2018, : 81 - 84
  • [42] Continuous Emotion Recognition with Spatiotemporal Convolutional Neural Networks
    Teixeira, Thomas
    Granger, Eric
    Lameiras Koerich, Alessandro
    APPLIED SCIENCES-BASEL, 2021, 11 (24):
  • [43] Continuous Speech Emotion Recognition with Convolutional Neural Networks
    Vryzas, Nikolaos
    Vrysis, Lazaros
    Matsiola, Maria
    Kotsakis, Rigas
    Dimoulas, Charalampos
    Kalliris, George
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2020, 68 (1-2): : 14 - 24
  • [44] Continuous speech emotion recognition with convolutional neural networks
    Vryzas, Nikolaos
    Vrysis, Lazaros
    Matsiola, Maria
    Kotsakis, Rigas
    Dimoulas, Charalampos
    Kalliris, George
    AES: Journal of the Audio Engineering Society, 2020, 68 (1-2): : 14 - 24
  • [45] Large-scale Continuous Gesture Recognition Using Convolutional Neural Networks
    Wang, Pichao
    Li, Wanqing
    Liu, Song
    Zhang, Yuyao
    Gao, Zhimin
    Ogunbona, Philip
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 13 - 18
  • [46] Image recognition based on wavelet transform and artificial neural networks
    Zhai, Jun-Hai
    Zhang, Su-Fang
    Liu, Li-Juan
    PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 789 - +
  • [47] Wavelet Transform Assisted Neural Networks for Human Activity Recognition
    Sengupta, Roshwin
    Polian, Ilia
    Hayes, John P.
    2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 1254 - 1258
  • [48] Optimization of Convolutional Neural Networks for Classifying Power Quality Disturbances Using Wavelet Synchrosqueezed Transform
    Akkaya, Sitki
    TRAITEMENT DU SIGNAL, 2024, 41 (02) : 599 - 614
  • [49] Object Detection in Aerial Navigation using Wavelet Transform and Convolutional Neural Networks: A First Approach
    Fortuna-Cervantes, J. M.
    Ramirez-Torres, M. T.
    Martinez-Carranza, J.
    Murguia-Ibarra, J. S.
    Mejia-Carlos, M.
    PROGRAMMING AND COMPUTER SOFTWARE, 2020, 46 (08) : 536 - 547
  • [50] Object Detection in Aerial Navigation using Wavelet Transform and Convolutional Neural Networks: A First Approach
    J. M. Fortuna-Cervantes
    M. T. Ramírez-Torres
    J. Martínez-Carranza
    J. S. Murguía-Ibarra
    M. Mejía-Carlos
    Programming and Computer Software, 2020, 46 : 536 - 547