Environmental sound recognition using continuous wavelet transform and convolutional neural networks

被引:0
|
作者
Mondragón F.J. [1 ]
Pérez-Meana H.M. [1 ]
Calderón G. [1 ]
Jiménez J. [1 ]
机构
[1] Escuela Superior de Ingeniería Mecánica y Eléctrica Culhuacan, SEPI, Avenida Santa Ana 1000, San Francisco Culhuacan, Culhuacan CTM V, Coyoacán
来源
Informacion Tecnologica | 2021年 / 32卷 / 02期
关键词
Continuous wavelet transform; Deep neural network; Environmental sound recognition; Spectrogram;
D O I
10.4067/S0718-07642021000200061
中图分类号
学科分类号
摘要
This paper proposes a scheme in which a time-frequency representation is first obtained using the continuous wavelet transform (CWT), which has a logarithmic resolution in the frequency domain, like that of the human ear. The development of these environmental sound classification systems is a topic of extensive research due to its application in several fields of science and engineering. Like other classification schemes, they are based on the extraction of specific parameters that are inserted in the classification stage. The CWT is then inserted into a deep learning neural network to carry out the classification task. The evaluation results obtained using several databases such as ESC-50, TUT Acoustic Scene, and SONAM-50 show that the proposed scheme provides a classification performance that is better than that provided by other previously proposed schemes. © 2021 Centro de Informacion Tecnologica. All rights reserved.
引用
收藏
页码:61 / 78
页数:17
相关论文
共 50 条
  • [31] Neural Networks for Automatic Environmental Sound Recognition
    Segarceanu, Svetlana
    Suciu, George
    Gavat, Inge
    2021 INTERNATIONAL CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN-COMPUTER DIALOGUE (SPED), 2021, : 7 - 12
  • [32] Applying the Wavelet Transform to Radar Signals for Drone Classification using Convolutional Neural Networks
    Hunter, Emily
    Raval, Divy
    Carniglia, Peter
    Balaji, Bhashyam
    RADAR SENSOR TECHNOLOGY XXVI, 2022, 12108
  • [33] Multifocus image fusion using convolutional neural networks in the discrete wavelet transform domain
    Wang, Zeyu
    Li, Xiongfei
    Duan, Haoran
    Zhang, Xiaoli
    Wang, Hancheng
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (24) : 34483 - 34512
  • [34] Persian sign language (PSL) recognition using wavelet transform and neural networks
    Karami, Ali
    Zanj, Bahman
    Sarkaleh, Azadeh Kiani
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (03) : 2661 - 2667
  • [35] Automatic Modulation Recognition Using Wavelet Transform and Neural Networks in Wireless Systems
    Hassan, K.
    Dayoub, I.
    Hamouda, W.
    Berbineau, M.
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2010,
  • [36] Multifocus image fusion using convolutional neural networks in the discrete wavelet transform domain
    Zeyu Wang
    Xiongfei Li
    Haoran Duan
    Xiaoli Zhang
    Hancheng Wang
    Multimedia Tools and Applications, 2019, 78 : 34483 - 34512
  • [37] Automatic Modulation Recognition Using Wavelet Transform and Neural Networks in Wireless Systems
    K. Hassan
    I. Dayoub
    W. Hamouda
    M. Berbineau
    EURASIP Journal on Advances in Signal Processing, 2010
  • [38] Visual recognition of noisy fastening bolts using neural networks and wavelet transform
    Mazzeo, PL
    Nitti, M
    Stella, E
    Distante, A
    PROCEEDINGS OF THE FOURTH IASTED INTERNATIONAL CONFERENCE ON VISUALIZATION, IMAGING, AND IMAGE PROCESSING, 2004, : 566 - 571
  • [39] Environmental Sound Classification using Deep Convolutional Neural Networks and Data Augmentation
    Davis, Nithya
    Suresh, K.
    2018 IEEE RECENT ADVANCES IN INTELLIGENT COMPUTATIONAL SYSTEMS (RAICS), 2018, : 41 - 45
  • [40] Heart sound recognition through analysis of wavelet transform and neural network
    Hong, JP
    Lee, JJ
    Jung, SB
    Hong, SH
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2003, E86D (06) : 1116 - 1121