Environmental sound recognition using continuous wavelet transform and convolutional neural networks

Cited by: 0
Authors
Mondragón F.J. [1]
Pérez-Meana H.M. [1]
Calderón G. [1]
Jiménez J. [1]
Affiliation
[1] Escuela Superior de Ingeniería Mecánica y Eléctrica Culhuacan, SEPI, Avenida Santa Ana 1000, San Francisco Culhuacan, Culhuacan CTM V, Coyoacán
Source
Informacion Tecnologica | 2021 / Vol. 32 / No. 02
Keywords
Continuous wavelet transform; Deep neural network; Environmental sound recognition; Spectrogram;
DOI
10.4067/S0718-07642021000200061
Abstract
The development of environmental sound classification systems is a topic of extensive research because of its applications in several fields of science and engineering. Like other classification schemes, these systems rely on extracting specific parameters that are fed to the classification stage. This paper proposes a scheme in which a time-frequency representation is first obtained using the continuous wavelet transform (CWT), which has a logarithmic resolution in the frequency domain, like that of the human ear. The CWT representation is then fed to a deep learning neural network to carry out the classification task. Evaluation results obtained on several databases, such as ESC-50, TUT Acoustic Scene, and SONAM-50, show that the proposed scheme provides better classification performance than previously proposed schemes. © 2021 Centro de Informacion Tecnologica. All rights reserved.
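As an illustration of the first stage described in the abstract, the following is a minimal sketch of a CWT-style time-frequency representation with logarithmically spaced frequency bins. It is not the authors' implementation: the Morlet-type wavelet, the parameter `n_cycles`, the function names, and the test tone are all assumptions chosen for illustration, and the neural network stage is omitted.

```python
import cmath
import math


def log_spaced_frequencies(f_min, f_max, n_bins):
    """Return n_bins center frequencies spaced geometrically from f_min to f_max."""
    ratio = (f_max / f_min) ** (1.0 / (n_bins - 1))
    return [f_min * ratio ** k for k in range(n_bins)]


def morlet_scalogram(signal, sample_rate, freqs, n_cycles=6.0):
    """Magnitude of a Morlet-style CWT (assumed wavelet, not taken from the paper).

    Returns one row per frequency in freqs and one column per input sample.
    """
    n = len(signal)
    rows = []
    for f in freqs:
        # Gaussian envelope width in seconds; it shrinks as f grows, so the
        # time resolution improves at high frequencies, as in any CWT.
        sigma = n_cycles / (2.0 * math.pi * f)
        half = int(math.ceil(3.0 * sigma * sample_rate))  # truncate at 3 sigma
        kernel = []
        for k in range(-half, half + 1):
            t = k / sample_rate
            envelope = math.exp(-t * t / (2.0 * sigma * sigma))
            kernel.append(cmath.exp(2j * math.pi * f * t) * envelope)
        norm = sum(abs(w) for w in kernel)
        row = []
        for i in range(n):
            acc = 0j
            # Correlate the signal with the conjugated wavelet around sample i,
            # treating samples outside the signal as zero.
            for j, w in enumerate(kernel):
                idx = i + j - half
                if 0 <= idx < n:
                    acc += signal[idx] * w.conjugate()
            row.append(abs(acc) / norm)
        rows.append(row)
    return rows


# Hypothetical usage: a pure 200 Hz tone sampled at 2 kHz.
sample_rate = 2000.0
tone = [math.sin(2.0 * math.pi * 200.0 * i / sample_rate) for i in range(400)]
freqs = log_spaced_frequencies(50.0, 800.0, 9)  # half-octave steps
scalogram = morlet_scalogram(tone, sample_rate, freqs)

# Index of the frequency bin with the largest response at the middle sample.
mid = len(tone) // 2
peak_bin = max(range(len(freqs)), key=lambda b: scalogram[b][mid])
```

In a pipeline like the one the abstract outlines, a two-dimensional array such as `scalogram` would be treated as an image-like input to the classifier. In practice a library routine, for example a CWT from a wavelet package, would replace this slow pure-Python loop.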
Pages: 61-78
Number of pages: 17
Related papers
50 in total
  • [1] Fundamental Heart Sound Classification using the Continuous Wavelet Transform and Convolutional Neural Networks
    Meintjes, Andries
    Lowe, Andrew
    Legget, Malcolm
    2018 40TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2018, : 409 - 412
  • [2] Lightweight Environmental Sound Recognition Using Convolutional Neural Networks
    Tang, Liwen
    Du, Zhouyang
    Wang, Yawen
    Xiao, Zhuoling
    Lin, Jiazhen
    Zhang, Xiaoyan
    2021 6TH INTERNATIONAL CONFERENCE ON UK-CHINA EMERGING TECHNOLOGIES (UCET 2021), 2021, : 215 - 220
  • [3] Wavelet Transform for the Analysis of Convolutional Neural Networks in Texture Recognition
    Florindo, Joao Batista
    PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 4, 2022, : 502 - 509
  • [4] Directional Continuous Wavelet Transform Applied to Handwritten Numerals Recognition Using Neural Networks
    Romero, Diego J.
    Seijas, Leticia M.
    Ruedin, Ana M.
    JOURNAL OF COMPUTER SCIENCE & TECHNOLOGY, 2007, 7 (01): : 66 - 71
  • [5] Goat Face Recognition Model Based on Wavelet Transform and Convolutional Neural Networks
    Huang L.
    Qian B.
    Guan F.
    Hou Z.
    Zhang Q.
    Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2023, 54 (05): : 278 - 287
  • [6] An EMG-Based Personal Identification Method Using Continuous Wavelet Transform and Convolutional Neural Networks
    Lu, Lijing
    Mao, Jingna
    Wang, Wuqi
    Ding, Guangxin
    Zhang, Zhiwei
    2019 IEEE BIOMEDICAL CIRCUITS AND SYSTEMS CONFERENCE (BIOCAS 2019), 2019,
  • [7] ROBUST SOUND EVENT RECOGNITION USING CONVOLUTIONAL NEURAL NETWORKS
    Zhang, Haomin
    McLoughlin, Ian
    Song, Yan
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 559 - 563
  • [8] Deep Convolutional Neural Networks for Predominant Instrument Recognition in Polyphonic Music Using Discrete Wavelet Transform
    Dash, Sukanta Kumar
    Solanki, S. S.
    Chakraborty, Soubhik
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2024, 43 (7) : 4239 - 4271
  • [9] Biometric Speaker Recognition Using Neural Networks and Wavelet Transform
    Daghbosheh, Mohammed
    Hattab, Ezz
    Bisher, Ahmad
    2011 INTERNATIONAL CONFERENCE ON CIVIL ENGINEERING AND INFORMATION TECHNOLOGY (CEIT 2011), 2011, : 1 - 8
  • [10] Prefiltering for pattern recognition using wavelet transform and neural networks
    Yang, F
    Paindavoine, M
    ADVANCES IN IMAGING AND ELECTRON PHYSICS, VOL 127, 2003, 127 : 125 - 206