Rethinking environmental sound classification using convolutional neural networks: optimized parameter tuning of single feature extraction

被引:0
|
作者
Yousef Abd Al-Hattab
Hasan Firdaus Zaki
Amir Akramin Shafie
机构
[1] International Islamic University Malaysia,Department of Mechatronics Engineering
来源
关键词
Convolutional neural networks (CNN); Mel-frequency cepstral coefficients (MFCC); Environmental sound classification; Feature extraction; Urbansound8Kdataset;
D O I
暂无
中图分类号
学科分类号
摘要
The classification of environmental sounds is important for emerging applications such as automatic audio surveillance, audio forensics, and robot navigation. Existing techniques combined multiple features and stacked many CNN layers (very deep learning) to reach the desired accuracy. Instead of using many features and going deeper by stacking layers that are resource extensive, this paper proposes a novel technique that uses only a single feature, namely the Mel-Frequency Cepstral Coefficient (MFCC) and just three layers of CNN. We demonstrate that such a simple network can considerably outperform several conventional and deep learning-based algorithms. Through parameters fine-tuning of the data input, we reported a model that is significantly less complex in the architecture yet has recorded a similar accuracy of 95.59% compared to state-of-the-art deep models on UrbanSound8k dataset.
引用
收藏
页码:14495 / 14506
页数:11
相关论文
共 50 条
  • [41] Robust technique for environmental sound classification using convolutional recurrent neural network
    Bansal, Anam
    Garg, Naresh Kumar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (18) : 54755 - 54772
  • [42] Convolutional Neural Network-Gated Recurrent Unit Neural Network with Feature Fusion for Environmental Sound Classification
    Zhang, Yu
    Zeng, Jinfang
    Li, Youming
    Chen, Da
    AUTOMATIC CONTROL AND COMPUTER SCIENCES, 2021, 55 (04) : 311 - 318
  • [43] Development of Convolutional Neural Networks (CNNs) for Feature Extraction
    Eikmeier, Nicole
    Westerkamp, Rachel
    Zelnio, Edmund
    ALGORITHMS FOR SYNTHETIC APERTURE RADAR IMAGERY XXV, 2018, 10647
  • [44] Acoustic Scene Classification using Convolutional Neural Networks and Multi-Scale Multi-Feature Extraction
    Dang, An
    Vu, Toan H.
    Wang, Jia-Ching
    2018 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2018,
  • [45] Power quality event classification using optimized Bayesian convolutional neural networks
    Ekici, Sami
    Ucar, Ferhat
    Dandil, Besir
    Arghandeh, Reza
    ELECTRICAL ENGINEERING, 2021, 103 (01) : 67 - 77
  • [46] Power quality event classification using optimized Bayesian convolutional neural networks
    Sami Ekici
    Ferhat Ucar
    Besir Dandil
    Reza Arghandeh
    Electrical Engineering, 2021, 103 : 67 - 77
  • [47] Eating Sound Dataset for 20 Food Types and Sound Classification Using Convolutional Neural Networks
    Ma, Jeannette Shijie
    Maureira, Marcello A. Gomez
    van Rijn, Jan N.
    COMPANION PUBLICATON OF THE 2020 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION (ICMI '20 COMPANION), 2020, : 348 - 351
  • [48] Performance comparison of lung sound classification using various convolutional neural networks
    Kim, Gee Yeun
    Kim, Hyoung-Gook
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2019, 38 (05): : 568 - 573
  • [49] Convolutional Recurrent Neural Networks for Urban Sound Classification using Raw Waveforms
    Sang, Jonghee
    Park, Soomyung
    Lee, Junwoo
    2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 2444 - 2448
  • [50] Environmental sound recognition using continuous wavelet transform and convolutional neural networks
    Mondragón F.J.
    Pérez-Meana H.M.
    Calderón G.
    Jiménez J.
    Informacion Tecnologica, 2021, 32 (02): : 61 - 78