Rethinking environmental sound classification using convolutional neural networks: optimized parameter tuning of single feature extraction

Cited by: 0
Authors
Yousef Abd Al-Hattab
Hasan Firdaus Zaki
Amir Akramin Shafie
Affiliations
[1] International Islamic University Malaysia, Department of Mechatronics Engineering
Source
Neural Computing & Applications, 2021, 33 (21)
Keywords
Convolutional neural networks (CNN); Mel-frequency cepstral coefficients (MFCC); Environmental sound classification; Feature extraction; UrbanSound8K dataset
DOI
Not available
Abstract
The classification of environmental sounds is important for emerging applications such as automatic audio surveillance, audio forensics, and robot navigation. Existing techniques combine multiple features and stack many CNN layers (very deep learning) to reach the desired accuracy. Instead of using many features and stacking resource-intensive layers to go deeper, this paper proposes a novel technique that uses only a single feature, the Mel-frequency cepstral coefficient (MFCC), and just three CNN layers. We demonstrate that such a simple network can considerably outperform several conventional and deep learning-based algorithms. By fine-tuning the parameters of the input data, we obtain a model with a significantly less complex architecture that nevertheless reaches an accuracy of 95.59% on the UrbanSound8K dataset, similar to state-of-the-art deep models.
Pages: 14495-14506
Page count: 11
Related papers
50 results in total
  • [1] Rethinking environmental sound classification using convolutional neural networks: optimized parameter tuning of single feature extraction
    Al-Hattab, Yousef Abd
    Zaki, Hasan Firdaus
    Shafie, Amir Akramin
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (21): : 14495 - 14506
  • [2] Feature extraction and classification of heart sound using 1D convolutional neural networks
    Fen Li
    Ming Liu
    Yuejin Zhao
    Lingqin Kong
    Liquan Dong
    Xiaohua Liu
    Mei Hui
    EURASIP Journal on Advances in Signal Processing, 2019
  • [3] Feature extraction and classification of heart sound using 1D convolutional neural networks
    Li, Fen
    Liu, Ming
    Zhao, Yuejin
    Kong, Lingqin
    Dong, Liquan
    Liu, Xiaohua
    Hui, Mei
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2019, 2019 (01)
  • [4] ENVIRONMENTAL SOUND CLASSIFICATION WITH CONVOLUTIONAL NEURAL NETWORKS
    Piczak, Karol J.
    2015 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2015,
  • [5] Heart sound classification algorithm based on bispectral feature extraction and convolutional neural networks
    Peng, Liyong
    Quan, Haiyan
    Shengwu Yixue Gongchengxue Zazhi/Journal of Biomedical Engineering, 2024, 41 (05): : 977 - 985
  • [6] Sound Classification Using Convolutional Neural Networks
    Jaiswal, Kaustumbh
    Patel, Dhairya Kalpeshbhai
    2018 SEVENTH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING IN EMERGING MARKETS (CCEM), 2018, : 81 - 84
  • [7] Environmental Sound Classification using Deep Convolutional Neural Networks and Data Augmentation
    Davis, Nithya
    Suresh, K.
    2018 IEEE RECENT ADVANCES IN INTELLIGENT COMPUTATIONAL SYSTEMS (RAICS), 2018, : 41 - 45
  • [8] Regularized Deep Convolutional Neural Networks for Feature Extraction and Classification
    Jayech, Khaoula
    NEURAL INFORMATION PROCESSING (ICONIP 2017), PT II, 2017, 10635 : 431 - 439
  • [9] Detection of Diabetic Retinopathy using Convolutional Neural Networks for Feature Extraction and Classification (DRFEC)
    Das, Dolly
    Biswas, Saroj Kumar
    Bandyopadhyay, Sivaji
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (19) : 29943 - 30001
  • [10] Articulatory Feature Classification Using Convolutional Neural Networks
    Merkx, Danny
    Scharenborg, Odette
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2142 - 2146