Classification of Environmental Sounds with Convolutional Neural Networks

Times cited: 0
Authors
Dincer, Yalcin [1]
Inik, Ozkan [2]
Affiliations
[1] Bingol Univ, Tekn Bilimler Meslek Yuksekokulu, Bilgisayar Teknol Bolumu, Bingol, Turkiye
[2] Tokat Gaziosmanpasa Univ, Muhendislik & Mimarlik Fak, Bilgisayar Muhendisligi Bolumu, Tokat, Turkiye
Source
KONYA JOURNAL OF ENGINEERING SCIENCES | 2023 / Volume 11 / Issue 02
Keywords
Deep Learning; Convolutional Neural Network; Environmental Sound Classification; ESC10; UrbanSound8K; SURVEILLANCE; MATRIX; RECOGNITION;
DOI
10.36306/konjes.1201558
Chinese Library Classification (CLC) number
T [Industrial Technology]
Discipline classification code
08
Abstract
Sound data is critical for predicting the effects of environmental activities and for gathering information about the settings in which they occur. It provides basic information about urban concerns such as noise pollution, security systems, health care, and local services, which makes Environmental Sound Classification (ESC) increasingly important. Because data volumes are growing and analysis time is limited, new and powerful artificial intelligence methods are needed that identify sounds automatically and immediately. Such methods can be built on Convolutional Neural Network (CNN) models, which have achieved high accuracy in other fields. This study therefore proposes a new CNN-based method for classifying two ESC datasets, ESC10 and UrbanSound8K. The sounds are first converted into image format, and novel CNN models are then designed to classify these images (the "ESA" in the model names is the Turkish abbreviation for CNN). The sound recordings were converted to images of 32x32x3 and 224x224x3 dimensions, yielding four image datasets. Multiple CNN models were designed for each dataset, and the one with the highest accuracy was selected. The selected models are named ESC10_ESA32, ESC10_ESA224, URBANSOUND8K_ESA32, and URBANSOUND8K_ESA224, and they were trained using 10-fold cross-validation. Their average accuracy rates are 80.75%, 82.25%, 88.60%, and 84.33%, respectively. Compared with baseline studies in the literature on the same datasets, these models achieve better results.
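The abstract does not state which time-frequency representation is used to turn a recording into an image. As a minimal sketch, assuming a log-magnitude spectrogram rendered in grayscale and copied into three channels, the conversion to a 32x32x3 image and a 10-fold split might look like the following. All function names and parameters (log_spectrogram, to_rgb_image, k_fold_indices, frame_len, hop) are hypothetical and not taken from the paper, and the test tone and the 400-sample count are purely illustrative.

```python
import cmath
import math
import random


def log_spectrogram(signal, frame_len=64, hop=32):
    """Return a log-magnitude spectrogram as a list of frames, each a list of bins."""
    frames = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        # A Hann window reduces spectral leakage at the frame edges.
        frame = [
            signal[start + n] * (0.5 - 0.5 * math.cos(2 * math.pi * n / (frame_len - 1)))
            for n in range(frame_len)
        ]
        bins = []
        for k in range(frame_len // 2 + 1):
            # Direct DFT of one bin; slow but dependency-free.
            coeff = sum(
                x * cmath.exp(-2j * math.pi * k * n / frame_len)
                for n, x in enumerate(frame)
            )
            bins.append(math.log1p(abs(coeff)))
        frames.append(bins)
    return frames


def to_rgb_image(spec, size=32):
    """Resize to size x size (nearest neighbour), scale to 0..255, copy to 3 channels."""
    lo = min(min(row) for row in spec)
    hi = max(max(row) for row in spec)
    scale = 255.0 / (hi - lo) if hi > lo else 0.0
    n_frames, n_bins = len(spec), len(spec[0])
    image = []
    for r in range(size):
        # Row 0 holds the highest frequency, as in a conventional spectrogram plot.
        b = (size - 1 - r) * n_bins // size
        row = []
        for c in range(size):
            t = c * n_frames // size
            v = int(round((spec[t][b] - lo) * scale))
            row.append((v, v, v))  # assumption: grayscale replicated into R, G, B
        image.append(row)
    return image


def k_fold_indices(n_samples, k=10, seed=0):
    """Yield (train, test) index lists for k-fold cross-validation."""
    order = list(range(n_samples))
    random.Random(seed).shuffle(order)
    folds = [order[i::k] for i in range(k)]
    for i in range(k):
        train = [j for m, fold in enumerate(folds) if m != i for j in fold]
        yield train, folds[i]


# Illustrative input: a 1 kHz tone sampled at 8 kHz (not data from the paper).
signal = [math.sin(2 * math.pi * 1000 * n / 8000) for n in range(1024)]
image = to_rgb_image(log_spectrogram(signal), size=32)
splits = list(k_fold_indices(400, k=10))
```

Calling to_rgb_image with size=224 gives the larger of the two image formats named in the abstract. In a full pipeline, a CNN would be trained on each train split and evaluated on the matching test split, and the ten accuracies averaged.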
Pages: 24
Related papers
50 items in total
  • [1] Classification of lung sounds using convolutional neural networks
    Murat Aykanat
    Özkan Kılıç
    Bahar Kurt
    Sevgi Saryal
    EURASIP Journal on Image and Video Processing, 2017
  • [2] Lung sounds classification using convolutional neural networks
    Bardou, Dalal
    Zhang, Kun
    Ahmad, Sayed Mohammad
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2018, 88 : 58 - 69
  • [3] Classification of lung sounds using convolutional neural networks
    Aykanat, Murat
    Kilic, Ozkan
    Kurt, Bahar
    Saryal, Sevgi
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2017
  • [4] ENVIRONMENTAL SOUND CLASSIFICATION WITH CONVOLUTIONAL NEURAL NETWORKS
    Piczak, Karol J.
    2015 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2015
  • [5] Classification of Environmental Sounds Using Convolutional Neural Network with Bispectral Analysis
    Hirata, Katsumi
    Kato, Takehito
    Oshima, Ryuichi
    2019 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ISPACS), 2019
  • [6] Automatic Heart and Lung Sounds Classification using Convolutional Neural Networks
    Chen, Qiyu
    Zhang, Weibin
    Tian, Xiang
    Zhang, Xiaoxue
    Chen, Shaoqiong
    Lei, Wenkang
    2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016
  • [7] Pre-trained Convolutional Neural Networks for the Lung Sounds Classification
    Vaityshyn, Valentyn
    Porieva, Hanna
    Makarenkova, Anastasiia
    2019 IEEE 39TH INTERNATIONAL CONFERENCE ON ELECTRONICS AND NANOTECHNOLOGY (ELNANO), 2019, : 522 - 525
  • [8] Convolutional Neural Networks for the Classification of Bronchopulmonary System Diseases with the Use of Lung Sounds
    Vaityshyn, Valentyn
    Chekhovych, Mariia
    Poreva, Anna
    2018 IEEE 38TH INTERNATIONAL CONFERENCE ON ELECTRONICS AND NANOTECHNOLOGY (ELNANO), 2018, : 383 - 386
  • [9] Cochleogram-based adventitious sounds classification using convolutional neural networks
    Mang, L. D.
    Canadas-Quesada, F. J.
    Carabias-Orti, J. J.
    Combarro, E. F.
    Ranilla, J.
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 82
  • [10] Classification of Respiratory Sounds into Crackles and Noncrackles Categories via Convolutional Neural Networks
    Qi Dexuan
    Ye, Yuan
    Zhao Haiwen
    Wu Wenjuan
    Guo Shijie
    2024 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, ICMA 2024, 2024, : 800 - 805