Comparative analysis of audio classification with MFCC and STFT features using machine learning techniques

被引:1
|
作者
Gourisaria M.K. [1 ]
Agrawal R. [1 ]
Sahni M. [2 ]
Singh P.K. [3 ]
机构
[1] School of Computer Engineering, KIIT Deemed to Be University, Odisha, Bhubaneswar
[2] Department of Mathematics, Pandit Deendayal Energy University, Gujarat, Gandhinagar
[3] Central University of Jammu, Jammu & Kashmir, Bagla Suchani
来源
Discover Internet of Things | 2024年 / 4卷 / 01期
关键词
Artificial Neural Network; Audio Classification; Audio file management; Audio visualization; Automated Systems; Mel Frequency Cepstral Coefficients; Short-Time Fourier Transform;
D O I
10.1007/s43926-023-00049-y
中图分类号
学科分类号
摘要
In the era of automated and digitalized information, advanced computer applications deal with a major part of the data that comprises audio-related information. Advancements in technology have ushered in a new era where cutting-edge devices can deliver comprehensive insights into audio content, leveraging sophisticated algorithms such such as Mel Frequency Cepstral Coefficients (MFCCs) and Short-Time Fourier Transform (STFT) to extract and provide pertinent information. Our study helps in not only efficient audio file management and audio file retrievals but also plays a vital role in security, the robotics industry, and investigations. Beyond its industrial applications, our model exhibits remarkable versatility in the corporate sector, particularly in tasks like siren sound detection and more. Embracing this capability holds the promise of catalyzing the development of advanced automated systems, paving the way for increased efficiency and safety across various corporate domains. The primary aim of our experiment is to focus on creating highly efficient audio classification models that can be seamlessly automated and deployed within the industrial sector, addressing critical needs for enhanced productivity and performance. Despite the dynamic nature of environmental sounds and the presence of noises, our presented audio classification model comes out to be efficient and accurate. The novelty of our research work reclines to compare two different audio datasets having similar characteristics and revolves around classifying the audio signals into several categories using various machine learning techniques and extracting MFCCs and STFTs features from the audio signals. We have also tested the results after and before the noise removal for analyzing the effect of the noise on the results including the precision, recall, specificity, and F1-score. Our experiment shows that the ANN model outperforms the other six audio models with the accuracy of 91.41% and 91.27% on respective datasets. © The Author(s) 2023.
引用
收藏
相关论文
共 50 条
  • [31] A Comprehensive Analysis on Question Classification Using Machine Learning and Deep Learning Techniques
    Kogilavani, S., V
    Malliga, S.
    Preethi, A.
    Nandhini, L.
    Praveen, S. R.
    MOBILE COMPUTING AND SUSTAINABLE INFORMATICS, 2022, 68 : 825 - 838
  • [32] A Comparative Study of FFT, STFT and Wavelet Techniques for Induction Machine Fault Diagnostic Analysis
    Mehala, Neelam
    Dahiya, Ratna
    PROCEEDINGS OF THE 7TH WSEAS INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE, MAN-MACHINE SYSTEMS AND CYBERNETICS (CIMMACS '08), 2008, : 203 - 208
  • [33] Statistical Features Identification for Sentiment Analysis using Machine Learning Techniques
    Kamal, Ahmad
    Abulaish, Muhammad
    2013 INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL AND BUSINESS INTELLIGENCE (ISCBI), 2013, : 178 - 181
  • [34] Frog classification using machine learning techniques
    Huang, Chenn-Jung
    Yang, Yi-Ju
    Yang, Dian-Xiu
    Chen, You-Jia
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) : 3737 - 3743
  • [35] Diabetes Classification Using Machine Learning Techniques
    Phongying, Methaporn
    Hiriote, Sasiprapa
    COMPUTATION, 2023, 11 (05)
  • [36] A Comparative Sentiment Analysis Of Sentence Embedding Using Machine Learning Techniques
    Poornima, A.
    Priya, K. Sathiya
    2020 6TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATION SYSTEMS (ICACCS), 2020, : 493 - 496
  • [37] Segmented Glioma Classification Using Radiomics-Based Machine Learning: A Comparative Analysis of Feature Selection Techniques
    Jlassi, Amal
    Omri, Amel
    ElBedoui, Khaoula
    Barhoumi, Walid
    AGENTS AND ARTIFICIAL INTELLIGENCE, ICAART 2023, 2024, 14546 : 425 - 447
  • [38] Severity classification of software code smells using machine learning techniques: A comparative study
    Abdou, Ashraf
    Darwish, Nagy
    JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2024, 36 (01)
  • [39] Automatic Classification of Bird Sounds: Using MFCC and Mel Spectrogram Features with Deep Learning
    Carvalho, Silvestre
    Gomes, Elsa Ferreira
    VIETNAM JOURNAL OF COMPUTER SCIENCE, 2023, 10 (01) : 39 - 54
  • [40] Machine Learning Comparative Analysis for Plant Classification
    Imanov, Elbrus
    Alzouhbi, Abdallah Khaled
    13TH INTERNATIONAL CONFERENCE ON THEORY AND APPLICATION OF FUZZY SYSTEMS AND SOFT COMPUTING - ICAFS-2018, 2019, 896 : 586 - 593