Comparative analysis of audio classification with MFCC and STFT features using machine learning techniques

Authors
Gourisaria M.K. [1]
Agrawal R. [1]
Sahni M. [2]
Singh P.K. [3]
Affiliations
[1] School of Computer Engineering, KIIT Deemed to Be University, Odisha, Bhubaneswar
[2] Department of Mathematics, Pandit Deendayal Energy University, Gujarat, Gandhinagar
[3] Central University of Jammu, Jammu & Kashmir, Bagla Suchani
Source
Discover Internet of Things | 2024 / Vol. 4 / Issue 01
Keywords
Artificial Neural Network; Audio Classification; Audio file management; Audio visualization; Automated Systems; Mel Frequency Cepstral Coefficients; Short-Time Fourier Transform
DOI
10.1007/s43926-023-00049-y
Abstract
In the era of automated and digitalized information, a major part of the data handled by advanced computer applications is audio-related. Advances in technology have produced devices that can deliver comprehensive insights into audio content, using feature-extraction techniques such as Mel Frequency Cepstral Coefficients (MFCCs) and the Short-Time Fourier Transform (STFT) to extract pertinent information. Our study not only supports efficient audio file management and retrieval but also plays a vital role in security, the robotics industry, and investigations. Beyond its industrial applications, our model is versatile in the corporate sector, particularly in tasks such as siren sound detection. This capability can support the development of advanced automated systems and improve efficiency and safety across various corporate domains. The primary aim of our experiment is to create highly efficient audio classification models that can be automated and deployed within the industrial sector, addressing the need for enhanced productivity and performance. Despite the dynamic nature of environmental sounds and the presence of noise, our audio classification model proves efficient and accurate. The novelty of our research lies in comparing two different audio datasets with similar characteristics and in classifying the audio signals into several categories using various machine learning techniques applied to MFCC and STFT features extracted from the signals. We also evaluated the models before and after noise removal to analyze the effect of noise on the results, including precision, recall, specificity, and F1-score. Our experiment shows that the ANN model outperforms the other six audio models, with accuracies of 91.41% and 91.27% on the respective datasets. © The Author(s) 2023.
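
The evaluation metrics named in the abstract (precision, recall, specificity, and F1-score) can be illustrated with a minimal Python sketch. This is not the authors' code: the function names, the one-vs-rest treatment of a single class, and the toy labels ("siren", "dog") are assumptions made purely for illustration, and the numbers are not results from the paper.

```python
from dataclasses import dataclass


@dataclass
class BinaryCounts:
    """Confusion-matrix counts for one class treated as 'positive' (one-vs-rest)."""
    tp: int  # positive clips predicted as positive
    fp: int  # other clips predicted as positive
    tn: int  # other clips predicted as other
    fn: int  # positive clips predicted as other


def counts_for_class(y_true, y_pred, positive):
    """Count TP/FP/TN/FN for the hypothetical class label `positive`."""
    tp = fp = tn = fn = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == positive:
            if truth == positive:
                tp += 1
            else:
                fp += 1
        else:
            if truth == positive:
                fn += 1
            else:
                tn += 1
    return BinaryCounts(tp, fp, tn, fn)


def safe_div(num, den):
    """Return num / den, or 0.0 when the denominator is zero."""
    return num / den if den else 0.0


def classification_metrics(c):
    """Compute the standard metrics from confusion-matrix counts."""
    precision = safe_div(c.tp, c.tp + c.fp)
    recall = safe_div(c.tp, c.tp + c.fn)        # also called sensitivity
    specificity = safe_div(c.tn, c.tn + c.fp)   # true-negative rate
    f1 = safe_div(2 * precision * recall, precision + recall)
    accuracy = safe_div(c.tp + c.tn, c.tp + c.fp + c.tn + c.fn)
    return {
        "precision": precision,
        "recall": recall,
        "specificity": specificity,
        "f1": f1,
        "accuracy": accuracy,
    }


# Toy labels, invented for illustration only (not data from the paper).
y_true = ["siren", "siren", "dog", "dog", "siren", "dog"]
y_pred = ["siren", "dog", "dog", "siren", "siren", "dog"]

counts = counts_for_class(y_true, y_pred, positive="siren")
metrics = classification_metrics(counts)
```

For a multi-class problem such as the one described, the same computation would be repeated with each class in turn as the positive label; how the paper aggregates per-class values is not stated in the abstract.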