Voice pathology detection using machine learning algorithms based on different voice databases

被引:0
|
作者
Latiff, Nurul Mu'azzah Abdul [1 ]
Al-Dhief, Fahad Taha [1 ,2 ]
Sazihan, Nurul Fariesya Suhaila Md [1 ]
Baki, Marina Mat [3 ]
Abd Malik, Nik Noordini Nik [1 ]
Albadr, Musatafa Abbas Abbood [4 ]
Abbas, Ali Hashim [5 ]
机构
[1] Univ Teknol Malaysia, Fac Elect Engn, Fac Engn, Utm Johor Bahru, Johor, Malaysia
[2] Univ Kebangsaan Malaysia, Fac Informat Sci & Technol, Bangi 43600, Malaysia
[3] Univ Kebangsaan Malaysia Med Ctr, Fac Med, Dept Otorhinolaryngol, Kuala Lumpur, Malaysia
[4] Basrah Univ Oil & Gas, Coll Ind Management Oil & Gas, Dept Petr Project Management, Al Basrah, Iraq
[5] Imam Jaafar Al Sadiq Univ, Coll Informat Technol, Dept Comp Tech Engn, Al Muthanna, Iraq
关键词
Machine learning; Voice pathology detection; OSELM; SVM; DT; NB; MFCC; SVD; MVPD; CLASSIFICATION; TRANSFORM; FEATURES;
D O I
10.1016/j.rineng.2025.103937
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The application of machine learning in analyzing voice disorders has become crucial for non-invasive voice pathology detection using voice signals. However, current systems face challenges such as low detection accuracy, limited databases, and evaluation metrics. More importantly, most existing studies rely on training and testing algorithms based on the same database, limiting their applicability in real-world scenarios with diverse data sources. Unlike traditional approaches that focus solely on single-database training and testing, this study presents a cross-database evaluation strategy to assess the robustness and generalizability of machine learning algorithms for voice pathology detection. Several algorithms, including Online Sequential Extreme Learning Machine (OSELM), Support Vector Machine (SVM), Decision Tree (DT), and Na & iuml;ve Bayes (NB), were evaluated using two databases: the Saarbrucken Voice Database (SVD) and the Malaysian Voice Pathology Database (MVPD). Two scenarios were considered: (1) training and testing on the same database and (2) training on one database and testing on another. The proposed study uses the Mel-Frequency Cepstral Coefficient (MFCC) technique for extracting features from voices. The algorithms are assessed using many evaluation metrics such as accuracy, precision, sensitivity, specificity, F-measure, and G-mean. Experimental results demonstrate that the OSELM algorithm achieves superior performance across both scenarios, with accuracies of up to 85.71 % in Scenario 1 and 80.77 % in Scenario 2, outperforming other algorithms. This novel approach highlights the reliability of OSELM and the importance of cross-database testing for developing robust and generalizable voice pathology detection systems.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] A Machine Learning-Based Voice Analysis for the Detection of Dysphagia Biomarkers
    Cesarini, Valerio
    Casiddu, Niccolo
    Porfirione, Claudia
    Massazza, Giulia
    Saggio, Giovanni
    Costantini, Giovanni
    2021 IEEE INTERNATIONAL WORKSHOP ON METROLOGY FOR INDUSTRY 4.0 & IOT (IEEE METROIND4.0 & IOT), 2021, : 407 - 411
  • [22] Vulnerabilities in Machine Learning-Based Voice Disorder Detection Systems
    Perelli, Gianpaolo
    Panzino, Andrea
    Casula, Roberto
    Micheletto, Marco
    Orru, Giulia
    Marcialis, Gian Luca
    2024 IEEE INTERNATIONAL WORKSHOP ON INFORMATION FORENSICS AND SECURITY, WIFS 2024, 2024,
  • [23] Voice pathology detection on spontaneous speech data using deep learning models
    Farazi, Sahar
    Shekofteh, Yasser
    International Journal of Speech Technology, 2024, 27 (03) : 739 - 751
  • [24] An Efficient SMOTE-Based Deep Learning Model for Voice Pathology Detection
    Lee, Ji-Na
    Lee, Ji-Yeoun
    APPLIED SCIENCES-BASEL, 2023, 13 (06):
  • [25] Identification of Voice Disorders: A Comparative Study of Machine Learning Algorithms
    Coelho, Sharal
    Shashirekha, Hosahalli Lakshmaiah
    SPEECH AND COMPUTER, SPECOM 2023, PT I, 2023, 14338 : 565 - 578
  • [26] Gender Recognition by Voice Using Machine Learning
    Bhatia, Rohit
    Singh, Nagendra Pratap
    ADVANCED NETWORK TECHNOLOGIES AND INTELLIGENT COMPUTING, ANTIC 2021, 2022, 1534 : 307 - 318
  • [27] Voice pathology detection by using the deep network architecture
    Ankishan, Haydar
    Inam, Sitki Cagdas
    APPLIED SOFT COMPUTING, 2021, 106
  • [28] Using Modulation Spectra for Voice Pathology Detection and Classification
    Markaki, Maria
    Stylianou, Yannis
    2009 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-20, 2009, : 2514 - 2517
  • [29] Voice Pathology Detection Using Vocal Tract Area
    Muhammad, Ghulam
    UKSIM-AMSS SEVENTH EUROPEAN MODELLING SYMPOSIUM ON COMPUTER MODELLING AND SIMULATION (EMS 2013), 2013, : 164 - 168
  • [30] DETECTION AND CLASSIFICATION OF VOICE PATHOLOGY USING FEATURE SELECTION
    Al Mojaly, Malak
    Muhammad, Ghulam
    Alsulaiman, Mansour
    2014 IEEE/ACS 11TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2014, : 571 - 577