Voice pathology detection using machine learning algorithms based on different voice databases

被引:0
|
作者
Latiff, Nurul Mu'azzah Abdul [1 ]
Al-Dhief, Fahad Taha [1 ,2 ]
Sazihan, Nurul Fariesya Suhaila Md [1 ]
Baki, Marina Mat [3 ]
Abd Malik, Nik Noordini Nik [1 ]
Albadr, Musatafa Abbas Abbood [4 ]
Abbas, Ali Hashim [5 ]
机构
[1] Univ Teknol Malaysia, Fac Elect Engn, Fac Engn, Utm Johor Bahru, Johor, Malaysia
[2] Univ Kebangsaan Malaysia, Fac Informat Sci & Technol, Bangi 43600, Malaysia
[3] Univ Kebangsaan Malaysia Med Ctr, Fac Med, Dept Otorhinolaryngol, Kuala Lumpur, Malaysia
[4] Basrah Univ Oil & Gas, Coll Ind Management Oil & Gas, Dept Petr Project Management, Al Basrah, Iraq
[5] Imam Jaafar Al Sadiq Univ, Coll Informat Technol, Dept Comp Tech Engn, Al Muthanna, Iraq
关键词
Machine learning; Voice pathology detection; OSELM; SVM; DT; NB; MFCC; SVD; MVPD; CLASSIFICATION; TRANSFORM; FEATURES;
D O I
10.1016/j.rineng.2025.103937
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The application of machine learning in analyzing voice disorders has become crucial for non-invasive voice pathology detection using voice signals. However, current systems face challenges such as low detection accuracy, limited databases, and evaluation metrics. More importantly, most existing studies rely on training and testing algorithms based on the same database, limiting their applicability in real-world scenarios with diverse data sources. Unlike traditional approaches that focus solely on single-database training and testing, this study presents a cross-database evaluation strategy to assess the robustness and generalizability of machine learning algorithms for voice pathology detection. Several algorithms, including Online Sequential Extreme Learning Machine (OSELM), Support Vector Machine (SVM), Decision Tree (DT), and Na & iuml;ve Bayes (NB), were evaluated using two databases: the Saarbrucken Voice Database (SVD) and the Malaysian Voice Pathology Database (MVPD). Two scenarios were considered: (1) training and testing on the same database and (2) training on one database and testing on another. The proposed study uses the Mel-Frequency Cepstral Coefficient (MFCC) technique for extracting features from voices. The algorithms are assessed using many evaluation metrics such as accuracy, precision, sensitivity, specificity, F-measure, and G-mean. Experimental results demonstrate that the OSELM algorithm achieves superior performance across both scenarios, with accuracies of up to 85.71 % in Scenario 1 and 80.77 % in Scenario 2, outperforming other algorithms. This novel approach highlights the reliability of OSELM and the importance of cross-database testing for developing robust and generalizable voice pathology detection systems.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] PHASE-BASED INFORMATION FOR VOICE PATHOLOGY DETECTION
    Drugman, Thomas
    Dubuisson, Thomas
    Dutoit, Thierry
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4612 - 4615
  • [32] Detection of Minor and Major Depression through Voice as a Biomarker Using Machine Learning
    Shin, Daun
    Cho, Won Ik
    Park, C. Hyung Keun
    Rhee, Sang Jin
    Kim, Min Ji
    Lee, Hyunju
    Kim, Nam Soo
    Ahn, Yong Min
    JOURNAL OF CLINICAL MEDICINE, 2021, 10 (14)
  • [33] Voice Based Authentication System for Web Applications using Machine Learning
    Kadu, Rakesh K.
    Assudani, Purshottam J.
    Bhojane, Sahil
    Agrawal, Tanish
    Siddhawar, Vidhi
    Kale, Yash
    INTERNATIONAL JOURNAL OF NEXT-GENERATION COMPUTING, 2022, 13 (05): : 1312 - 1320
  • [34] Detection of voice impairment for parkinson's disease using machine learning tools
    Laila, Radouani
    Salwa, Lagdali
    Mohammed, Rziza
    2020 10TH INTERNATIONAL SYMPOSIUM ON SIGNAL, IMAGE, VIDEO AND COMMUNICATIONS (ISIVC), 2021,
  • [35] Investigation of Voice Pathology Detection and Classification on Different Frequency Regions Using Correlation Functions
    Al-nasheri, Ahmed
    Muhammad, Ghulam
    Alsulaiman, Mansour
    Ali, Zulfiqar
    JOURNAL OF VOICE, 2017, 31 (01) : 3 - 15
  • [36] Automatic COVID-19 detection using machine learning and voice recording
    Benmalek E.
    Elmhamdi J.
    Jilbab A.
    Jbari A.
    Research on Biomedical Engineering, 2023, 39 (03) : 597 - 612
  • [37] Evaluation of Glottal Epoch Detection Algorithms on Different Voice Types
    Cabral, Joao P.
    Kane, John
    Gobl, Christer
    Carson-Berndsen, Julie
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2000 - +
  • [38] Meta-analysis of voice disorders databases and applied machine learning techniques
    Syed S.A.
    Rashid M.
    Hussain S.
    Mathematical Biosciences and Engineering, 2020, 17 (06): : 7958 - 7979
  • [39] Meta-analysis of voice disorders databases and applied machine learning techniques
    Syed, Sidra Abid
    Rashid, Munaf
    Hussain, Samreen
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2020, 17 (06) : 7958 - 7979
  • [40] A Novel Voice Feature AVA and its Application to the Pathological Voice Detection Through Machine Learning
    Altaf, Abdulrehman
    Mahdin, Hairulnizam
    Maskat, Ruhaila
    Shaharudin, Shazlyn Milleana
    Altaf, Abdullah
    Mahmood, Awais
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (09) : 1085 - 1092