Voice pathology detection using machine learning algorithms based on different voice databases

被引:0
|
作者
Latiff, Nurul Mu'azzah Abdul [1 ]
Al-Dhief, Fahad Taha [1 ,2 ]
Sazihan, Nurul Fariesya Suhaila Md [1 ]
Baki, Marina Mat [3 ]
Abd Malik, Nik Noordini Nik [1 ]
Albadr, Musatafa Abbas Abbood [4 ]
Abbas, Ali Hashim [5 ]
机构
[1] Univ Teknol Malaysia, Fac Elect Engn, Fac Engn, Utm Johor Bahru, Johor, Malaysia
[2] Univ Kebangsaan Malaysia, Fac Informat Sci & Technol, Bangi 43600, Malaysia
[3] Univ Kebangsaan Malaysia Med Ctr, Fac Med, Dept Otorhinolaryngol, Kuala Lumpur, Malaysia
[4] Basrah Univ Oil & Gas, Coll Ind Management Oil & Gas, Dept Petr Project Management, Al Basrah, Iraq
[5] Imam Jaafar Al Sadiq Univ, Coll Informat Technol, Dept Comp Tech Engn, Al Muthanna, Iraq
关键词
Machine learning; Voice pathology detection; OSELM; SVM; DT; NB; MFCC; SVD; MVPD; CLASSIFICATION; TRANSFORM; FEATURES;
D O I
10.1016/j.rineng.2025.103937
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The application of machine learning in analyzing voice disorders has become crucial for non-invasive voice pathology detection using voice signals. However, current systems face challenges such as low detection accuracy, limited databases, and evaluation metrics. More importantly, most existing studies rely on training and testing algorithms based on the same database, limiting their applicability in real-world scenarios with diverse data sources. Unlike traditional approaches that focus solely on single-database training and testing, this study presents a cross-database evaluation strategy to assess the robustness and generalizability of machine learning algorithms for voice pathology detection. Several algorithms, including Online Sequential Extreme Learning Machine (OSELM), Support Vector Machine (SVM), Decision Tree (DT), and Na & iuml;ve Bayes (NB), were evaluated using two databases: the Saarbrucken Voice Database (SVD) and the Malaysian Voice Pathology Database (MVPD). Two scenarios were considered: (1) training and testing on the same database and (2) training on one database and testing on another. The proposed study uses the Mel-Frequency Cepstral Coefficient (MFCC) technique for extracting features from voices. The algorithms are assessed using many evaluation metrics such as accuracy, precision, sensitivity, specificity, F-measure, and G-mean. Experimental results demonstrate that the OSELM algorithm achieves superior performance across both scenarios, with accuracies of up to 85.71 % in Scenario 1 and 80.77 % in Scenario 2, outperforming other algorithms. This novel approach highlights the reliability of OSELM and the importance of cross-database testing for developing robust and generalizable voice pathology detection systems.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Voice Pathology Detection Using Support Vector Machine Based on Different Number of Voice Signals
    AL-Dhief, Fahad Taha
    Latiff, Nurul Mu'azzah Abdul
    Baki, Marina Mat
    Abd Malik, Nik Noordini Nik
    Sabri, Naseer
    Albadr, Musatafa Abbas Abbood
    2021 26TH IEEE ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS {APCC), 2021, : 1 - 6
  • [2] Voice Pathology Detection Using Machine Learning Technique
    AL-Dhief, Fahad Taha
    Mu, Nurul
    Abd Malik, Nik Noordini Nik
    Sabri, Naseer
    Baki, Marina Mat
    Albadr, Musatafa Abbas Abbood
    Abbas, Aymen Fadhil
    Hussein, Yaqdhan Mahmood
    Mohammed, Mazin Abed
    2020 IEEE 5TH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATION TECHNOLOGIES (ISTT), 2020, : 99 - 104
  • [3] Voice disorder detection using machine learning algorithms: An application in speech and language pathology
    Rehman, Mujeeb Ur
    Shafique, Arslan
    Azhar, Qurat-Ul-Ain
    Jamal, Sajjad Shaukat
    Gheraibia, Youcef
    Usman, Aminu Bello
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [4] An Investigation of Multidimensional Voice Program Parameters in Three Different Databases for Voice Pathology Detection and Classification
    Al-nasheri, Ahmed
    Muhammad, Ghulam
    Alsulaiman, Mansour
    Ali, Zulfiqar
    Mesallam, Tamer A.
    Farahat, Mohamed
    Malki, Khalid H.
    Bencherif, Mohamed A.
    JOURNAL OF VOICE, 2017, 31 (01) : 113.e9 - 113.e18
  • [5] An Investigation of MDVP Parameters for Voice Pathology Detection on Three Different Databases
    Al-nasheri, Ahmed
    Ali, Zulfigar
    Muhammad, Ghulam
    Alsulaiman, Mansour
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2952 - 2956
  • [6] A Survey of Voice Pathology Surveillance Systems Based on Internet of Things and Machine Learning Algorithms
    Al-Dhief, Fahad Taha
    Latiff, Nurul Mu'azzah Abdul
    Abd Malik, Nik Noordini Nik
    Salim, Naseer Sabri
    Baki, Marina Mat
    Albadr, Musatafa Abbas Abbood
    Mohammed, Mazin Abed
    IEEE ACCESS, 2020, 8 : 64514 - 64533
  • [7] The accuracy of an Online Sequential Extreme Learning Machine in detecting voice pathology using the Malaysian Voice Pathology Database
    Nur Ain Nabila Za’im
    Fahad Taha AL-Dhief
    Mawaddah Azman
    Majid Razaq Mohamed Alsemawi
    Nurul Mu′azzah Abdul Latiff
    Marina Mat Baki
    Journal of Otolaryngology - Head & Neck Surgery, 52
  • [8] The accuracy of an Online Sequential Extreme Learning Machine in detecting voice pathology using the Malaysian Voice Pathology Database
    Za'im, Nur Ain Nabila
    AL-Dhief, Fahad Taha
    Azman, Mawaddah
    Alsemawi, Majid Razaq Mohamed
    Abdul Latiff, Nurul Mu'azzah
    Baki, Marina Mat
    JOURNAL OF OTOLARYNGOLOGY-HEAD & NECK SURGERY, 2023, 52 (01)
  • [9] Development of the Arabic Voice Pathology Database and Its Evaluation by Using Speech Features and Machine Learning Algorithms
    Mesallam, Tamer A.
    Farahat, Mohamed
    Malki, Khalid H.
    Alsulaiman, Mansour
    Ali, Zulfiqar
    Al-nasheri, Ahmed
    Muhammad, Ghulam
    JOURNAL OF HEALTHCARE ENGINEERING, 2017, 2017
  • [10] Voice Pathology Detection with MDVP Parameters Using Arabic Voice Pathology Database
    Al-nasheri, Ahmed
    Ali, Zulfiqar
    Muhammad, Ghulam
    Alsulaiman, Mansour
    Almalki, Khalid H.
    Mesallam, Tamer A.
    Farahat, Mohamed
    2015 5TH NATIONAL SYMPOSIUM ON INFORMATION TECHNOLOGY: TOWARDS NEW SMART WORLD (NSITNSW), 2015,