Voice pathology detection using machine learning algorithms based on different voice databases

被引:0
|
作者
Latiff, Nurul Mu'azzah Abdul [1 ]
Al-Dhief, Fahad Taha [1 ,2 ]
Sazihan, Nurul Fariesya Suhaila Md [1 ]
Baki, Marina Mat [3 ]
Abd Malik, Nik Noordini Nik [1 ]
Albadr, Musatafa Abbas Abbood [4 ]
Abbas, Ali Hashim [5 ]
机构
[1] Univ Teknol Malaysia, Fac Elect Engn, Fac Engn, Utm Johor Bahru, Johor, Malaysia
[2] Univ Kebangsaan Malaysia, Fac Informat Sci & Technol, Bangi 43600, Malaysia
[3] Univ Kebangsaan Malaysia Med Ctr, Fac Med, Dept Otorhinolaryngol, Kuala Lumpur, Malaysia
[4] Basrah Univ Oil & Gas, Coll Ind Management Oil & Gas, Dept Petr Project Management, Al Basrah, Iraq
[5] Imam Jaafar Al Sadiq Univ, Coll Informat Technol, Dept Comp Tech Engn, Al Muthanna, Iraq
关键词
Machine learning; Voice pathology detection; OSELM; SVM; DT; NB; MFCC; SVD; MVPD; CLASSIFICATION; TRANSFORM; FEATURES;
D O I
10.1016/j.rineng.2025.103937
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The application of machine learning in analyzing voice disorders has become crucial for non-invasive voice pathology detection using voice signals. However, current systems face challenges such as low detection accuracy, limited databases, and evaluation metrics. More importantly, most existing studies rely on training and testing algorithms based on the same database, limiting their applicability in real-world scenarios with diverse data sources. Unlike traditional approaches that focus solely on single-database training and testing, this study presents a cross-database evaluation strategy to assess the robustness and generalizability of machine learning algorithms for voice pathology detection. Several algorithms, including Online Sequential Extreme Learning Machine (OSELM), Support Vector Machine (SVM), Decision Tree (DT), and Na & iuml;ve Bayes (NB), were evaluated using two databases: the Saarbrucken Voice Database (SVD) and the Malaysian Voice Pathology Database (MVPD). Two scenarios were considered: (1) training and testing on the same database and (2) training on one database and testing on another. The proposed study uses the Mel-Frequency Cepstral Coefficient (MFCC) technique for extracting features from voices. The algorithms are assessed using many evaluation metrics such as accuracy, precision, sensitivity, specificity, F-measure, and G-mean. Experimental results demonstrate that the OSELM algorithm achieves superior performance across both scenarios, with accuracies of up to 85.71 % in Scenario 1 and 80.77 % in Scenario 2, outperforming other algorithms. This novel approach highlights the reliability of OSELM and the importance of cross-database testing for developing robust and generalizable voice pathology detection systems.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Voice Pathology Detection on the Saarbrucken Voice Database with Calibration and Fusion of Scores Using MultiFocal Toolkit
    Martinez, David
    Lleida, Eduardo
    Ortega, Alfonso
    Miguel, Antonio
    Villalba, Jesus
    ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, 2012, 328 : 99 - +
  • [42] Voice spoofing detection based on acoustic and glottal flow features using conventional machine learning techniques
    Raoudha Rahmeni
    Anis Ben Aicha
    Yassine Ben Ayed
    Multimedia Tools and Applications, 2022, 81 : 31443 - 31467
  • [43] Investigation of Glottal Flow Parameters for Voice Pathology Detection on SVD and MEEI Databases.
    Ezzine, Kadria
    Frikha, Mondher
    2018 4TH INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2018,
  • [44] Voice spoofing detection based on acoustic and glottal flow features using conventional machine learning techniques
    Rahmeni, Raoudha
    Ben Aicha, Anis
    Ben Ayed, Yassine
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (22) : 31443 - 31467
  • [45] Voice Detection in Traditionnal Tunisian Music using Audio Features and Supervised Learning Algorithms
    Ziadi, Wissem
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (01) : 26 - 31
  • [46] Voice Pathology Detection By Fuzzy Logic
    Panek, Dania
    Skalski, Andrzej
    Gajda, Janusz
    2015 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE (I2MTC), 2015, : 289 - 293
  • [47] An Online Learning Algorithm for Voice Activation Detection Based on a Pretrained Online Extreme Learning Machine
    Zhang, Tianle
    Hou, Muzhou
    Weng, Futian
    Yang, Yunlei
    Sun, Hongli
    Wang, Zheng
    Gao, Zhong
    Luo, Jianshu
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE2018), 2018,
  • [48] Joint learning for voice based disease detection
    Wu, Kebin
    Zhang, David
    Lu, Guangming
    Guo, Zhenhua
    PATTERN RECOGNITION, 2019, 87 : 130 - 139
  • [49] Demographic and Symptomatic Features of Voice Disorders and Their Potential Application in Classification Using Machine Learning Algorithms
    Tsui, Sheng-Yang
    Tsao, Yu
    Lin, Chii-Wann
    Fang, Shih-Hau
    Lin, Feng-Chuan
    Wang, Chi-Te
    FOLIA PHONIATRICA ET LOGOPAEDICA, 2018, 70 (3-4) : 174 - 182
  • [50] Gender Recognition by Voice using Machine Learning Techniques
    Jain, Sweta
    Pandey, Neha
    Choudhari, Vaidehi
    Yawalkar, Pratik
    Admane, Amey
    INTERNATIONAL JOURNAL OF NEXT-GENERATION COMPUTING, 2023, 14 (01): : 175 - 181