XEMLPD: an explainable ensemble machine learning approach for Parkinson disease diagnosis with optimized features

被引:0
|
作者
Fahmida Khanom [1 ]
Shuvo Biswas [2 ]
Mohammad Shorif Uddin [3 ]
Rafid Mostafiz [4 ]
机构
[1] American International University – Bangladesh,Department of Mathematics
[2] Mawlana Bhashani Science and Technology University,Department of Information and Communication Technology
[3] Jahangirnagar University,Department of Computer Science and Engineering
[4] Noakhali Science and Technology University,Institute of Information Technology
关键词
Parkinson's disease; Ensemble machine learning; Feature optimization; Explainable AI; CAD systems;
D O I
10.1007/s10772-024-10152-2
中图分类号
学科分类号
摘要
Parkinson's disease (PD) is a progressive neurological disorder that gradually worsens over time, making early diagnosis difficult. Traditionally, diagnosis relies on a neurologist's detailed assessment of the patient's medical history and multiple scans. Recently, artificial intelligence (AI)-based computer-aided diagnosis (CAD) systems have demonstrated superior performance by capturing complex, nonlinear patterns in clinical data. However, the opaque nature of many AI models, often referred to as "black box" systems, has raised concerns about their transparency, resulting in hesitation among clinicians to trust their outputs. To address this challenge, we propose an explainable ensemble machine learning framework, XEMLPD, designed to provide both global and local interpretability in PD diagnosis while maintaining high predictive accuracy. Our study utilized two clinical datasets, carefully curated and optimized through a two-step data preprocessing technique that handled outliers and ensured data balance, thereby reducing bias. Several ensemble machine learning (EML) models—boosting, bagging, stacking, and voting—were evaluated, with optimized features selected using techniques such as SelectedKBest, mRMR, PCA, and LDA. Among these, the stacking model combined with LDA feature optimization consistently delivered the highest accuracy. To ensure transparency, we integrated explainable AI methods—SHapley Adaptive exPlanations (SHAP) and Local Interpretable Model-agnostic Explanations (LIME)—into the stacking model. These methods were applied post-evaluation, ensuring that each prediction is accompanied by a detailed explanation. By offering both global and local interpretability, the XEMLPD framework provides clear insights into the decision-making process of the model. This transparency aids clinicians in developing better treatment strategies and enhances the overall prognosis for PD patients. Additionally, our framework serves as a valuable tool for clinical data scientists in creating more reliable and interpretable CAD systems.
引用
收藏
页码:1055 / 1083
页数:28
相关论文
共 50 条
  • [31] Application of Machine Learning to Parkinson’s Disease Diagnosis
    Li X.
    Jiang M.
    Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2024, 53 (02): : 315 - 320
  • [32] Applications of Machine Learning to Diagnosis of Parkinson's Disease
    Lai, Hong
    Li, Xu-Ying
    Xu, Fanxi
    Zhu, Junge
    Li, Xian
    Song, Yang
    Wang, Xianlin
    Wang, Zhanjun
    Wang, Chaodong
    BRAIN SCIENCES, 2023, 13 (11)
  • [33] Machine learning for assisting cervical cancer diagnosis: An ensemble approach
    Lu, Jiayi
    Song, Enmin
    Ghoneim, Ahmed
    Alrashoud, Mubarak
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 106 : 199 - 205
  • [34] Machine learning Ensemble for the Parkinson’s disease using protein sequences
    Priya Arora
    Ashutosh Mishra
    Avleen Malhi
    Multimedia Tools and Applications, 2022, 81 : 32215 - 32242
  • [35] Machine learning Ensemble for the Parkinson's disease using protein sequences
    Arora, Priya
    Mishra, Ashutosh
    Malhi, Avleen
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (22) : 32215 - 32242
  • [36] Ensemble transfer learning meets explainable AI: A deep learning approach for leaf disease detection
    Raval, Hetarth
    Chaki, Jyotismita
    ECOLOGICAL INFORMATICS, 2024, 84
  • [37] Optimized Ensemble Machine Learning Approach for Emotion Detection from Thermal Images
    Katual, Jayaprakash
    Kaul, Amit
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2024, 38 (02)
  • [38] Maximizing Biogas Yield Using an Optimized Stacking Ensemble Machine Learning Approach
    Mukasine, Angelique
    Sibomana, Louis
    Jayavel, Kayalvizhi
    Nkurikiyeyezu, Kizito
    Hitimana, Eric
    ENERGIES, 2024, 17 (02)
  • [39] Explainable machine learning for motor fault diagnosis
    Wang, Yuming
    Wang, Peng
    2023 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE, I2MTC, 2023,
  • [40] Tuberculosis Disease Diagnosis Based on an Optimized Machine Learning Model
    Hrizi, Olfa
    Gasmi, Karim
    Ben Ltaifa, Ibtihel
    Alshammari, Hamoud
    Karamti, Hanen
    Krichen, Moez
    Ben Ammar, Lassaad
    Mahmood, Mahmood A.
    JOURNAL OF HEALTHCARE ENGINEERING, 2022, 2022