Dual-stage explainable ensemble learning model for diabetes diagnosis

被引:0
|
作者
Elgendy, Ibrahim A. [1 ]
Hosny, Mohamed [1 ]
Albashrawi, Mousa Ahmad [1 ]
Alsenan, Shrooq [2 ]
机构
[1] King Fahd Univ Petr & Minerals, KFUPM Business Sch, IRC Finance & Digital Econ, Dhahran 31261, Saudi Arabia
[2] Princess Nourah bint Abdulrahman Univ, Coll Comp & Informat Sci, Informat Syst Dept, Riyadh 11671, Saudi Arabia
关键词
Diabetes diagnosis; Ensemble learning; Explainable artificial intelligence; Autoencoder; Healthcare;
D O I
10.1016/j.eswa.2025.126899
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Early diagnosis of diabetes is crucial for effective management and prevention of complications. However, traditional diagnostic methods are often constrained by the complexity of clinical datasets. To this end, this study proposes a novel explainable machine learning (ML) framework to enhance diabetes prediction. Specifically, the developed methodology involves the detection of outliers using local outlier factor and data reconstruction through a sparse autoencoder. Subsequently, multiple imputation strategies are employed to effectively address missing or erroneous data, while the synthetic minority oversampling technique is applied to mitigate class imbalance. Afterward, a stacking ensemble model, consisting of seven base ML models, is developed for classification, and the outputs of these base models are aggregated using four meta models. To enhance interpretability, two layers of model explainability are implemented. Feature importance analysis is conducted to identify the significance of input variables and Shapley additive explanations is employed to assess the contribution of each base model to the meta model predictions. The results demonstrated that replacing missing data with zeros or mean values led to a noticeable decrease inaccuracy compared to Knearest neighbor imputation or removing samples. Notably, hypertension and kidney failure are pivotal features in the diabetes diagnosis process. Among the base models, Extra Trees model had the most significant impact on the meta model decisions. The stacking multi-layer perceptron model achieved the highest accuracy of 92.54% for diabetes detection, surpassing the performance of standalone ML techniques. This approach enhances diagnostic precision and provides transparency in model predictions, essential for clinical applications.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Position control of a dual-stage actuator
    Sasaki, Minoru
    Ozeki, Tomohito
    Ito, Satoshi
    Tamagawa, Hirohisa
    INTERNATIONAL JOURNAL OF APPLIED ELECTROMAGNETICS AND MECHANICS, 2010, 33 (1-2) : 839 - 847
  • [22] Internal model control for the dual-stage actuator in hard disk drives
    Al-Mamun, A
    Lee, TH
    Pan, L
    Suthasun, T
    IECON-2002: PROCEEDINGS OF THE 2002 28TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, VOLS 1-4, 2002, : 1606 - 1611
  • [23] Position control of a dual-stage actuator
    Sasaki, Minoru
    Ozeki, Tomohito
    Ito, Satoshi
    Tamagawa, Hirohisa
    APPLIED ELECTROMAGNETICS AND MECHANICS (II), 2009, 13 : 357 - 358
  • [24] A simplified dual-stage model predictive controller for modular multilevel converters
    Chakraborty, Rupak
    Gajare, Pranjal Mathu
    Chaki, Rupam
    Dey, Anubrata
    ELECTRIC POWER SYSTEMS RESEARCH, 2023, 223
  • [25] Chemomechanics of dual-stage reprocessable thermosets
    Luo, Chaoqian
    Zhang, Biao
    Zhang, Wang
    Yuan, Chao
    Dunn, Martin
    Ge, Qi
    Yu, Kai
    JOURNAL OF THE MECHANICS AND PHYSICS OF SOLIDS, 2019, 126 : 168 - 186
  • [26] Integrated dual-stage deformable mirrors
    Griffith, Mike
    Laycock, Leslie
    Archer, Nick
    Myers, Richard
    Kirby, Andrew
    Doel, Peter
    Brooks, David
    ADAPTIVE OPTICS SYSTEMS II, 2010, 7736
  • [27] DUAL-STAGE CORRELATED DIFFUSION IMAGING
    Wong, Alexander
    Khalvati, Farzad
    Haider, Masoom A.
    2015 IEEE 12th International Symposium on Biomedical Imaging (ISBI), 2015, : 75 - 78
  • [28] Random Forest Model Predictions Afford Dual-Stage Antimalarial Agents
    Mughal, Haseeb
    Bell, Elise C.
    Mughal, Khadija
    Derbyshire, Emily R.
    Freundlich, Joel S.
    ACS INFECTIOUS DISEASES, 2022, 8 (08): : 1553 - 1562
  • [29] Kinetic modeling of dual-stage oxidative extractive denitrogenation of model fuel
    Kumari, Snehlata
    Sengupta, Sonali
    CHEMICAL ENGINEERING COMMUNICATIONS, 2024, 211 (10) : 1572 - 1587
  • [30] A hybrid super ensemble learning model for the early-stage prediction of diabetes risk
    Dogru, Ayse
    Buyrukoglu, Selim
    Ari, Murat
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2023, 61 (03) : 785 - 797