eDiaPredict: An Ensemble-based Framework for Diabetes Prediction

被引:31
|
作者
Singh, Ashima [1 ]
Dhillon, Arwinder [1 ]
Kumar, Neeraj [1 ]
Hossain, M. Shamim [2 ,3 ]
Muhammad, Ghulam [4 ]
Kumar, Manoj [5 ]
机构
[1] Thapar Univ, Comp Sci & Engn Dept, Patiala, Punjab, India
[2] King Saud Univ, Res Chair Pervas & Mobile Comp, Riyadh 11543, Saudi Arabia
[3] King Saud Univ, Dept Software Engn, Coll Comp & Informat Sci, Riyadh 11543, Saudi Arabia
[4] King Saud Univ, Coll Comp & Informat Sci, Dept Comp Engn, Riyadh, Saudi Arabia
[5] SMVD Univ, Katra, India
关键词
Diabetes prediction; ensembled models; XGBoost; decision tree; random forest;
D O I
10.1145/3415155
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Medical systems incorporate modern computational intelligence in healthcare. Machine learning techniques are applied to predict the onset and reoccurrence of the disease, identify biomarkers for survivability analysis depending upon certain health conditions of the patient. Early prediction of diseases like diabetes is essential as the number of diabetic patients of all age groups is increasing rapidly. To identify underlying reasons for the onset of diabetes in its early stage has become a challenging task for medical practitioners. Continuously increasing diabetic patient data has necessitated for the applications of efficient machine learning algorithms, which learns from the trends of the underlying data and recognizes the critical conditions in patients. In this article, an ensemble-based framework named eDiaPredict is proposed. It uses ensemble modeling, which includes an ensemble of different machine learning algorithms comprising XGBoost, Random Forest, Support Vector Machine, Neural Network, and Decision tree to predict diabetes status among patients. The performance of eDiaPredict has been evaluated using various performance parameters like accuracy, sensitivity, specificity, Gini Index, precision, area under curve, area under convex hull, minimum error rate, and minimum weighted coefficient. The effectiveness of the proposed approach is shown by its application on the PIMA Indian diabetes dataset wherein an accuracy of 95% is achieved.
引用
收藏
页数:26
相关论文
共 50 条
  • [1] A Novel Advanced Performance Ensemble-Based Model (APEM) Framework: A Case Study on Diabetes Prediction
    Yunianta, Arda
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2024, 15 (10) : 1193 - 1204
  • [2] Model Ensemble-Based Prognostic Framework for Fatigue Crack Growth Prediction
    Hoang-Phuong Nguyen
    Zio, Enrico
    Liu, Jie
    2017 2ND INTERNATIONAL CONFERENCE ON SYSTEM RELIABILITY AND SAFETY (ICSRS), 2017, : 327 - 331
  • [3] Accurate eQTL prioritization with an ensemble-based framework
    Zeng, Haoyang
    Edwards, Matthew D.
    Guo, Yuchun
    Gifford, David K.
    HUMAN MUTATION, 2017, 38 (09) : 1259 - 1265
  • [4] Ensemble-based prediction of RNA secondary structures
    Nima Aghaeepour
    Holger H Hoos
    BMC Bioinformatics, 14
  • [5] Hybrid Ensemble-Based Travel Mode Prediction
    Golik, Pawel
    Grzenda, Maciej
    Sienkiewicz, Elzbieta
    ADVANCES IN INTELLIGENT DATA ANALYSIS XXII, PT I, IDA 2024, 2024, 14641 : 191 - 202
  • [6] Ensemble-based Blackbox Attacks on Dense Prediction
    Cai, Zikui
    Tan, Yaoteng
    Asif, M. Salman
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 4045 - 4055
  • [7] Ensemble-based prediction of RNA secondary structures
    Aghaeepour, Nima
    Hoos, Holger H.
    BMC BIOINFORMATICS, 2013, 14
  • [8] An ensemble-based framework for mispronunciation detection of Arabic phonemes
    Calik, Sukru Selim
    Kucukmanisa, Ayhan
    Kilimci, Zeynep Hilal
    APPLIED ACOUSTICS, 2023, 212
  • [9] Towards ensemble-based use case point prediction
    Shukla, Suyash
    Kumar, Sandeep
    SOFTWARE QUALITY JOURNAL, 2023, 31 (03) : 843 - 864
  • [10] Towards ensemble-based use case point prediction
    Suyash Shukla
    Sandeep Kumar
    Software Quality Journal, 2023, 31 : 843 - 864