eDiaPredict: An Ensemble-based Framework for Diabetes Prediction

Cited by: 31
Authors
Singh, Ashima [1]
Dhillon, Arwinder [1]
Kumar, Neeraj [1]
Hossain, M. Shamim [2, 3]
Muhammad, Ghulam [4]
Kumar, Manoj [5]
Affiliations
[1] Thapar Univ, Comp Sci & Engn Dept, Patiala, Punjab, India
[2] King Saud Univ, Res Chair Pervas & Mobile Comp, Riyadh 11543, Saudi Arabia
[3] King Saud Univ, Dept Software Engn, Coll Comp & Informat Sci, Riyadh 11543, Saudi Arabia
[4] King Saud Univ, Coll Comp & Informat Sci, Dept Comp Engn, Riyadh, Saudi Arabia
[5] SMVD Univ, Katra, India
Keywords
Diabetes prediction; ensembled models; XGBoost; decision tree; random forest
DOI
10.1145/3415155
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Medical systems incorporate modern computational intelligence in healthcare. Machine learning techniques are applied to predict the onset and recurrence of disease and to identify biomarkers for survivability analysis, depending on the patient's health conditions. Early prediction of diseases such as diabetes is essential because the number of diabetic patients in all age groups is increasing rapidly. Identifying the underlying reasons for the onset of diabetes at an early stage has become a challenging task for medical practitioners. The continuously growing volume of diabetic patient data calls for efficient machine learning algorithms that learn from trends in the underlying data and recognize critical conditions in patients. In this article, an ensemble-based framework named eDiaPredict is proposed. It uses ensemble modeling, combining several machine learning algorithms (XGBoost, Random Forest, Support Vector Machine, Neural Network, and Decision Tree) to predict diabetes status among patients. The performance of eDiaPredict is evaluated using accuracy, sensitivity, specificity, Gini index, precision, area under the curve, area under the convex hull, minimum error rate, and minimum weighted coefficient. The effectiveness of the proposed approach is demonstrated on the PIMA Indian diabetes dataset, on which it achieves an accuracy of 95%.
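As a rough illustration of the evaluation measures named in the abstract, the sketch below averages per-model probabilities and computes accuracy, sensitivity, specificity, precision, area under the curve, and a Gini index. Several things here are assumptions rather than details from the paper: the abstract does not say how the base models are combined, so simple probability averaging (soft voting) is used as a stand-in; the Gini index is taken as the common `2 * AUC - 1`; and all function names, the 0.5 threshold, and the example probabilities are hypothetical and are not results from the article.

```python
from typing import Dict, List, Sequence


def soft_vote(model_probs: Sequence[Sequence[float]]) -> List[float]:
    """Average the positive-class probability of each sample across models.

    Assumption: soft voting stands in for the paper's (unspecified) ensembling.
    """
    return [sum(column) / len(column) for column in zip(*model_probs)]


def classification_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Compute confusion-matrix metrics for binary labels (1 = diabetic)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return {
        "accuracy": (tp + tn) / len(pairs),
        "sensitivity": tp / (tp + fn) if tp + fn else 0.0,  # true positive rate
        "specificity": tn / (tn + fp) if tn + fp else 0.0,  # true negative rate
        "precision": tp / (tp + fp) if tp + fp else 0.0,
    }


def auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """Area under the ROC curve as the probability that a random positive
    sample is scored above a random negative one (ties count as half)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative sample")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def gini(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """Gini index, assumed here to be 2 * AUC - 1."""
    return 2.0 * auc(y_true, scores) - 1.0


if __name__ == "__main__":
    # Hypothetical labels and per-model probabilities, for illustration only.
    labels = [1, 0, 1, 1, 0, 0]
    probs = [
        [0.9, 0.2, 0.6, 0.5, 0.60, 0.1],
        [0.8, 0.4, 0.7, 0.7, 0.70, 0.2],
        [0.7, 0.3, 0.8, 0.6, 0.65, 0.3],
    ]
    scores = soft_vote(probs)
    preds = [1 if s >= 0.5 else 0 for s in scores]
    print(classification_metrics(labels, preds))
    print("AUC:", auc(labels, scores), "Gini:", gini(labels, scores))
```

The pairwise AUC computation is quadratic in the number of samples, which is acceptable for a dataset of a few hundred records but would need a rank-based version for larger data.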
Pages: 26