CLASSIFICATION OF DIABETES USING ENSEMBLE MACHINE LEARNING TECHNIQUES

被引:0
|
作者
Ashisha G.R. [1 ]
Mary X.A. [2 ]
Raja J.M. [3 ]
机构
[1] Electronics and Instrumentation Engineering, Karunya Institute of Technology and Sciences, Coimbatore
[2] Robotics Engineering, Karunya Institute of Technology and Sciences, Coimbatore
[3] Computer Science Engineering, Karunya Institute of Technology and Sciences, Coimbatore
来源
Scalable Computing | 2024年 / 25卷 / 04期
关键词
Diabetes; Ensemble Voting Classifier; Gradient Boost; Machine Learning; Random Over Sampling;
D O I
10.12694/scpe.v25i4.2873
中图分类号
学科分类号
摘要
Diabetes is a widespread chronic condition that impacts people all over the globe and requires a clear and timely diagnosis. Untreated diabetes leads to retinopathy, nephropathy, and damage to the nervous system. In this context, Machine Learning (ML) might be used to detect health problems early, diagnose them, and track their progress. Ensemble techniques are a promising approach that combines many classifiers to improve forecast accuracy and resilience. This study investigates the categorization of diabetes using an ensemble machine learning technique known as a voting classifier. Using a variety of classifiers, including Light Gradient Boosting Machine (LightGBM), Gradient Boost classifier (GBC), and Random Forest (RF). The predictions are aggregated using voting methods to get a final classification result. The research is carried out using two benchmarking datasets: the Pima Indian Diabetes Dataset (PIDD) and the German Dataset. The Boruta technique is used to choose the best attributes from the datasets, while the Random Over Sampling approach balances the range of classes and eliminates abnormal data using the interquartile range approach. The findings showed that the combination of the Boruta feature selection algorithm and ensemble Voting Classifier performed better for both PIDD and German datasets with an accuracy of 93% and 90% respectively. These algorithms are evaluated and the maximum accuracy is produced using the combination of the Boruta feature selection algorithm and ensemble Voting Classifier. This research helps medical professionals in the early prediction of diabetes, reducing physician’s time. © 2024 SCPE.
引用
收藏
页码:3172 / 3180
页数:8
相关论文
共 50 条
  • [1] Diabetes Classification Using Machine Learning Techniques
    Phongying, Methaporn
    Hiriote, Sasiprapa
    COMPUTATION, 2023, 11 (05)
  • [2] Intelligent Classification of Liver Diseases using Ensemble Machine Learning Techniques
    Nithyashri
    Goel, Harsh
    Hada, Manvendra Singh
    2024 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT CYBER PHYSICAL SYSTEMS AND INTERNET OF THINGS, ICOICI 2024, 2024, : 1183 - 1188
  • [3] QUIC Network Traffic Classification Using Ensemble Machine Learning Techniques
    Almuhammadi, Sultan
    Alnajim, Abdullatif
    Ayub, Mohammed
    APPLIED SCIENCES-BASEL, 2023, 13 (08):
  • [4] DIABETES TWITTER ANALYSIS USING IMPROVED ENSEMBLE MACHINE LEARNING TECHNIQUES
    Prabha, V. Diviya
    Rathipriya, R.
    ADVANCES AND APPLICATIONS IN MATHEMATICAL SCIENCES, 2021, 21 (01): : 241 - 250
  • [5] A Novel Ensemble Bagging Classification Method for Breast Cancer Classification Using Machine Learning Techniques
    Ponnaganti, Naga Deepti
    Anitha, Raju
    TRAITEMENT DU SIGNAL, 2022, 39 (01) : 229 - 237
  • [6] Optimization of an Analysis Method for Diabetes Prediction Using Classical and Ensemble Machine Learning Techniques
    Naranjo, Edison
    Arguero, Berenice
    Hurtado, Remigio
    PROCEEDINGS OF NINTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2024, VOL 3, 2024, 1013 : 527 - 536
  • [7] Classification of Diabetes using Machine Learning
    Ul Islam, Nair
    Khanam, Ruqaiya
    2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL PERFORMANCE EVALUATION (COMPE-2021), 2021, : 185 - +
  • [8] Machine Learning Techniques for Diabetes Classification: A Comparative Study
    Mustafa, Hiri
    Mohamed, Chrayah
    Nabil, Ourdani
    Noura, Aknin
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (09) : 785 - 790
  • [9] Machine Learning Techniques for Classification of Diabetes and Cardiovascular Diseases
    Alic, Berina
    Gurbeta, Lejla
    Badnjevic, Almir
    2017 6TH MEDITERRANEAN CONFERENCE ON EMBEDDED COMPUTING (MECO), 2017, : 538 - 541
  • [10] A review on prediction of diabetes using machine learning and data mining classification techniques
    Pati, Abhilash
    Parhi, Manoranjan
    Pattanayak, Binod Kumar
    INTERNATIONAL JOURNAL OF BIOMEDICAL ENGINEERING AND TECHNOLOGY, 2023, 41 (01) : 83 - 109