Predicting Type 2 Diabetes Using Logistic Regression and Machine Learning Approaches

被引:91
|
作者
Joshi, Ram D. [1 ]
Dhakal, Chandra K. [2 ]
机构
[1] Texas Tech Univ, Dept Econ, Lubbock, TX 79409 USA
[2] Univ Georgia, Dept Agr & Appl Econ, Athens, GA 30602 USA
关键词
decision tree; diabetes risk factors; machine learning; prediction accuracy; INSULIN-RESISTANCE; RISK-FACTORS; LIFE-STYLE; MELLITUS; RECOMMENDATIONS; POPULATION; DISEASES; OBESITY; TOOL;
D O I
10.3390/ijerph18147346
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Diabetes mellitus is one of the most common human diseases worldwide and may cause several health-related complications. It is responsible for considerable morbidity, mortality, and economic loss. A timely diagnosis and prediction of this disease could provide patients with an opportunity to take the appropriate preventive and treatment strategies. To improve the understanding of risk factors, we predict type 2 diabetes for Pima Indian women utilizing a logistic regression model and decision tree-a machine learning algorithm. Our analysis finds five main predictors of type 2 diabetes: glucose, pregnancy, body mass index (BMI), diabetes pedigree function, and age. We further explore a classification tree to complement and validate our analysis. The six-fold classification tree indicates glucose, BMI, and age are important factors, while the ten-node tree implies glucose, BMI, pregnancy, diabetes pedigree function, and age as the significant predictors. Our preferred specification yields a prediction accuracy of 78.26% and a cross-validation error rate of 21.74%. We argue that our model can be applied to make a reasonable prediction of type 2 diabetes, and could potentially be used to complement existing preventive measures to curb the incidence of diabetes and reduce associated costs.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Diabetes Predicting mHealth Application Using Machine Learning
    Khan, Nabila Shahnaz
    Muaz, Mehedi Hasan
    Kabir, Anusha
    Islam, Muhammad Nazrul
    2017 IEEE INTERNATIONAL WIE CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (IEEE WIECON-ECE 2017), 2017, : 237 - 240
  • [32] Prediction of type 2 diabetes mellitus onset using logistic regression-based scorecards
    Edlitz, Yochai
    Segal, Eran
    ELIFE, 2022, 11
  • [33] Comparison between traditional logistic regression and machine learning for predicting mortality in adult sepsis patients
    Wu, Hongsheng
    Liao, Biling
    Ji, Tengfei
    Ma, Keqiang
    Luo, Yumei
    Zhang, Shengmin
    FRONTIERS IN MEDICINE, 2025, 11
  • [34] Comparison of machine learning and logistic regression models in predicting psoriasis treatment outcome: A scoping review
    Haw, W.
    Hussain, A.
    Reynolds, N. J.
    Griffiths, C.
    Peek, N.
    Warren, R. B.
    JOURNAL OF INVESTIGATIVE DERMATOLOGY, 2022, 142 (12) : S200 - S200
  • [35] DIABETES PREDICTION USING DIFFERENT MACHINE LEARNING APPROACHES
    Sonar, Priyanka
    JayaMalini, K.
    PROCEEDINGS OF THE 2019 3RD INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2019), 2019, : 367 - 371
  • [36] Predicting the Type of Nanostructure Using Data Mining Techniques and Multinomial Logistic Regression
    Shehadeh, Mahmoud
    Ebrahimi, Nader
    Ochigbo, Abel
    COMPLEX ADAPTIVE SYSTEMS 2012, 2012, 12 : 392 - 397
  • [37] Clinical Decision Support System for Diabetic Patients by Predicting Type 2 Diabetes Using Machine Learning Algorithms
    Islam R.
    Sultana A.
    Tuhin M.N.
    Saikat M.S.H.
    Islam M.R.
    Journal of Healthcare Engineering, 2023, 2023
  • [38] Performance analysis and prediction of type 2 diabetes mellitus based on lifestyle data using machine learning approaches
    Ganie, Shahid Mohammad
    Malik, Majid Bashir
    Arif, Tasleem
    JOURNAL OF DIABETES AND METABOLIC DISORDERS, 2022, 21 (01) : 339 - 352
  • [39] Performance analysis and prediction of type 2 diabetes mellitus based on lifestyle data using machine learning approaches
    Shahid Mohammad Ganie
    Majid Bashir Malik
    Tasleem Arif
    Journal of Diabetes & Metabolic Disorders, 2022, 21 : 339 - 352
  • [40] Advancing Breast Cancer Prediction using Logistic Regression and Machine Learning Techniques
    Bhuria, Ruchika
    Gill, Kanwarpartap Singh
    Malhotra, Sonal
    Singh, Mukesh
    2ND INTERNATIONAL CONFERENCE ON SUSTAINABLE COMPUTING AND SMART SYSTEMS, ICSCSS 2024, 2024, : 1374 - 1377