Predicting Type 2 Diabetes Using Logistic Regression and Machine Learning Approaches

被引:91
|
作者
Joshi, Ram D. [1 ]
Dhakal, Chandra K. [2 ]
机构
[1] Texas Tech Univ, Dept Econ, Lubbock, TX 79409 USA
[2] Univ Georgia, Dept Agr & Appl Econ, Athens, GA 30602 USA
关键词
decision tree; diabetes risk factors; machine learning; prediction accuracy; INSULIN-RESISTANCE; RISK-FACTORS; LIFE-STYLE; MELLITUS; RECOMMENDATIONS; POPULATION; DISEASES; OBESITY; TOOL;
D O I
10.3390/ijerph18147346
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Diabetes mellitus is one of the most common human diseases worldwide and may cause several health-related complications. It is responsible for considerable morbidity, mortality, and economic loss. A timely diagnosis and prediction of this disease could provide patients with an opportunity to take the appropriate preventive and treatment strategies. To improve the understanding of risk factors, we predict type 2 diabetes for Pima Indian women utilizing a logistic regression model and decision tree-a machine learning algorithm. Our analysis finds five main predictors of type 2 diabetes: glucose, pregnancy, body mass index (BMI), diabetes pedigree function, and age. We further explore a classification tree to complement and validate our analysis. The six-fold classification tree indicates glucose, BMI, and age are important factors, while the ten-node tree implies glucose, BMI, pregnancy, diabetes pedigree function, and age as the significant predictors. Our preferred specification yields a prediction accuracy of 78.26% and a cross-validation error rate of 21.74%. We argue that our model can be applied to make a reasonable prediction of type 2 diabetes, and could potentially be used to complement existing preventive measures to curb the incidence of diabetes and reduce associated costs.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Comparison of multiple linear regression and machine learning methods in predicting cognitive function in older Chinese type 2 diabetes patients
    Liu, Chi-Hao
    Peng, Chung-Hsin
    Huang, Li-Ying
    Chen, Fang-Yu
    Kuo, Chun-Heng
    Wu, Chung-Ze
    Cheng, Yu-Fang
    BMC NEUROLOGY, 2024, 24 (01)
  • [22] Comparison of multiple linear regression and machine learning methods in predicting cognitive function in older Chinese type 2 diabetes patients
    Chi-Hao Liu
    Chung-Hsin Peng
    Li-Ying Huang
    Fang-Yu Chen
    Chun-Heng Kuo
    Chung-Ze Wu
    Yu-Fang Cheng
    BMC Neurology, 24
  • [23] Predicting the water ecological criteria of copper using machine learning and multiple linear regression approaches.
    Yang, Xiao-Ling
    Wang, Meng-Xiao
    Li, Xiao-Juan
    Yuan, Ya-Wen
    Shao, Mei-Chen
    Mu, Yun-Song
    Bai, Ying-Chen
    Wu, Feng-Chang
    Zhongguo Huanjing Kexue/China Environmental Science, 2024, 44 (07): : 3976 - 3985
  • [24] Predicting Antidiabetic Peptide Activity: A Machine Learning Perspective on Type 1 and Type 2 Diabetes
    Cai, Kaida
    Zhang, Zhe
    Zhu, Wenzhou
    Liu, Xiangwei
    Yu, Tingqing
    Liao, Wang
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2024, 25 (18)
  • [25] Loan Repayment Prediction Using Logistic Regression Ensemble Learning With Machine Learning Algorithms
    Dinh, Thuan Nguyen
    Thanh, Binh Pham
    2022 9TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE, ISCMI, 2022, : 79 - 85
  • [26] Predicting misdiagnosed adult-onset type 1 diabetes using machine learning
    Cheheltani, Rabee
    King, Nicholas
    Lee, Suyin
    North, Benjamin
    Kovarik, Danny
    -Molina, Carmella Evans
    Leavitt, Nadejda
    Dutta, Sanjoy
    DIABETES RESEARCH AND CLINICAL PRACTICE, 2022, 191
  • [27] Diagnosis of Type 2 Diabetes and Pre-diabetes Using Machine Learning
    Severeyn, Erika
    Wong, Sara
    Velasquez, Jesus
    Perpinan, Gilberto
    Herrera, Hector
    Altuve, Miguel
    Diaz, Jose
    VIII LATIN AMERICAN CONFERENCE ON BIOMEDICAL ENGINEERING AND XLII NATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING, 2020, 75 : 792 - 802
  • [28] Predicting Passivhaus certification of dwellings using machine learning: A comparative analysis of logistic regression and gradient boosting decision trees
    Du, Yusheng
    Gou, Zhonghua
    JOURNAL OF BUILDING ENGINEERING, 2023, 79
  • [29] Predicting ipsilateral recurrence in women treated for ductal carcinoma in situ using machine learning and multivariable logistic regression models
    Lamb, Leslie R.
    Mercaldo, Sarah
    Kim, Geunwon
    Hovis, Keegan
    Oseni, Tawakalitu O.
    Bahl, Manisha
    CLINICAL IMAGING, 2022, 92 : 94 - 100
  • [30] The diabacare cloud: predicting diabetes using machine learning
    Alam, Mehtab
    Khan, Ihtiram Raza
    Alam, Mohammad Afshar
    Siddiqui, Farheen
    Tanweer, Safdar
    ACTA SCIENTIARUM-TECHNOLOGY, 2024, 46 (01)