Predicting Type 2 Diabetes Using Logistic Regression and Machine Learning Approaches

被引:91
|
作者
Joshi, Ram D. [1 ]
Dhakal, Chandra K. [2 ]
机构
[1] Texas Tech Univ, Dept Econ, Lubbock, TX 79409 USA
[2] Univ Georgia, Dept Agr & Appl Econ, Athens, GA 30602 USA
关键词
decision tree; diabetes risk factors; machine learning; prediction accuracy; INSULIN-RESISTANCE; RISK-FACTORS; LIFE-STYLE; MELLITUS; RECOMMENDATIONS; POPULATION; DISEASES; OBESITY; TOOL;
D O I
10.3390/ijerph18147346
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Diabetes mellitus is one of the most common human diseases worldwide and may cause several health-related complications. It is responsible for considerable morbidity, mortality, and economic loss. A timely diagnosis and prediction of this disease could provide patients with an opportunity to take the appropriate preventive and treatment strategies. To improve the understanding of risk factors, we predict type 2 diabetes for Pima Indian women utilizing a logistic regression model and decision tree-a machine learning algorithm. Our analysis finds five main predictors of type 2 diabetes: glucose, pregnancy, body mass index (BMI), diabetes pedigree function, and age. We further explore a classification tree to complement and validate our analysis. The six-fold classification tree indicates glucose, BMI, and age are important factors, while the ten-node tree implies glucose, BMI, pregnancy, diabetes pedigree function, and age as the significant predictors. Our preferred specification yields a prediction accuracy of 78.26% and a cross-validation error rate of 21.74%. We argue that our model can be applied to make a reasonable prediction of type 2 diabetes, and could potentially be used to complement existing preventive measures to curb the incidence of diabetes and reduce associated costs.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Prediction of preterm birth in nulliparous women using logistic regression and machine learning
    Belaghi, Reza Arabi
    Beyene, Joseph
    McDonald, Sarah D.
    PLOS ONE, 2021, 16 (06):
  • [42] Prediction of the rate of penetration using logistic regression algorithm of machine learning model
    Deng S.
    Wei M.
    Xu M.
    Cai W.
    Arabian Journal of Geosciences, 2021, 14 (21)
  • [43] Predicting Methylphenidate Response in ADHD Using Machine Learning Approaches
    Kim, Jae-Won
    Sharma, Vinod
    Ryan, Neal D.
    INTERNATIONAL JOURNAL OF NEUROPSYCHOPHARMACOLOGY, 2015, 18 (11):
  • [44] Predicting Aquaculture Water Quality Using Machine Learning Approaches
    Li, Tingting
    Lu, Jian
    Wu, Jun
    Zhang, Zhenhua
    Chen, Liwei
    WATER, 2022, 14 (18)
  • [45] Predicting novel superconducting hydrides using machine learning approaches
    Hutcheon, Michael J.
    Shipley, Alice M.
    Needs, Richard J.
    PHYSICAL REVIEW B, 2020, 101 (14)
  • [46] Predicting Agriculture Yields Based on Machine Learning Using Regression and Deep Learning
    Sharma, Priyanka
    Dadheech, Pankaj
    Aneja, Nagender
    Aneja, Sandhya
    IEEE ACCESS, 2023, 11 : 111255 - 111264
  • [47] Logistic regression has similar performance to optimised machine learning algorithms in a clinical setting: application to the discrimination between type 1 and type 2 diabetes in young adults
    Anita L. Lynam
    John M. Dennis
    Katharine R. Owen
    Richard A. Oram
    Angus G. Jones
    Beverley M. Shields
    Lauric A. Ferrat
    Diagnostic and Prognostic Research, 4 (1)
  • [48] Explainable Machine Learning for Improving Logistic Regression Models
    Yang, Yimin
    Wu, Min
    2021 IEEE 19TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2021,
  • [49] Prediction of Type 2 Diabetes Occurrence Using Machine Learning Model
    Deberneh, Henock M.
    Kim, Intaek
    Park, Jae Hyun
    Cha, Eunseok
    Joung, Kyong Hye
    Lee, Jong Seon
    Lim, Dong Seok
    DIABETES, 2020, 69
  • [50] Prediction of Type 2 Diabetes using Machine Learning Classification Methods
    Tigga, Neha Prerna
    Garg, Shruti
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA SCIENCE, 2020, 167 : 706 - 716