Machine Learning for Predicting the 3-Year Risk of Incident Diabetes in Chinese Adults

被引:19
|
作者
Wu, Yang [1 ,2 ,3 ]
Hu, Haofei [3 ,4 ,5 ]
Cai, Jinlin [1 ,2 ,6 ]
Chen, Runtian [1 ,2 ,3 ]
Zuo, Xin [7 ]
Cheng, Heng [7 ]
Yan, Dewen [1 ,2 ,3 ]
机构
[1] Shenzhen Univ, Affiliated Hosp 1, Dept Endocrinol, Shenzhen, Peoples R China
[2] Shenzhen Second Peoples Hosp, Dept Endocrinol, Shenzhen, Peoples R China
[3] Shenzhen Univ, Hlth Sci Ctr, Shenzhen, Peoples R China
[4] Shenzhen Univ, Affiliated Hosp 1, Dept Nephrol, Shenzhen, Peoples R China
[5] Shenzhen Second Peoples Hosp, Dept Nephrol, Shenzhen, Peoples R China
[6] Shantou Univ, Med Coll, Shantou, Peoples R China
[7] Third Peoples Hosp Shenzhen, Dept Endocrinol, Shenzhen, Peoples R China
关键词
machine learning; extreme gradient boosting; simple stepwise model; Incident diabetes; risk; TYPE-2; MELLITUS; MODELS; COMPLICATIONS; NOMOGRAM; TRENDS; IMPACT; BMI;
D O I
10.3389/fpubh.2021.626331
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Purpose: We aimed to establish and validate a risk assessment system that combines demographic and clinical variables to predict the 3-year risk of incident diabetes in Chinese adults. Methods: A 3-year cohort study was performed on 15,928 Chinese adults without diabetes at baseline. All participants were randomly divided into a training set (n = 7,940) and a validation set (n = 7,988). XGBoost method is an effective machine learning technique used to select the most important variables from candidate variables. And we further established a stepwise model based on the predictors chosen by the XGBoost model. The area under the receiver operating characteristic curve (AUC), decision curve and calibration analysis were used to assess discrimination, clinical use and calibration of the model, respectively. The external validation was performed on a cohort of 11,113 Japanese participants. Result: In the training and validation sets, 148 and 145 incident diabetes cases occurred. XGBoost methods selected the 10 most important variables from 15 candidate variables. Fasting plasma glucose (FPG), body mass index (BMI) and age were the top 3 important variables. And we further established a stepwise model and a prediction nomogram. The AUCs of the stepwise model were 0.933 and 0.910 in the training and validation sets, respectively. The Hosmer-Lemeshow test showed a perfect fit between the predicted diabetes risk and the observed diabetes risk (p = 0.068 for the training set, p = 0.165 for the validation set). Decision curve analysis presented the clinical use of the stepwise model and there was a wide range of alternative threshold probability spectrum. And there were almost no the interactions between these predictors (most P-values for interaction >0.05). Furthermore, the AUC for the external validation set was 0.830, and the Hosmer-Lemeshow test for the external validation set showed no statistically significant difference between the predicted diabetes risk and observed diabetes risk (P = 0.824). Conclusion: We established and validated a risk assessment system for characterizing the 3-year risk of incident diabetes.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] A prediction nomogram for the 3-year risk of incident diabetes among Chinese adults
    Yang Wu
    Haofei Hu
    Jinlin Cai
    Runtian Chen
    Xin Zuo
    Heng Cheng
    Dewen Yan
    Scientific Reports, 10
  • [2] A prediction nomogram for the 3-year risk of incident diabetes among Chinese adults
    Wu, Yang
    Hu, Haofei
    Cai, Jinlin
    Chen, Runtian
    Zuo, Xin
    Cheng, Heng
    Yan, Dewen
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [3] Nomogram Predicting the Risk of Progression from Prediabetes to Diabetes After a 3-Year Follow-Up in Chinese Adults
    Liang, Kai
    Guo, Xinghong
    Wang, Chuan
    Yan, Fei
    Wang, Lingshu
    Liu, Jinbo
    Hou, Xinguo
    Li, Wenjuan
    Li Chen
    DIABETES METABOLIC SYNDROME AND OBESITY, 2021, 14 : 2641 - 2649
  • [4] Predicting the Risk of Incident Type 2 Diabetes Mellitus in Chinese Elderly Using Machine Learning Techniques
    Liu, Qing
    Zhang, Miao
    He, Yifeng
    Zhang, Lei
    Zou, Jingui
    Yan, Yaqiong
    Guo, Yan
    JOURNAL OF PERSONALIZED MEDICINE, 2022, 12 (06):
  • [5] Predicting the 2-Year Risk of Progression from Prediabetes to Diabetes Using Machine Learning among Chinese Elderly Adults
    Liu, Qing
    Zhou, Qing
    He, Yifeng
    Zou, Jingui
    Guo, Yan
    Yan, Yaqiong
    JOURNAL OF PERSONALIZED MEDICINE, 2022, 12 (07):
  • [6] Machine learning for predicting diabetes risk in western China adults
    Lin Li
    Yinlin Cheng
    Weidong Ji
    Mimi Liu
    Zhensheng Hu
    Yining Yang
    Yushan Wang
    Yi Zhou
    Diabetology & Metabolic Syndrome, 15
  • [7] Machine learning for predicting diabetes risk in western China adults
    Li, Lin
    Cheng, Yinlin
    Ji, Weidong
    Liu, Mimi
    Hu, Zhensheng
    Yang, Yining
    Wang, Yushan
    Zhou, Yi
    DIABETOLOGY & METABOLIC SYNDROME, 2023, 15 (01):
  • [8] Novel nomogram for predicting the 3-year incidence risk of osteoporosis in a Chinese male population
    Mao, Yaqian
    Xu, Lizhen
    Xue, Ting
    Liang, Jixing
    Lin, Wei
    Wen, Junping
    Huang, Huibin
    Li, Liantao
    Chen, Gang
    ENDOCRINE CONNECTIONS, 2021, 10 (09) : 1111 - 1124
  • [9] Novel Subgroups of the Three-Year Risk of Incident Diabetes and Complications in Chinese Diabetes-Free Adults
    Guo, Lixin
    Wang, Weihao
    DIABETES, 2024, 73
  • [10] Baseline osteocalcin levels and incident diabetes in a 3-year prospective study of high-risk individuals
    Liatis, S.
    Sfikakis, P. P.
    Tsiakou, A.
    Stathi, C.
    Terpos, E.
    Katsilambros, N.
    Makrilakis, K.
    DIABETES & METABOLISM, 2014, 40 (03) : 198 - 203