Machine Learning for Predicting the 3-Year Risk of Incident Diabetes in Chinese Adults

Cited by: 19
Authors
Wu, Yang [1 ,2 ,3 ]
Hu, Haofei [3 ,4 ,5 ]
Cai, Jinlin [1 ,2 ,6 ]
Chen, Runtian [1 ,2 ,3 ]
Zuo, Xin [7 ]
Cheng, Heng [7 ]
Yan, Dewen [1 ,2 ,3 ]
Affiliations
[1] Shenzhen Univ, Affiliated Hosp 1, Dept Endocrinol, Shenzhen, Peoples R China
[2] Shenzhen Second Peoples Hosp, Dept Endocrinol, Shenzhen, Peoples R China
[3] Shenzhen Univ, Hlth Sci Ctr, Shenzhen, Peoples R China
[4] Shenzhen Univ, Affiliated Hosp 1, Dept Nephrol, Shenzhen, Peoples R China
[5] Shenzhen Second Peoples Hosp, Dept Nephrol, Shenzhen, Peoples R China
[6] Shantou Univ, Med Coll, Shantou, Peoples R China
[7] Third Peoples Hosp Shenzhen, Dept Endocrinol, Shenzhen, Peoples R China
Keywords
machine learning; extreme gradient boosting; simple stepwise model; Incident diabetes; risk; TYPE-2; MELLITUS; MODELS; COMPLICATIONS; NOMOGRAM; TRENDS; IMPACT; BMI;
DOI
10.3389/fpubh.2021.626331
Chinese Library Classification number
R1 [Preventive medicine, hygiene];
Subject classification code
1004 ; 120402 ;
Abstract
Purpose: We aimed to establish and validate a risk assessment system that combines demographic and clinical variables to predict the 3-year risk of incident diabetes in Chinese adults.
Methods: A 3-year cohort study was performed on 15,928 Chinese adults without diabetes at baseline. Participants were randomly divided into a training set (n = 7,940) and a validation set (n = 7,988). Extreme gradient boosting (XGBoost), a machine learning technique, was used to select the most important variables from the candidate variables, and a stepwise model was then built from the predictors chosen by XGBoost. The area under the receiver operating characteristic curve (AUC), decision curve analysis, and calibration analysis were used to assess the discrimination, clinical usefulness, and calibration of the model, respectively. External validation was performed on a cohort of 11,113 Japanese participants.
Results: In the training and validation sets, 148 and 145 incident diabetes cases occurred, respectively. XGBoost selected the 10 most important variables from 15 candidate variables; fasting plasma glucose (FPG), body mass index (BMI), and age were the top 3. A stepwise model and a prediction nomogram were then established. The AUCs of the stepwise model were 0.933 and 0.910 in the training and validation sets, respectively. The Hosmer-Lemeshow test showed no statistically significant difference between the predicted and observed diabetes risk (p = 0.068 for the training set, p = 0.165 for the validation set). Decision curve analysis supported the clinical usefulness of the stepwise model across a wide range of threshold probabilities. There were almost no interactions between the predictors (most P-values for interaction >0.05). Furthermore, the AUC in the external validation set was 0.830, and the Hosmer-Lemeshow test for the external validation set again showed no statistically significant difference between the predicted and observed diabetes risk (P = 0.824).
Conclusion: We established and validated a risk assessment system for characterizing the 3-year risk of incident diabetes.
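The three evaluation measures named in the abstract (AUC for discrimination, the Hosmer-Lemeshow statistic for calibration, and net benefit for decision curve analysis) can be illustrated with a minimal Python sketch. This is not the authors' code: the function names, the choice of 10 equal-size risk groups, and the toy inputs in the usage note are assumptions made for illustration only, and the textbook formulas are used rather than any particular statistics package.

```python
from typing import List, Sequence, Tuple


def auc(y_true: Sequence[int], y_prob: Sequence[float]) -> float:
    """AUC as the probability that a random case is scored above a random
    non-case; ties count as 0.5. O(n^2), fine for illustration."""
    cases = [p for y, p in zip(y_true, y_prob) if y == 1]
    controls = [p for y, p in zip(y_true, y_prob) if y == 0]
    if not cases or not controls:
        raise ValueError("need at least one case and one non-case")
    wins = 0.0
    for c in cases:
        for d in controls:
            if c > d:
                wins += 1.0
            elif c == d:
                wins += 0.5
    return wins / (len(cases) * len(controls))


def hosmer_lemeshow(y_true: Sequence[int], y_prob: Sequence[float],
                    groups: int = 10) -> Tuple[float, int]:
    """Hosmer-Lemeshow chi-square statistic and its degrees of freedom.

    Subjects are sorted by predicted risk and cut into `groups` bins of
    (nearly) equal size (an assumed grouping scheme). The statistic is
    compared against a chi-square distribution with (bins used - 2) df.
    """
    pairs: List[Tuple[float, int]] = sorted(zip(y_prob, y_true),
                                            key=lambda t: t[0])
    n = len(pairs)
    stat = 0.0
    used = 0
    for g in range(groups):
        chunk = pairs[g * n // groups:(g + 1) * n // groups]
        if not chunk:
            continue
        size = len(chunk)
        expected = sum(p for p, _ in chunk)   # expected number of cases
        observed = sum(y for _, y in chunk)   # observed number of cases
        used += 1
        # Skip bins whose variance term would be zero.
        if 0.0 < expected < size:
            stat += (observed - expected) ** 2 / (expected * (1 - expected / size))
    return stat, used - 2


def net_benefit(y_true: Sequence[int], y_prob: Sequence[float],
                threshold: float) -> float:
    """Net benefit at one threshold probability, the quantity plotted on a
    decision curve: TP/n - FP/n * pt / (1 - pt)."""
    if not 0.0 < threshold < 1.0:
        raise ValueError("threshold must be strictly between 0 and 1")
    n = len(y_true)
    tp = sum(1 for y, p in zip(y_true, y_prob) if p >= threshold and y == 1)
    fp = sum(1 for y, p in zip(y_true, y_prob) if p >= threshold and y == 0)
    return tp / n - fp / n * threshold / (1 - threshold)
```

For example, `auc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8])` evaluates to 0.75, and `net_benefit` on the same toy inputs at a threshold of 0.5 evaluates to 0.25. A decision curve is obtained by evaluating `net_benefit` over a grid of thresholds and comparing it with the treat-all and treat-none strategies.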
Pages: 12
Related papers
50 items in total
  • [21] Leukocyte Telomere Length Independently Predicts 3-Year Diabetes Risk in a Longitudinal Study of Chinese Population
    Liu, Yiwen
    Ma, Chifa
    Li, Pingping
    Ma, Chunxiao
    He, Shuli
    Ping, Fan
    Zhang, Huabing
    Li, Wei
    Xu, Lingling
    Li, Yuxiu
    OXIDATIVE MEDICINE AND CELLULAR LONGEVITY, 2020, 2020
  • [22] Machine Learning for Predicting the Risk of Transition from Prediabetes to Diabetes
    Zueger, Thomas
    Schallmoser, Simon
    Kraus, Mathias
    Saar-Tsechansky, Maytal
    Feuerriegel, Stefan
    Stettler, Christoph
    DIABETES TECHNOLOGY & THERAPEUTICS, 2022, 24 (11) : 842 - 847
  • [23] Hospitalization and mortality of diabetes in older adults - A 3-year prospective study
    Rosenthal, MJ
    Fajardo, M
    Gilmore, S
    Morley, JE
    Naliboff, BD
    DIABETES CARE, 1998, 21 (02) : 231 - 235
  • [24] Discrepancy between Lifetime Risk of ESKD vs. 3-Year Risk of CKD Progression in US Adults with Diabetes
    Obi, Yoshitsugu
    Zhu, Xiaoqian
    Tio, Maria Clarissa
    Yen, Timothy E.
    Hall, Michael E.
    Dossabhoy, Neville R.
    Shafi, Tariq
    JOURNAL OF THE AMERICAN SOCIETY OF NEPHROLOGY, 2024, 35 (10):
  • [25] Predicting incident heart failure in patients with type II diabetes: A machine learning approach
    Kaur, N.
    Pellicori, P.
    Deligianni, F.
    Jones, Y.
    Friday, J. M.
    Cleland, J. G. F.
    EUROPEAN JOURNAL OF HEART FAILURE, 2023, 25 : 427 - 427
  • [26] Machine learning model for predicting 1-year and 3-year all-cause mortality in ischemic heart failure patients
    Cai, Anping
    Chen, Rui
    Pang, Chengcheng
    Liu, Hui
    Zhou, Yingling
    Chen, Jiyan
    Li, Liwen
    POSTGRADUATE MEDICINE, 2022, 134 (08) : 810 - 819
  • [27] Effect of New Disability Subtype on 3-Year Mortality in Chinese Older Adults
    Feng, Qiushi
    Hoenig, Helen M.
    Gu, Danan
    Yi, Zeng
    Purser, Jama L.
    JOURNAL OF THE AMERICAN GERIATRICS SOCIETY, 2010, 58 (10) : 1952 - 1958
  • [28] Predicting stroke risk in Chinese hypertensive population using machine learning
    Huang, X.
    Cao, T. Y.
    Wei, Y. P.
    Xu, B.
    Wu, H. Y.
    Wu, Y. Q.
    Cheng, X. S.
    Xu, X. P.
    Liu, L. S.
    EUROPEAN HEART JOURNAL, 2021, 42 : 2489 - 2489
  • [29] Predicting 3-year all-cause mortality in rectal cancer patients based on body composition and machine learning
    Li, Xiangyong
    Zhou, Zeyang
    Zhang, Xiaoyang
    Cheng, Xinmeng
    Xing, Chungen
    Wu, Yong
    FRONTIERS IN NUTRITION, 2025, 12
  • [30] SEVERITY OF INCIDENT VERTEBRAL FRACTURE AND FUTURE FRACTURE RISK: A 3-YEAR PROSPECTIVE STUDY
    Bruyere, O.
    Roux, C.
    Nicolet, D.
    Fechtenbaum, J.
    Deroisy, R.
    Reginster, J. Y.
    OSTEOPOROSIS INTERNATIONAL, 2012, 23 : S60 - S61