AI Machine Learning-Based Diabetes Prediction in Older Adults in South Korea: Cross-Sectional Analysis

Cited by: 0
Authors
Lee, Hocheol [1]
Park, Myung-Bae [1]
Won, Young-Joo [1]
Affiliations
[1] Yonsei Univ, Coll Software & Digital Healthcare Convergence, Dept Hlth Adm, Yonseidae Gil 1, Wonju 26493, South Korea
Funding
National Research Foundation, Singapore;
Keywords
diabetes; prediction model; super-aging population; extreme gradient boosting model; geriatrics; older adults; aging; artificial intelligence; machine learning; HEALTH; HYPERTENSION; OBESITY;
DOI
10.2196/57874
Chinese Library Classification number
R19 [Health care organization and services (health services administration)];
Abstract
Background: Diabetes is prevalent in older adults, and machine learning algorithms could help predict diabetes in this population.

Objective: This study aimed to identify diabetes risk factors among adults aged ≥60 years using machine learning algorithms and to select an optimized prediction model.

Methods: This cross-sectional study included 3084 adults aged ≥60 years in Seoul, with data collected from January to November 2023. Data were collected using a mobile app (Gosufit) that measured depression, stress, anxiety, basal metabolic rate, oxygen saturation, heart rate, and average daily step count. Health coordinators recorded data on diabetes, hypertension, hyperlipidemia, chronic obstructive pulmonary disease, percent body fat, and percent muscle. The presence of diabetes was the target variable, with the other health indicators as predictors. Five machine learning algorithms were compared: random forest, gradient boosting model, light gradient boosting model, extreme gradient boosting model, and k-nearest neighbors. The dataset was split into a 70% training set and a 30% testing set. Model performance was evaluated using accuracy, precision, recall, F1 score, and area under the curve (AUC). Shapley additive explanations (SHAP) were used for model interpretability.

Results: Significant predictors of diabetes included hypertension (χ²(1)=197.294; P<.001), hyperlipidemia (χ²(1)=47.671; P<.001), age (mean: diabetes group 72.66 years vs nondiabetes group 71.81 years), stress (mean: diabetes group 42.68 vs nondiabetes group 41.47; t(3082)=-2.858; P=.004), and heart rate (mean: diabetes group 75.05 beats/min vs nondiabetes group 73.14 beats/min; t(3082)=-7.948; P<.001). The extreme gradient boosting model (XGBM) demonstrated the best performance, with an accuracy of 84.88%, precision of 77.92%, recall of 66.91%, F1 score of 72.00%, and AUC of 0.7957. The SHAP analysis of the top-performing XGBM identified the key predictors of diabetes as hypertension, age, percent body fat, heart rate, hyperlipidemia, basal metabolic rate, stress, and oxygen saturation. Hypertension strongly increased diabetes risk, while advanced age and elevated stress levels also showed significant associations. Hyperlipidemia and higher heart rates further raised the predicted probability of diabetes. These results show both the importance and the direction of effect of specific features in predicting diabetes, which can inform risk stratification and targeted interventions.

Conclusions: This study focused on modifiable risk factors, providing crucial data for establishing a system for the automated collection of health information and lifelog data from older adults using digital devices at service facilities.
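As a small worked check on the reported metrics, the F1 score is the harmonic mean of precision and recall, so it can be recomputed from the two values given in the abstract. The sketch below is illustrative only: the function name `f1_score` and the variable names are assumptions, not taken from the study's code.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (same units as the inputs)."""
    if precision + recall == 0:
        return 0.0  # avoid division by zero when both inputs are 0
    return 2 * precision * recall / (precision + recall)


# Values reported in the abstract for the XGBM, in percent.
reported_precision = 77.92
reported_recall = 66.91

f1 = f1_score(reported_precision, reported_recall)
print(f"F1 = {f1:.2f}")  # rounds to the reported 72.00
```

The recomputed value agrees with the F1 score stated in the abstract, which suggests the three figures are internally consistent.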
Pages: 9