AI Machine Learning-Based Diabetes Prediction in Older Adults in South Korea: Cross-Sectional Analysis

被引:0
|
作者
Lee, Hocheol [1 ]
Park, Myung-Bae [1 ]
Won, Young-Joo [1 ]
机构
[1] Yonsei Univ, Coll Software & Digital Healthcare Convergence, Dept Hlth Adm, Yonseidae Gil 1, Wonju 26493, South Korea
基金
新加坡国家研究基金会;
关键词
diabetes; prediction model; super-aging population; extreme gradient boosting model; geriatrics; older adults; aging; artificial intelligence; machine learning; HEALTH; HYPERTENSION; OBESITY;
D O I
10.2196/57874
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Diabetes is prevalent in older adults, and machine learning algorithms could help predict diabetes in this population. Objective: This study determined diabetes risk factors among older adults aged >= 60 years using machine learning algorithms and selected an optimized prediction model. Methods: This cross-sectional study was conducted on 3084 older adults aged >= 60 years in Seoul from January to November 2023. Data were collected using a mobile app (Gosufit) that measured depression, stress, anxiety, basal metabolic rate, oxygen saturation, heart rate, and average daily step count. Health coordinators recorded data on diabetes, hypertension, hyperlipidemia, chronic obstructive pulmonary disease, percent body fat, and percent muscle. The presence of diabetes was the target variable, with various health indicators as predictors. Machine learning algorithms, including random forest, gradient boosting model, light gradient boosting model, extreme gradient boosting model, and k-nearest neighbors, were employed for analysis. The dataset was split into 70% training and 30% testing sets. Model performance was evaluated using accuracy, precision, recall, F1 score, and area under the curve (AUC). Shapley additive explanations (SHAPs) were used for model interpretability. Results: Significant predictors of diabetes included hypertension (chi(2)1=197.294; P<.001), hyperlipidemia (chi(2)1=47.671; P<.001), age (mean: diabetes group 72.66 years vs nondiabetes group 71.81 years), stress (mean: diabetes group 42.68 vs nondiabetes group 41.47; t3082=-2.858; P=.004), and heart rate (mean: diabetes group 75.05 beats/min vs nondiabetes group 73.14 beats/min; t3082=-7.948; P<.001). The extreme gradient boosting model (XGBM) demonstrated the best performance, with an accuracy of 84.88%, precision of 77.92%, recall of 66.91%, F1 score of 72.00, and AUC of 0.7957. The SHAP analysis of the top-performing XGBM revealed key predictors for diabetes: hypertension, age, percent body fat, heart rate, hyperlipidemia, basal metabolic rate, stress, and oxygen saturation. Hypertension strongly increased diabetes risk, while advanced age and elevated stress levels also showed significant associations. Hyperlipidemia and higher heart rates further heightened diabetes probability. These results highlight the importance and directional impact of specific features in predicting diabetes, providing valuable insights for risk stratification and targeted interventions. Conclusions: This study focused on modifiable risk factors, providing crucial data for establishing a system for the automated collection of health information and lifelog data from older adults using digital devices at service facilities.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Identifying myoglobin as a mediator of diabetic kidney disease: a machine learning-based cross-sectional study
    Wu, Ruoru
    Shu, Zhihao
    Zou, Fei
    Zhao, Shaoli
    Chan, Saolai
    Hu, Yaxian
    Xiang, Hong
    Chen, Shuhua
    Fu, Li
    Cao, Dongsheng
    Lu, Hongwei
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [22] Machine Learning-based Predictive Model for Cross-sectional Properties of Pultruded Hybrid Composite Structure
    Hong, Chaeyoung
    Seo, Kyeong-Bae
    Ji, Wooseok
    COMPOSITES RESEARCH, 2025, 38 (01): : 50 - 55
  • [23] Machine learning-based future performance prediction model for bridge inspection and performance data in South Korea
    Choi, Yangrok
    Bae, Youngjae
    Kwon, Kyungrok
    Choi, Youngjin
    Sun, Jongwan
    Kong, Jung Sik
    ADVANCES IN STRUCTURAL ENGINEERING, 2025,
  • [24] Health care costs of cardiovascular disease in China: a machine learning-based cross-sectional study
    Lu, Mengjie
    Gao, Hong
    Shi, Chenshu
    Xiao, Yuyin
    Li, Xiyang
    Li, Lihua
    Li, Yan
    Li, Guohong
    FRONTIERS IN PUBLIC HEALTH, 2023, 11
  • [25] Identifying myoglobin as a mediator of diabetic kidney disease: a machine learning-based cross-sectional study
    Ruoru Wu
    Zhihao Shu
    Fei Zou
    Shaoli Zhao
    Saolai Chan
    Yaxian Hu
    Hong Xiang
    Shuhua Chen
    Li Fu
    Dongsheng Cao
    Hongwei Lu
    Scientific Reports, 12
  • [26] VALIDITY OF PREDICTION BASED ON CROSS-SECTIONAL ANALYSIS
    DELAMARE, G
    SERGEAN, R
    NATURE, 1961, 192 (480) : 1318 - &
  • [27] Health disparity and healthcare utilization inequity among older adults living in poverty in South Korea: a cross-sectional study
    Kim, Ah-Young
    Seo, Moon Sil
    Kang, Hye-Young
    BMC GERIATRICS, 2022, 22 (01)
  • [28] Health disparity and healthcare utilization inequity among older adults living in poverty in South Korea: a cross-sectional study
    Ah-Young Kim
    Moon Sil Seo
    Hye-Young Kang
    BMC Geriatrics, 22
  • [29] A machine learning-based diabetes risk prediction modeling study
    Ming, Jiexiu
    Xu, Junyi
    Zhang, Miaomiao
    Li, Ningyu
    Yan, Xu
    PROCEEDINGS OF 2024 INTERNATIONAL CONFERENCE ON COMPUTER AND MULTIMEDIA TECHNOLOGY, ICCMT 2024, 2024, : 363 - 369
  • [30] FUNCTIONAL STATUS IN OLDER ADULTS WITH PERIPHERAL ARTERY DISEASE IN KOREA: A CROSS-SECTIONAL STUDY
    Kim, Yesol
    Kim, Mihui
    Ryu, Gi Wook
    Choi, Mona
    INNOVATION IN AGING, 2021, 5 : 970 - 970