Machine Learning Approaches for Stroke Risk Prediction: Findings from the Suita Study

被引:2
|
作者
Vu, Thien [1 ,2 ,3 ]
Kokubo, Yoshihiro [2 ]
Inoue, Mai [1 ,2 ]
Yamamoto, Masaki [1 ,2 ]
Mohsen, Attayeb [1 ]
Martin-Morales, Agustin [1 ,2 ]
Inoue, Takao [4 ]
Dawadi, Research [1 ,2 ]
Araki, Michihiro [1 ,2 ,5 ,6 ]
机构
[1] Natl Inst Biomed Innovat Hlth & Nutr, Artificial Intelligence Ctr Hlth & Biomed Res, 3-17 Senrioka shinmachi, Settsu 5660002, Japan
[2] Natl Cerebral & Cardiovasc Ctr, 6-1 Kishibe Shinmachi, Suita, Osaka 5648565, Japan
[3] Cho Ray Hosp, Cardiovasc Ctr, Dept Vasc Surg, Ho Chi Minh City 72713, Vietnam
[4] Yamato Univ, Fac Informat, 2-5-1 Katayama, Suita 5640082, Japan
[5] Kyoto Univ, Grad Sch Med, Dept Resp Med, 54 Shogoin Kawahara cho,Sakyo ku, Kyoto 6068507, Japan
[6] Kobe Univ, Grad Sch Sci Technol & Innovat, 1-1 Rokkodai Cho,Nada Ku, Kobe 6578501, Japan
基金
日本科学技术振兴机构;
关键词
stroke; supervised machine learning; unsupervised machine learning; logistic regression; random forest; support vector machine (SVM); extreme gradient boost (XGBoost); light gradient boosted machine (LightGBM); k-prototype clustering; Shapley Additive Explanations (SHAP); JAPANESE URBAN COHORT; CARDIOVASCULAR-DISEASE; HEMOGLOBIN CONCENTRATION; ATRIAL-FIBRILLATION; GLYCATED ALBUMIN; ISCHEMIC-STROKE; BLOOD-PRESSURE; ASSOCIATION; INCIDENT; FRUCTOSAMINE;
D O I
10.3390/jcdd11070207
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Stroke constitutes a significant public health concern due to its impact on mortality and morbidity. This study investigates the utility of machine learning algorithms in predicting stroke and identifying key risk factors using data from the Suita study, comprising 7389 participants and 53 variables. Initially, unsupervised k-prototype clustering categorized participants into risk clusters, while five supervised models including Logistic Regression (LR), Random Forest (RF), Support Vector Machine (SVM), Extreme Gradient Boosting (XGBoost), and Light Gradient Boosted Machine (LightGBM) were employed to predict stroke outcomes. Stroke incidence disparities among identified risk clusters using the unsupervised k-prototype clustering method are substantial, according to the findings. Supervised learning, particularly RF, was a preferable option because of the higher levels of performance metrics. The Shapley Additive Explanations (SHAP) method identified age, systolic blood pressure, hypertension, estimated glomerular filtration rate, metabolic syndrome, and blood glucose level as key predictors of stroke, aligning with findings from the unsupervised clustering approach in high-risk groups. Additionally, previously unidentified risk factors such as elbow joint thickness, fructosamine, hemoglobin, and calcium level demonstrate potential for stroke prediction. In conclusion, machine learning facilitated accurate stroke risk predictions and highlighted potential biomarkers, offering a data-driven framework for risk assessment and biomarker discovery.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Machine Learning Approaches for Pressure Injury Prediction
    Ahmad, Muhammad Aurangzeb
    Larson, Barrett
    Overman, Steve
    Kumar, Vikas
    Xie, Jing
    Rossington, Alan
    Patel, Ankur
    Teredesai, Ankur
    2021 IEEE 9TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2021), 2021, : 427 - 431
  • [42] Machine learning approaches for the prediction of materials properties
    Chibani, Siwar
    Coudert, Francois-Xavier
    APL MATERIALS, 2020, 8 (08)
  • [43] Prediction of chemical carcinogenicity by machine learning approaches
    Tan, N. X.
    Rao, H. B.
    Li, Z. R.
    Li, X. Y.
    SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2009, 20 (1-2) : 27 - 75
  • [44] Drug Toxicity Prediction by Machine Learning Approaches
    Shen, Yucong
    Shih, Frank Y.
    Chen, Hao
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (10)
  • [45] Prediction of Antibacterial Compounds by Machine Learning Approaches
    Yang, Xue-Gang
    Chen, Duan
    Wang, Min
    Xue, Ying
    Chen, Yu-Zong
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 2009, 30 (08) : 1202 - 1211
  • [46] Improving Stroke Outcome Prediction Using Molecular and Machine Learning Approaches in Large Vessel Occlusion
    Rout, Madhusmita
    Vaughan, April
    Sidorov, Evgeny V.
    Sanghera, Dharambir K.
    JOURNAL OF CLINICAL MEDICINE, 2024, 13 (19)
  • [47] SUMOylation Sites Prediction by Machine Learning Approaches
    Chen, Chi-Wei
    Tu, Chin-Hau
    Chu, Yen-Wei
    2018 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-TAIWAN (ICCE-TW), 2018,
  • [48] Early Stroke Prediction Using Machine Learning
    Sharma, Chetan
    Sharma, Shamneesh
    Kumar, Mukesh
    Sodhi, Ankur
    2022 INTERNATIONAL CONFERENCE ON DECISION AID SCIENCES AND APPLICATIONS (DASA), 2022, : 890 - 894
  • [49] A Machine Learning Approach in Prediction of Recurrent Stroke
    Park, Moon Ho
    Kwon, Do-Young
    Jung, Jin-Man
    STROKE, 2019, 50
  • [50] Stroke Prediction Using Deep Learning and Transfer Learning Approaches
    Shih, Dong-Her
    Wu, Yi-Huei
    Wu, Ting-Wei
    Chu, Huei-Ying
    Shih, Ming-Hung
    IEEE ACCESS, 2024, 12 : 130091 - 130104