Machine learning and atherosclerotic cardiovascular disease risk prediction in a multi-ethnic population

被引:0
|
作者
Andrew Ward
Ashish Sarraju
Sukyung Chung
Jiang Li
Robert Harrington
Paul Heidenreich
Latha Palaniappan
David Scheinker
Fatima Rodriguez
机构
[1] Stanford University,Department of Electrical Engineering
[2] Stanford University School of Medicine,Division of Cardiovascular Medicine
[3] Palo Alto Medical Foundation Research Institute,Division of Primary Care and Population Health
[4] Stanford University School of Medicine,Department of Management Science and Engineering
[5] Stanford University School of Engineering,Division of Pediatric Endocrinology and Diabetes
[6] Stanford University School of Medicine,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
The pooled cohort equations (PCE) predict atherosclerotic cardiovascular disease (ASCVD) risk in patients with characteristics within prespecified ranges and has uncertain performance among Asians or Hispanics. It is unknown if machine learning (ML) models can improve ASCVD risk prediction across broader diverse, real-world populations. We developed ML models for ASCVD risk prediction for multi-ethnic patients using an electronic health record (EHR) database from Northern California. Our cohort included patients aged 18 years or older with no prior CVD and not on statins at baseline (n = 262,923), stratified by PCE-eligible (n = 131,721) or PCE-ineligible patients based on missing or out-of-range variables. We trained ML models [logistic regression with L2 penalty and L1 lasso penalty, random forest, gradient boosting machine (GBM), extreme gradient boosting] and determined 5-year ASCVD risk prediction, including with and without incorporation of additional EHR variables, and in Asian and Hispanic subgroups. A total of 4309 patients had ASCVD events, with 2077 in PCE-ineligible patients. GBM performance in the full cohort, including PCE-ineligible patients (area under receiver-operating characteristic curve (AUC) 0.835, 95% confidence interval (CI): 0.825–0.846), was significantly better than that of the PCE in the PCE-eligible cohort (AUC 0.775, 95% CI: 0.755–0.794). Among patients aged 40–79, GBM performed similarly before (AUC 0.784, 95% CI: 0.759–0.808) and after (AUC 0.790, 95% CI: 0.765–0.814) incorporating additional EHR data. Overall, ML models achieved comparable or improved performance compared to the PCE while allowing risk discrimination in a larger group of patients including PCE-ineligible patients. EHR-trained ML models may help bridge important gaps in ASCVD risk prediction.
引用
收藏
相关论文
共 50 条
  • [21] Carotid artery displacement and cardiovascular disease risk in the Multi-Ethnic Study of Atherosclerosis
    Gepner, Adam D.
    McClelland, Robyn L.
    Korcarz, Claudia E.
    Young, Rebekah
    Kaufman, Joel D.
    Mitchell, Carol C.
    Stein, James H.
    VASCULAR MEDICINE, 2019, 24 (05) : 405 - 413
  • [22] Body composition and risk factors for cardiovascular disease in global multi-ethnic populations
    Jennifer L. Carter
    Noraidatulakma Abdullah
    Fiona Bragg
    Nor Azian Abdul Murad
    Hannah Taylor
    Chin Siok Fong
    Benjamin Lacey
    Paul Sherliker
    Fredrik Karpe
    Norlaila Mustafa
    Sarah Lewington
    Rahman Jamal
    International Journal of Obesity, 2023, 47 : 855 - 864
  • [23] Diabetes mellitus abolishes ethnic differences in cardiovascular risk factors: lessons from a multi-ethnic population
    Tan, CE
    Emmanuel, SC
    Tan, BY
    Tai, ES
    Chew, SK
    ATHEROSCLEROSIS, 2001, 155 (01) : 179 - 186
  • [24] A Multi-ethnic Mendelian Randomization Study of Moderate Alcohol Use and the Risk of Atherosclerotic Cardiovascular Disease in the Women's Health Initiative
    Li, Jin
    Salfati, Elias
    Patel, Chirag
    Eaton, Charles
    Nassir, Rami
    Stefanick, Marcia
    Reiner, Alexander P.
    Assimes, Themistocles L.
    CIRCULATION, 2016, 133
  • [25] The Association Between Coronary Artery Calcium and Atherosclerotic Cardiovascular Disease Risk in Women With Early Menopause: The Multi-Ethnic Study of Atherosclerosis
    Chu, Jian
    Michos, Erin D.
    Ouyang, Pamela
    Vaidya, Dhananjay
    Blumenthal, Roger S.
    Budoff, Matthew J.
    Blaha, Michael J.
    Whelton, Seamus P.
    CIRCULATION, 2020, 142
  • [26] A Machine Learning Approach for Risk Prediction of Cardiovascular Disease
    Panda, Shovna
    Palei, Shantilata
    Samartha, Mullapudi Venkata Sai
    Jena, Biswajit
    Saxena, Sanjay
    COMPUTER VISION AND IMAGE PROCESSING, CVIP 2023, PT II, 2024, 2010 : 313 - 323
  • [27] Self-Rated Health, Coronary Artery Calcium Scores, and Atherosclerotic Cardiovascular Disease Risk: The Multi-Ethnic Study of Atherosclerosis
    Orimoloye, Olusola A.
    Mirbolouk, Mohammadhassan
    Uddin, S. M. Iftekhar
    Dardari, Zeina
    Miedema, Michael D.
    Al-Mallah, Mouaz H.
    Yeboah, Joseph
    Blankstein, Ron
    Nasir, Khurram
    Blaha, Michael J.
    CIRCULATION, 2018, 138
  • [28] PROGRESSION OF CORONARY ARTERY CALCIUM AND LONG-TERM RISK FOR ATHEROSCLEROTIC CARDIOVASCULAR DISEASE EVENTS: THE MULTI-ETHNIC STUDY OF ATHEROSCLEROSIS
    Wong, Nathan D.
    Wu, Chuyue
    Fan, Wenjun
    Blaha, Michael J.
    Blumenthal, Roger S.
    Michos, Erin D.
    Yeboah, Joseph
    Budoff, Matthew J.
    JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2023, 81 (08) : 1649 - 1649
  • [29] MACHINE LEARNING IN PREDICTING CORONARY HEART DISEASE AND CARDIOVASCULAR DISEASE EVENTS: RESULTS FROM THE MULTI-ETHNIC STUDY OF ATHEROSCLEROSIS (MESA)
    Nakanishi, Rine
    Dey, Damini
    Commandeur, Frederic
    Slomka, Piotr
    Betancur, Julian
    Gransar, Heidi
    Dailing, Christopher
    Osawa, Kazuhiro
    Berman, Daniel
    Budoff, Matthew
    JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2018, 71 (11) : 1483 - 1483
  • [30] Prediction of Recurrent Atherosclerotic Cardiovascular Disease Risk Using Machine Learning and Electronic Health Record Data
    Sarraju, Ashish
    Ward, Andrew
    Chung, Sukyung
    Li, Jiang
    Scheinker, David
    Rodriguez, Fatima
    CIRCULATION, 2020, 142