Can machine-learning improve cardiovascular risk prediction using routine clinical data?

被引:689
|
作者
Weng, Stephen F. [1 ,2 ]
Reps, Jenna [3 ,4 ]
Kai, Joe [1 ,2 ]
Garibaldi, Jonathan M. [3 ,4 ]
Qureshi, Nadeem [1 ,2 ]
机构
[1] Univ Nottingham, NIHR Sch Primary Care Res, Nottingham, England
[2] Univ Nottingham, Sch Med, Div Primary Care, Nottingham, England
[3] Univ Nottingham, Adv Data Anal Ctr, Nottingham, England
[4] Univ Nottingham, Sch Comp Sci, Nottingham, England
来源
PLOS ONE | 2017年 / 12卷 / 04期
关键词
CORONARY EVENTS; VALIDATION; MODELS; REGRESSION; DISEASE; MUNSTER; PROFILE; WOMEN; MEN;
D O I
10.1371/journal.pone.0174944
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background Current approaches to predict cardiovascular risk fail to identify many people who would benefit from preventive treatment, while others receive unnecessary intervention. Machinelearning offers opportunity to improve accuracy by exploiting complex interactions between risk factors. We assessed whether machine-learning can improve cardiovascular risk prediction. Methods Prospective cohort study using routine clinical data of 378,256 patients from UK family practices, free from cardiovascular disease at outset. Four machine-learning algorithms (random forest, logistic regression, gradient boosting machines, neural networks) were compared to an established algorithm (American College of Cardiology guidelines) to predict first cardiovascular event over 10-years. Predictive accuracy was assessed by area under the 'receiver operating curve' (AUC); and sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV) to predict 7.5% cardiovascular risk (threshold for initiating statins). Findings 24,970 incident cardiovascular events (6.6%) occurred. Compared to the established risk prediction algorithm (AUC 0.728, 95% CI 0.723-0.735), machine-learning algorithms improved prediction: random forest + 1.7% (AUC 0.745, 95% CI 0.739-0.750), logistic regression + 3.2% (AUC 0.760, 95% CI 0.755-0.766), gradient boosting + 3.3% (AUC 0.761, 95% CI 0.755-0.766), neural networks + 3.6% (AUC 0.764, 95% CI 0.759-0.769). The highest achieving (neural networks) algorithm predicted 4,998/7,404 cases (sensitivity 67.5%, PPV 18.4%) and 53,458/75,585 non-cases (specificity 70.7%, NPV 95.7%), correctly predicting 355 (+ 7.6%) more patients who developed cardiovascular disease compared to the established algorithm. Conclusions Machine-learning significantly improves accuracy of cardiovascular risk prediction, increasing the number of patients identified who could benefit from preventive treatment, while avoiding unnecessary treatment of others.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Inpatient stroke rehabilitation: prediction of clinical outcomes using a machine-learning approach
    Harari, Yaar
    O'Brien, Megan K.
    Lieber, Richard L.
    Jayaraman, Arun
    JOURNAL OF NEUROENGINEERING AND REHABILITATION, 2020, 17 (01)
  • [22] Inpatient stroke rehabilitation: prediction of clinical outcomes using a machine-learning approach
    Yaar Harari
    Megan K. O’Brien
    Richard L. Lieber
    Arun Jayaraman
    Journal of NeuroEngineering and Rehabilitation, 17
  • [23] Prediction of Atherosclerotic Cardiovascular Disease Risk Using Machine Learning and Electronic Health Record Data
    Ward, Andrew
    Sarraju, Ashish
    Chung, Sukyung
    Palaniappan, Latha
    Scheinker, David
    Rodriguez, Fatima
    CIRCULATION, 2019, 140
  • [24] Improving Mortality Risk Prediction with Routine Clinical Data: A Practical Machine Learning Model Based on eICU Patients
    Zhao, Shangping
    Tang, Guanxiu
    Liu, Pan
    Wang, Qingyong
    Li, Guohui
    Ding, Zhaoyun
    INTERNATIONAL JOURNAL OF GENERAL MEDICINE, 2023, 16 : 3151 - 3161
  • [25] Identification of important factors in an inpatient fall risk prediction model to improve the quality of care using EHR and electronic administrative data: A machine-learning approach
    Lindberg, David S.
    Prosperi, Mattia
    Bjarnadottir, Ragnhildur I.
    Thomas, Jaime
    Crane, Marsha
    Chen, Zhaoyi
    Shear, Kristen
    Solberg, Laurence M.
    Snigurska, Urszula Alina
    Wu, Yonghui
    Xia, Yunpeng
    Lucero, Robert J.
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2020, 143
  • [26] Early prediction of clinical deterioration using data-driven machine-learning modeling of electronic health records
    Ruiz, Victor M.
    Goldsmith, Michael P.
    Shi, Lingyun
    Simpao, Allan F.
    Galvez, Jorge A.
    Naim, Maryam Y.
    Nadkarni, Vinay
    Gaynor, J. William
    Tsui, Fuchiang
    JOURNAL OF THORACIC AND CARDIOVASCULAR SURGERY, 2022, 164 (01): : 211 - +
  • [27] Clinical-learning versus machine-learning for transdiagnostic prediction of psychosis onset in individuals at-risk
    Fusar-Poli, Paolo
    Stringer, Dominic
    Durieux, Alice M. S.
    Rutigliano, Grazia
    Bonoldi, Ilaria
    De Micheli, Andrea
    Stahl, Daniel
    TRANSLATIONAL PSYCHIATRY, 2019, 9 (1)
  • [28] A machine-learning heuristic to improve gene score prediction of polygenic traits
    Pare, Guillaume
    Mao, Shihong
    Deng, Wei Q.
    SCIENTIFIC REPORTS, 2017, 7
  • [29] Clinical-learning versus machine-learning for transdiagnostic prediction of psychosis onset in individuals at-risk
    Paolo Fusar-Poli
    Dominic Stringer
    Alice M. S. Durieux
    Grazia Rutigliano
    Ilaria Bonoldi
    Andrea De Micheli
    Daniel Stahl
    Translational Psychiatry, 9
  • [30] A machine-learning heuristic to improve gene score prediction of polygenic traits
    Guillaume Paré
    Shihong Mao
    Wei Q. Deng
    Scientific Reports, 7