Can machine-learning improve cardiovascular risk prediction using routine clinical data?

被引:689
|
作者
Weng, Stephen F. [1 ,2 ]
Reps, Jenna [3 ,4 ]
Kai, Joe [1 ,2 ]
Garibaldi, Jonathan M. [3 ,4 ]
Qureshi, Nadeem [1 ,2 ]
机构
[1] Univ Nottingham, NIHR Sch Primary Care Res, Nottingham, England
[2] Univ Nottingham, Sch Med, Div Primary Care, Nottingham, England
[3] Univ Nottingham, Adv Data Anal Ctr, Nottingham, England
[4] Univ Nottingham, Sch Comp Sci, Nottingham, England
来源
PLOS ONE | 2017年 / 12卷 / 04期
关键词
CORONARY EVENTS; VALIDATION; MODELS; REGRESSION; DISEASE; MUNSTER; PROFILE; WOMEN; MEN;
D O I
10.1371/journal.pone.0174944
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background Current approaches to predict cardiovascular risk fail to identify many people who would benefit from preventive treatment, while others receive unnecessary intervention. Machinelearning offers opportunity to improve accuracy by exploiting complex interactions between risk factors. We assessed whether machine-learning can improve cardiovascular risk prediction. Methods Prospective cohort study using routine clinical data of 378,256 patients from UK family practices, free from cardiovascular disease at outset. Four machine-learning algorithms (random forest, logistic regression, gradient boosting machines, neural networks) were compared to an established algorithm (American College of Cardiology guidelines) to predict first cardiovascular event over 10-years. Predictive accuracy was assessed by area under the 'receiver operating curve' (AUC); and sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV) to predict 7.5% cardiovascular risk (threshold for initiating statins). Findings 24,970 incident cardiovascular events (6.6%) occurred. Compared to the established risk prediction algorithm (AUC 0.728, 95% CI 0.723-0.735), machine-learning algorithms improved prediction: random forest + 1.7% (AUC 0.745, 95% CI 0.739-0.750), logistic regression + 3.2% (AUC 0.760, 95% CI 0.755-0.766), gradient boosting + 3.3% (AUC 0.761, 95% CI 0.755-0.766), neural networks + 3.6% (AUC 0.764, 95% CI 0.759-0.769). The highest achieving (neural networks) algorithm predicted 4,998/7,404 cases (sensitivity 67.5%, PPV 18.4%) and 53,458/75,585 non-cases (specificity 70.7%, NPV 95.7%), correctly predicting 355 (+ 7.6%) more patients who developed cardiovascular disease compared to the established algorithm. Conclusions Machine-learning significantly improves accuracy of cardiovascular risk prediction, increasing the number of patients identified who could benefit from preventive treatment, while avoiding unnecessary treatment of others.
引用
收藏
页数:14
相关论文
共 50 条
  • [11] Can machine learning improve prediction - an application with farm survey data
    Iffy, Jennifer
    Kuhns, Ryan
    Patrick, Kevin
    INTERNATIONAL FOOD AND AGRIBUSINESS MANAGEMENT REVIEW, 2018, 21 (08): : 1083 - 1098
  • [12] Cardiovascular disease risk prediction via machine learning using mental health data
    Dorraki, M.
    Liao, Z.
    Abbott, D.
    Psaltis, P. J.
    Baker, E.
    Bidargaddi, N.
    Van Den Hengel, A.
    Narula, J.
    Verjans, J. W.
    EUROPEAN HEART JOURNAL, 2022, 43 : 2784 - 2784
  • [13] A machine-learning approach for disease prediction can reduce misclassification and improve GWAS and polygenic scores
    Eick, Lisa
    Cordioli, Mattia
    Jukarainen, Sakari
    Ganna, Andrea
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2023, 31 : 52 - 52
  • [14] Importance of GWAS Risk Loci and Clinical Data in Predicting Asthma Using Machine-learning Approaches
    Qin, Zan-Mei
    Liang, Si-Qiao
    Long, Jian-Xiong
    Deng, Jing-Min
    Wei, Xuan
    Yang, Mei-Ling
    Tang, Shao-Jie
    Li, Hai-Li
    COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2024, 27 (03) : 400 - 407
  • [15] Osteoporosis Risk Prediction by Machine Learning Approach Using Clinical and Metabolomic Data
    Qiu, Chuan
    Zhang, Xiao
    Liu, Anqi
    Zhe, Luo
    Tian, Qing
    Shen, Hui
    Deng, Hong-Wen
    Su, Kuan-Jui
    Gong, Yun
    JOURNAL OF BONE AND MINERAL RESEARCH, 2023, 38 : 175 - 175
  • [16] Deep and machine learning models to improve risk prediction of cardiovascular disease using data extraction from electronic health records
    Korsakov, I.
    Gusev, A.
    Kuznetsova, T.
    Gavrilov, D.
    Novitskiy, R.
    EUROPEAN HEART JOURNAL, 2019, 40 : 1213 - 1213
  • [17] A machine-learning prediction model to identify risk of firearm injury using electronic health records data
    Zhou, Hui
    Nau, Claudia
    Xie, Fagen
    Contreras, Richard
    Grant, Deborah Ling
    Negriff, Sonya
    Sidell, Margo
    Koebnick, Corinna
    Hechter, Rulin
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2024, 31 (10) : 2173 - 2180
  • [18] Groundwater Prediction Using Machine-Learning Tools
    Hussein, Eslam A.
    Thron, Christopher
    Ghaziasgar, Mehrdad
    Bagula, Antoine
    Vaccari, Mattia
    ALGORITHMS, 2020, 13 (11)
  • [19] Developing a Machine Learning Algorithm Using Routine Clinical Data for the Detection of Hard Cardiovascular Disease Events
    Kelly, Ian
    Whitehill, Jacob R.
    Goldberg, Robert
    CIRCULATION, 2023, 148
  • [20] Risk prediction of cardiovascular disease using machine learning classifiers
    Pal, Madhumita
    Parija, Smita
    Panda, Ganapati
    Dhama, Kuldeep
    Mohapatra, Ranjan K.
    OPEN MEDICINE, 2022, 17 (01): : 1100 - 1113