Can machine-learning improve cardiovascular risk prediction using routine clinical data?

Cited by: 689
Authors
Weng, Stephen F. [1, 2]
Reps, Jenna [3, 4]
Kai, Joe [1, 2]
Garibaldi, Jonathan M. [3, 4]
Qureshi, Nadeem [1, 2]
Affiliations
[1] Univ Nottingham, NIHR Sch Primary Care Res, Nottingham, England
[2] Univ Nottingham, Sch Med, Div Primary Care, Nottingham, England
[3] Univ Nottingham, Adv Data Anal Ctr, Nottingham, England
[4] Univ Nottingham, Sch Comp Sci, Nottingham, England
Source
PLOS ONE | 2017 / Vol. 12 / Issue 04
Keywords
CORONARY EVENTS; VALIDATION; MODELS; REGRESSION; DISEASE; MUNSTER; PROFILE; WOMEN; MEN;
DOI
10.1371/journal.pone.0174944
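Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code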
07 ; 0710 ; 09 ;
Abstract
Background: Current approaches to predict cardiovascular risk fail to identify many people who would benefit from preventive treatment, while others receive unnecessary intervention. Machine-learning offers an opportunity to improve accuracy by exploiting complex interactions between risk factors. We assessed whether machine-learning can improve cardiovascular risk prediction.
Methods: Prospective cohort study using routine clinical data of 378,256 patients from UK family practices, free from cardiovascular disease at outset. Four machine-learning algorithms (random forest, logistic regression, gradient boosting machines, neural networks) were compared to an established algorithm (American College of Cardiology guidelines) to predict first cardiovascular event over 10 years. Predictive accuracy was assessed by area under the receiver operating characteristic curve (AUC), and by sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) at the 7.5% cardiovascular risk level (the threshold for initiating statins).
Findings: 24,970 incident cardiovascular events (6.6%) occurred. Compared to the established risk prediction algorithm (AUC 0.728, 95% CI 0.723-0.735), machine-learning algorithms improved prediction: random forest +1.7% (AUC 0.745, 95% CI 0.739-0.750), logistic regression +3.2% (AUC 0.760, 95% CI 0.755-0.766), gradient boosting +3.3% (AUC 0.761, 95% CI 0.755-0.766), neural networks +3.6% (AUC 0.764, 95% CI 0.759-0.769). The highest-achieving algorithm (neural networks) predicted 4,998/7,404 cases (sensitivity 67.5%, PPV 18.4%) and 53,458/75,585 non-cases (specificity 70.7%, NPV 95.7%), correctly predicting 355 (+7.6%) more patients who developed cardiovascular disease compared to the established algorithm.
Conclusions: Machine-learning significantly improves accuracy of cardiovascular risk prediction, increasing the number of patients identified who could benefit from preventive treatment, while avoiding unnecessary treatment of others.
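The reported sensitivity, specificity, PPV and NPV for the neural network can be reproduced from the four counts given in the abstract. The following is a minimal sketch of that arithmetic, not the authors' code: the class and variable names are hypothetical, and the established algorithm's count of correctly predicted cases (4,643) is not stated in the abstract but is derived here as 4,998 minus 355. It also shows that the "+3.6%" style figures correspond to absolute AUC differences in percentage points.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """2x2 confusion-matrix counts (hypothetical helper for illustration)."""
    tp: int  # cases correctly predicted at or above the 7.5% threshold
    fn: int  # cases missed
    tn: int  # non-cases correctly predicted below the threshold
    fp: int  # non-cases predicted at or above the threshold

    @property
    def sensitivity(self) -> float:
        return self.tp / (self.tp + self.fn)

    @property
    def specificity(self) -> float:
        return self.tn / (self.tn + self.fp)

    @property
    def ppv(self) -> float:
        return self.tp / (self.tp + self.fp)

    @property
    def npv(self) -> float:
        return self.tn / (self.tn + self.fn)


# Counts taken from the abstract (neural network results).
cases, cases_predicted = 7_404, 4_998
non_cases, non_cases_predicted = 75_585, 53_458

nn = ConfusionCounts(
    tp=cases_predicted,
    fn=cases - cases_predicted,
    tn=non_cases_predicted,
    fp=non_cases - non_cases_predicted,
)

# AUC gain over the established algorithm, in percentage points.
auc_established, auc_nn = 0.728, 0.764
auc_gain_points = (auc_nn - auc_established) * 100

# Assumption: the established algorithm's correctly predicted cases are
# derived as 4,998 - 355; this number is not stated in the abstract.
extra_cases = 355
established_tp = cases_predicted - extra_cases
relative_gain = extra_cases / established_tp

print(f"sensitivity {nn.sensitivity:.1%}, specificity {nn.specificity:.1%}")
print(f"PPV {nn.ppv:.1%}, NPV {nn.npv:.1%}")
print(f"AUC gain {auc_gain_points:.1f} points, extra cases {relative_gain:.1%}")
```

The low PPV alongside a high NPV follows from the low event rate in the cohort: most patients flagged above the threshold do not go on to have an event, while patients below it rarely do.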
Pages: 14
Related papers
50 in total
  • [31] MACHINE LEARNING AND COMPUTER VISION OF BONE MICROARCHITECTURE CAN IMPROVE THE FRACTURE RISK PREDICTION PROVIDED BY DXA AND CLINICAL RISK FACTORS
    Fuggle, N.
    Lu, S.
    Breasail, M. O.
    Westbury, L.
    Ward, K.
    Dennison, E.
    Mahmoodi, S.
    Niranjan, M.
    Cooper, C.
    AGING CLINICAL AND EXPERIMENTAL RESEARCH, 2022, 34 (SUPPL 1) : S43 - S43
  • [32] MACHINE LEARNING AND COMPUTER VISION OF BONE MICROARCHITECTURE CAN IMPROVE THE FRACTURE RISK PREDICTION PROVIDED BY DXA AND CLINICAL RISK FACTORS
    Fuggle, Nicholas R.
    Lu, Shengyu
    Breasail, Micheal O.
    Westbury, Leo D.
    Ward, Kate A.
    Dennison, Elaine
    Mahmoodi, Sasan
    Niranjan, Mahesan
    Cooper, Cyrus
    RHEUMATOLOGY, 2022, 61 : I12 - I12
  • [33] CARDIOVASCULAR RISK PREDICTION APPLYING MACHINE LEARNING
    Castel, S.
    Maldonado, L.
    Aguilar, I.
    Malo, S.
    Rabanaque, M. J.
    GACETA SANITARIA, 2023, 37 : S204 - S204
  • [34] Prediction of Personal Cardiovascular Risk using Machine Learning for Smartphone Applications
    Seto, Edmund
    Gravina, Raffaele
    Kim, Jenna
    Lin, Shuhao
    Ferrara, Giannina
    Hua, Jenna
    PROCEEDINGS OF THE 2020 IEEE INTERNATIONAL CONFERENCE ON HUMAN-MACHINE SYSTEMS (ICHMS), 2020, : 405 - 410
  • [35] Cardiovascular Risk Prediction Using Machine Learning In A Large Japanese Cohort
    Matheson, Matthew B.
    Kato, Yoko
    Baba, Shinichi
    Cox, Christopher
    Lima, Joao A.
    Venkatesh, Bharath Ambale
    CIRCULATION, 2021, 143
  • [36] Cardiovascular Risk Prediction Using Machine Learning in a Large Japanese Cohort
    Matheson, Matthew B.
    Kato, Yoko
    Baba, Shinichi
    Cox, Christopher
    Lima, Joao A. C.
    Ambale-Venkatesh, Bharath
    CIRCULATION REPORTS, 2022, 4 (12) : 595 - 603
  • [37] Multimodal Learning for Cardiovascular Risk Prediction using EHR Data
    Bagheri, Ayoub
    Groenhof, T. Katrien J.
    Veldhuis, Wouter B.
    de Jong, Pim A.
    Asselbergs, Folkert W.
    Oberski, Daniel L.
    ACM-BCB 2020 - 11TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, 2020,
  • [38] A Two-Country Study of Default Risk Prediction Using Bayesian Machine-Learning
    Incerti, Fabio
    Bargagli-Stoffi, Falco J.
    Riccaboni, Massimo
    MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE, LOD 2022, PT II, 2023, 13811 : 188 - 192
  • [39] Longitudinal clinical data improve survival prediction after hematopoietic cell transplantation using machine learning
    Zhou, Yiwang
    Smith, Jesse
    Keerthi, Dinesh
    Li, Cai
    Sun, Yilun
    Mothi, Suraj Sarvode
    Shyr, David C.
    Spitzer, Barbara
    Harris, Andrew
    Chatterjee, Avijit
    Chatterjee, Subrata
    Shouval, Roni
    Naik, Swati
    Bertaina, Alice
    Boelens, Jaap Jan
    Triplett, Brandon M.
    Tang, Li
    Sharma, Akshay
    BLOOD ADVANCES, 2024, 8 (03) : 686 - 698
  • [40] Advanced Machine-Learning Technologies for Coronary Artery Disease Prediction Using Heterogeneous Data
    Alqulaity, Malak
    Yang, Po
    2023 IEEE 22ND INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, BIGDATASE, CSE, EUC, ISCI 2023, 2024, : 54 - 61