Predicting cardiovascular risk from national administrative databases using a combined survival analysis and deep learning approach

被引:20
|
作者
Barbieri, Sebastiano [1 ]
Mehta, Suneela [2 ]
Wu, Billy [2 ]
Bharat, Chrianna [3 ]
Poppe, Katrina [2 ]
Jorm, Louisa [1 ]
Jackson, Rod [2 ]
机构
[1] Univ New South Wales, Ctr Big Data Res Hlth, Sydney, NSW, Australia
[2] Univ Auckland, Sect Epidemiol & Biostat, Auckland, New Zealand
[3] Univ New South Wales, Natl Drug & Alcohol Res Ctr, Sydney, NSW, Australia
关键词
Cardiovascular diseases; primary prevention; risk assessment; population health; health planning; machine learning; deep learning; survival analysis;
D O I
10.1093/ije/dyab258
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Background Machine learning-based risk prediction models may outperform traditional statistical models in large datasets with many variables, by identifying both novel predictors and the complex interactions between them. This study compared deep learning extensions of survival analysis models with Cox proportional hazards models for predicting cardiovascular disease (CVD) risk in national health administrative datasets. Methods Using individual person linkage of administrative datasets, we constructed a cohort of all New Zealanders aged 30-74 who interacted with public health services during 2012. After excluding people with prior CVD, we developed sex-specific deep learning and Cox proportional hazards models to estimate the risk of CVD events within 5 years. Models were compared based on the proportion of explained variance, model calibration and discrimination, and hazard ratios for predictor variables. Results First CVD events occurred in 61 927 of 2 164 872 people. Within the reference group, the largest hazard ratios estimated by the deep learning models were for tobacco use in women (2.04, 95% CI: 1.99, 2.10) and chronic obstructive pulmonary disease with acute lower respiratory infection in men (1.56, 95% CI: 1.50, 1.62). Other identified predictors (e.g. hypertension, chest pain, diabetes) aligned with current knowledge about CVD risk factors. Deep learning outperformed Cox proportional hazards models on the basis of proportion of explained variance (R-2: 0.468 vs 0.425 in women and 0.383 vs 0.348 in men), calibration and discrimination (all P <0.0001). Conclusions Deep learning extensions of survival analysis models can be applied to large health administrative datasets to derive interpretable CVD risk prediction equations that are more accurate than traditional Cox proportional hazards models.
引用
收藏
页码:933 / 944
页数:12
相关论文
共 50 条
  • [21] Predicting the risk of developing diabetic retinopathy using deep learning
    Bora, Ashish
    Balasubramanian, Siva
    Babenko, Boris
    Virmani, Sunny
    Venugopalan, Subhashini
    Mitani, Akinori
    Marinho, Guilherme de Oliveira
    Cuadros, Jorge
    Ruamviboonsuk, Paisan
    Corrado, Greg S.
    Peng, Lily
    Webster, Dale R.
    Varadarajan, Avinash V.
    Hammel, Naama
    Liu, Yun
    Bavishi, Pinal
    LANCET DIGITAL HEALTH, 2021, 3 (01): : E10 - E19
  • [22] Predicting Delay in IoT Using Deep Learning: A Multiparametric Approach
    Ateeq, Muhammad
    Ishmanov, Farruh
    Afzal, Muhammad Khalil
    Naeem, Muhammad
    IEEE ACCESS, 2019, 7 : 62022 - 62031
  • [23] DPWTE: A Deep Learning Approach to Survival Analysis Using a Parsimonious Mixture of Weibull Distributions
    Bennis, Achraf
    Mouysset, Sandrine
    Serrurier, Mathieu
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT II, 2021, 12892 : 185 - 196
  • [24] The Risk Analysis of Digital Inclusive Financial Platform Using Deep Learning Approach
    Shi, Wei
    Long, Si-qi
    Li, Yue
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2024, 40 (04) : 763 - 779
  • [25] Automatic feature extraction in large fusion databases by using deep learning approach
    Farias, Gonzalo
    Dormido-Canto, Sebastian
    Vega, Jesus
    Ratta, Giuseppe
    Vargas, Hector
    Hermosilla, Gabriel
    Alfaro, Luis
    Valencia, Agustin
    FUSION ENGINEERING AND DESIGN, 2016, 112 : 979 - 983
  • [26] PREDICTING DIABETES FROM PHOTOPLETHYSMOGRAPHY USING DEEP LEARNING
    Avram, Robert
    Tison, Geoffrey
    Kuhar, Peter
    Marcus, Gregory
    Pletcher, Mark
    Olgin, Jeffrey E.
    Aschbacher, Kirstin
    JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2019, 73 (09) : 16 - 16
  • [27] Predicting demographics from meibography using deep learning
    Wang, Jiayun
    Graham, Andrew D.
    Yu, Stella X.
    Lin, Meng C.
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [28] Predicting demographics from meibography using deep learning
    Jiayun Wang
    Andrew D. Graham
    Stella X. Yu
    Meng C. Lin
    Scientific Reports, 12
  • [29] SURVIVAL BENEFITS FROM PNEUMOCOCCAL VACCINE: AN ANALYSIS FROM NATIONAL INPATIENT SAMPLE DATABASES
    Ghosh, Arjab
    Mandania, Roshni
    Ma, Jennifer
    Mena, Miguel
    Dodoo, Christopher
    Dwivedi, Alok
    Mukherjee, Debabrata
    JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2021, 77 (18) : 1506 - 1506
  • [30] Survival analysis using deep learning with medical imaging
    Morrison, Samantha
    Gatsonis, Constantine
    Eloyan, Ani
    Steingrimsson, Jon Arni
    INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2024, 20 (01): : 1 - 12