Predicting cardiovascular risk from national administrative databases using a combined survival analysis and deep learning approach

被引:20
|
作者
Barbieri, Sebastiano [1 ]
Mehta, Suneela [2 ]
Wu, Billy [2 ]
Bharat, Chrianna [3 ]
Poppe, Katrina [2 ]
Jorm, Louisa [1 ]
Jackson, Rod [2 ]
机构
[1] Univ New South Wales, Ctr Big Data Res Hlth, Sydney, NSW, Australia
[2] Univ Auckland, Sect Epidemiol & Biostat, Auckland, New Zealand
[3] Univ New South Wales, Natl Drug & Alcohol Res Ctr, Sydney, NSW, Australia
关键词
Cardiovascular diseases; primary prevention; risk assessment; population health; health planning; machine learning; deep learning; survival analysis;
D O I
10.1093/ije/dyab258
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Background Machine learning-based risk prediction models may outperform traditional statistical models in large datasets with many variables, by identifying both novel predictors and the complex interactions between them. This study compared deep learning extensions of survival analysis models with Cox proportional hazards models for predicting cardiovascular disease (CVD) risk in national health administrative datasets. Methods Using individual person linkage of administrative datasets, we constructed a cohort of all New Zealanders aged 30-74 who interacted with public health services during 2012. After excluding people with prior CVD, we developed sex-specific deep learning and Cox proportional hazards models to estimate the risk of CVD events within 5 years. Models were compared based on the proportion of explained variance, model calibration and discrimination, and hazard ratios for predictor variables. Results First CVD events occurred in 61 927 of 2 164 872 people. Within the reference group, the largest hazard ratios estimated by the deep learning models were for tobacco use in women (2.04, 95% CI: 1.99, 2.10) and chronic obstructive pulmonary disease with acute lower respiratory infection in men (1.56, 95% CI: 1.50, 1.62). Other identified predictors (e.g. hypertension, chest pain, diabetes) aligned with current knowledge about CVD risk factors. Deep learning outperformed Cox proportional hazards models on the basis of proportion of explained variance (R-2: 0.468 vs 0.425 in women and 0.383 vs 0.348 in men), calibration and discrimination (all P <0.0001). Conclusions Deep learning extensions of survival analysis models can be applied to large health administrative datasets to derive interpretable CVD risk prediction equations that are more accurate than traditional Cox proportional hazards models.
引用
收藏
页码:933 / 944
页数:12
相关论文
共 50 条
  • [31] DEEP LEARNING MODELS FOR PREDICTING THE RISK OF CARDIOVASCULAR INCIDENTS BASED ON THE WISCONSIN LONGITUDINAL STUDY
    Kvan, Valerii
    Artomov, Mykyta
    JOURNAL OF HYPERTENSION, 2024, 42
  • [32] DeepHit: A Deep Learning Approach to Survival Analysis with Competing Risks
    Lee, Changhee
    Zame, William R.
    Yoon, Jinsung
    van der Schaar, Mihaela
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 2314 - 2321
  • [33] Deep Learning for Risk Analysis of Specific Cardiovascular Diseases Using Environmental Data and Outpatient Records
    Hsiao, Han C. W.
    Chen, Sean H. F.
    Tsai, Jeffrey J. P.
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE), 2016, : 369 - 372
  • [34] DEEP LEARNING ANALYSIS USING CARDIOVASCULAR MAGNETIC RESONANCE IMAGING FOR RISK PREDICTION IN CARDIAC AMYLOIDOSIS
    Cockrum, Joshua
    Chen, David
    Nakashima, Makiya
    Mauch, Joseph
    Hanna, Mazen A.
    Kanj, Kaia
    Ramesh, Basnet
    Benovoy, Mitchel
    Kapadia, Samir R.
    Svensson, Lars G.
    Griffin, Brian P.
    Flamm, Scott D.
    Grimm, Richard A.
    Tang, Wai Hong Wilson
    Kwon, Deborah
    JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2022, 79 (09) : 1193 - 1193
  • [35] Predicting survival from colorectal cancer histology slides using deep learning: A retrospective multicenter study
    Kather, Jakob Nikolas
    Krisam, Johannes
    Charoentong, Pornpimol
    Luedde, Tom
    Herpel, Esther
    Weis, Cleo-Aron
    Gaiser, Timo
    Marx, Alexander
    Valous, Nektarios A.
    Ferber, Dyke
    Jansen, Lina
    Reyes-Aldasoro, Constantino Carlos
    Zoernig, Inka
    Jaeger, Dirk
    Brenner, Hermann
    Chang-Claude, Jenny
    Hoffmeister, Michael
    Halama, Niels
    PLOS MEDICINE, 2019, 16 (01)
  • [36] Identification of combined biomarkers for predicting the risk of osteoporosis using machine learning
    Zheng, Zhenlong
    Zhang, Xianglan
    Oh, Bong-Kyeong
    Kim, Ki-Yeol
    AGING-US, 2022, 14 (10): : 4270 - 4280
  • [37] Cervical Cancer Classification Using Combined Machine Learning and Deep Learning Approach
    Alquran, Hiam
    Mustafa, Wan Azani
    Abu Qasmieh, Isam
    Yacob, Yasmeen Mohd
    Alsalatie, Mohammed
    Al-Issa, Yazan
    Alqudah, Ali Mohammad
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (03): : 5117 - 5134
  • [38] Predicting systemic risk in financial systems using Deep Graph Learning
    Balmaseda, Vicente
    Coronado, Maria
    de Cadenas-Santiago, Gonzalo
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2023, 19
  • [39] Predicting Energy Consumption Data Using Deep Learning: An LSTM Approach
    Chaves, Julio Macedo
    Ohata, Elene Firmeza
    dos Santos, Matheus Araujo
    Santos, Jose Daniel de Alencar
    Bernardes, Matheus Jardim
    Dora, Daniel Seleme
    Filho, Pedro Pedrosa Reboucas
    INTELLIGENT SYSTEMS, BRACIS 2024, PT II, 2025, 15413 : 296 - 308
  • [40] A deep learning approach for predicting critical events using event logs
    Huang, Congfang
    Deep, Akash
    Zhou, Shiyu
    Veeramani, Dharmaraj
    QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2021, 37 (05) : 2214 - 2234