Machine-learning model to predict the cause of death using a stacking ensemble method for observational data

被引:27
|
作者
Kim, Chungsoo [1 ]
You, Seng Chan [2 ]
Reps, Jenna M. [3 ]
Cheong, Jae Youn [4 ]
Park, Rae Woong [1 ,2 ]
机构
[1] Ajou Univ, Dept Biomed Sci, Grad Sch Med, Suwon, Gyeonggi Do, South Korea
[2] Ajou Univ, Dept Biomed Informat, Sch Med, Suwon, Gyeonggi Do, South Korea
[3] Janssen Res & Dev, Titusville, NJ USA
[4] Ajou Univ, Dept Gastroenterol, Sch Med, Suwon, Gyeonggi Do, South Korea
关键词
cause of death; mortality; machine learning; classification; decision support systems; clinical; ALL-CAUSE MORTALITY; RANDOMIZED TRIALS; GLOBAL BURDEN; HEALTH; CLASSIFICATION; DATABASES; OUTCOMES; QUALITY;
D O I
10.1093/jamia/ocaa277
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: Cause of death is used as an important outcome of clinical research; however, access to cause-of-death data is limited. This study aimed to develop and validate a machine-learning model that predicts the cause of death from the patient's last medical checkup. Materials and Methods: To classify the mortality status and each individual cause of death, we used a stacking ensemble method. The prediction outcomes were all-cause mortality, 8 leading causes of death in South Korea, and other causes. The clinical data of study populations were extracted from the national claims (n= 174 747) and electronic health records (n =729 065) and were used for model development and external validation. Moreover, we imputed the cause of death from the data of 3 US claims databases (n =994 518, 995 372, and 407 604, respectively). All databases were formatted to the Observational Medical Outcomes Partnership Common Data Model. Results: The generalized area under the receiver operating characteristic curve (AUROC) of the model predicting the cause of death within 60 days was 0.9511. Moreover, the AUROC of the external validation was 0.8887. Among the causes of death imputed in the Medicare Supplemental database, 11.32% of deaths were due to malignant neoplastic disease. Discussion: This study showed the potential of machine-learning models as a new alternative to address the lack of access to cause-of-death data. All processes were disclosed to maintain transparency, and the model was easily applicable to other institutions. Conclusion: A machine-learning model with competent performance was developed to predict cause of death.
引用
收藏
页码:1098 / 1107
页数:10
相关论文
共 50 条
  • [1] An Ensemble Machine-Learning Model To Predict Historical PM2.5 Concentrations in China from Satellite Data
    Xiao, Qingyang
    Chang, Howard H.
    Geng, Guannan
    Liu, Yang
    ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2018, 52 (22) : 13260 - 13269
  • [2] Predicting electronic stopping powers using stacking ensemble machine learning method
    Akbari, Fatemeh
    Taghizadeh, Somayeh
    Shvydka, Diana
    Sperling, Nicholas Niven
    Parsai, E. Ishmael
    NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH SECTION B-BEAM INTERACTIONS WITH MATERIALS AND ATOMS, 2023, 538 : 8 - 16
  • [3] Detection of Parkinson's Disease by Using Machine Learning Stacking and Ensemble Method
    Vikas Chaurasia
    Aparna Chaurasia
    Biomedical Materials & Devices, 2023, 1 (2): : 966 - 978
  • [4] Optimized Stacking Ensemble Learning Model for Breast Cancer Detection and Classification Using Machine Learning
    Kumar, Mukesh
    Singhal, Saurabh
    Shekhar, Shashi
    Sharma, Bhisham
    Srivastava, Gautam
    SUSTAINABILITY, 2022, 14 (21)
  • [5] An integrated machine-learning model to predict nucleosome architecture
    Sala, Alba
    Labrador, Mireia
    Buitrago, Diana
    De Jorge, Pau
    Battistini, Federica
    Heath, Isabelle Brun
    Orozco, Modesto
    NUCLEIC ACIDS RESEARCH, 2024, 52 (17) : 10132 - 10143
  • [6] Ensemble model aggregation using a computationally lightweight machine-learning model to forecast ocean waves
    O'Donncha, Fearghal
    Zhang, Yushan
    Chen, Bei
    James, Scott C.
    JOURNAL OF MARINE SYSTEMS, 2019, 199
  • [7] A Stacking Ensemble Machine Learning Model for Emergency Call Forecasting
    Megouo, Talotsing Gaelle Patricia
    Pierre, Samuel
    IEEE ACCESS, 2024, 12 : 115820 - 115837
  • [8] Non-targeted detection of food adulteration using an ensemble machine-learning model
    Chung, Teresa
    Tam, Issan Yee San
    Lam, Nelly Yan Yan
    Yang, Yanni
    Liu, Boyang
    He, Billy
    Li, Wengen
    Xu, Jie
    Yang, Zhigang
    Zhang, Lei
    Cao, Jian Nong
    Lau, Lok-Ting
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [9] Non-targeted detection of food adulteration using an ensemble machine-learning model
    Teresa Chung
    Issan Yee San Tam
    Nelly Yan Yan Lam
    Yanni Yang
    Boyang Liu
    Billy He
    Wengen Li
    Jie Xu
    Zhigang Yang
    Lei Zhang
    Jian Nong Cao
    Lok-Ting Lau
    Scientific Reports, 12
  • [10] Machine-Learning Model to Predict the Intradialytic Hypotension Based on Clinical-Analytical Data
    Mendoza-Pitti, Luis
    Manuel Gomez-Pulido, Jose
    Vargas-Lombardo, Miguel
    Gomez-Pulido, Juan A.
    Polo-Luque, Maria-Luz
    Rodriguez-Puyol, Diego
    IEEE ACCESS, 2022, 10 : 72065 - 72079