Predicting Colorectal Cancer Survival Using Time-to-Event Machine Learning: Retrospective Cohort Study

被引:7
|
作者
Yang, Xulin [1 ]
Qiu, Hang [1 ,2 ]
Wang, Liya [2 ]
Wang, Xiaodong [3 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, 2006 Xiyuan Ave, Chengdu 611731, Peoples R China
[2] Univ Elect Sci & Technol China, Big Data Res Ctr, Chengdu, Peoples R China
[3] Sichuan Univ, West China Hosp, Dept Gastrointestinal Surg, Chengdu, Peoples R China
关键词
colorectal cancer; survival prediction; machine learning; time-to-event; SHAP; SHapley Additive exPlanations; DIAGNOSIS; MODELS;
D O I
10.2196/44417
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Machine learning (ML) methods have shown great potential in predicting colorectal cancer (CRC) survival. However, the ML models introduced thus far have mainly focused on binary outcomes and have not considered the time-to-event nature of this type of modeling. Objective: This study aims to evaluate the performance of ML approaches for modeling time-to-event survival data and develop transparent models for predicting CRC-specific survival. Methods: The data set used in this retrospective cohort study contains information on patients who were newly diagnosed with CRC between December 28, 2012, and December 27, 2019, at West China Hospital, Sichuan University. We assessed the performance of 6 representative ML models, including random survival forest (RSF), gradient boosting machine (GBM), DeepSurv, DeepHit, neural net-extended time-dependent Cox (or Cox-Time), and neural multitask logistic regression (N-MTLR) in predicting CRC-specific survival. Multiple imputation by chained equations method was applied to handle missing values in variables. Multivariable analysis and clinical experience were used to select significant features associated with CRC survival. Model performance was evaluated in stratified 5-fold cross-validation repeated 5 times by using the time-dependent concordance index, integrated Brier score, calibration curves, and decision curves. The SHapley Additive exPlanations method was applied to calculate feature importance. Results: A total of 2157 patients with CRC were included in this study. Among the 6 time-to-event ML models, the DeepHit model exhibited the best discriminative ability (time-dependent concordance index 0.789, 95% CI 0.779-0.799) and the RSF model produced better-calibrated survival estimates (integrated Brier score 0.096, 95% CI 0.094-0.099), but these are not statistically significant. Additionally, the RSF, GBM, DeepSurv, Cox-Time, and N-MTLR models have comparable predictive accuracy to the Cox Proportional Hazards model in terms of discrimination and calibration. The calibration curves showed that all the ML models exhibited good 5-year survival calibration. The decision curves for CRC-specific survival at 5 years showed that all the ML models, especially RSF, had higher net benefits than default strategies of treating all or no patients at a range of clinically reasonable risk thresholds. The SHapley Additive exPlanations method revealed that R0 resection, tumor-node-metastasis staging, and the number of positive lymph nodes were important factors for 5-year CRC-specific survival. Conclusions: This study showed the potential of applying time-to-event ML predictive algorithms to help predict CRC-specific survival. The RSF, GBM, Cox-Time, and N-MTLR algorithms could provide nonparametric alternatives to the Cox Proportional Hazards model in estimating the survival probability of patients with CRC. The transparent time-to-event ML models help clinicians to more accurately predict the survival rate for these patients and improve patient outcomes by enabling personalized treatment plans that are informed by explainable ML models.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Comparison of time-to-event machine learning models in predicting oral cavity cancer prognosis
    Adeoye, John
    Hui, Liuling
    Koohi-Moghadam, Mohamad
    Tan, Jia Yan
    Choi, Siu-Wai
    Thomson, Peter
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2022, 157
  • [2] A comparative study of machine learning methods for time-to-event survival data for radiomics risk modelling
    Stefan Leger
    Alex Zwanenburg
    Karoline Pilz
    Fabian Lohaus
    Annett Linge
    Klaus Zöphel
    Jörg Kotzerke
    Andreas Schreiber
    Inge Tinhofer
    Volker Budach
    Ali Sak
    Martin Stuschke
    Panagiotis Balermpas
    Claus Rödel
    Ute Ganswindt
    Claus Belka
    Steffi Pigorsch
    Stephanie E. Combs
    David Mönnich
    Daniel Zips
    Mechthild Krause
    Michael Baumann
    Esther G. C. Troost
    Steffen Löck
    Christian Richter
    Scientific Reports, 7
  • [3] A comparative study of machine learning methods for time-to-event survival data for radiomics risk modelling
    Leger, Stefan
    Zwanenburg, Alex
    Pilz, Karoline
    Lohaus, Fabian
    Linge, Annett
    Zoephel, Klaus
    Kotzerke, Joerg
    Schreiber, Andreas
    Tinhofer, Inge
    Budach, Volker
    Sak, Ali
    Stuschke, Martin
    Balermpas, Panagiotis
    Roedel, Claus
    Ganswindt, Ute
    Belka, Claus
    Pigorsch, Steffi
    Combs, Stephanie E.
    Moennich, David
    Zips, Daniel
    Krause, Mechthild
    Baumann, Michael
    Troost, Esther G. C.
    Loeck, Steffen
    Richter, Christian
    SCIENTIFIC REPORTS, 2017, 7
  • [4] Predicting survival from colorectal cancer histology slides using deep learning: A retrospective multicenter study
    Kather, Jakob Nikolas
    Krisam, Johannes
    Charoentong, Pornpimol
    Luedde, Tom
    Herpel, Esther
    Weis, Cleo-Aron
    Gaiser, Timo
    Marx, Alexander
    Valous, Nektarios A.
    Ferber, Dyke
    Jansen, Lina
    Reyes-Aldasoro, Constantino Carlos
    Zoernig, Inka
    Jaeger, Dirk
    Brenner, Hermann
    Chang-Claude, Jenny
    Hoffmeister, Michael
    Halama, Niels
    PLOS MEDICINE, 2019, 16 (01)
  • [5] A Machine-learning Approach to Survival Time-event Predicting: Initial Analyses using Stomach Cancer Data
    Stepanek, Lubomir
    Habarta, Filip
    Mala, Ivana
    Marek, Lubos
    Pazdirek, Filip
    2020 INTERNATIONAL CONFERENCE ON E-HEALTH AND BIOENGINEERING (EHB), 2020,
  • [6] Predicting Continuity of Asthma Care Using a Machine Learning Model: Retrospective Cohort Study
    Tong, Yao
    Lin, Beilei
    Chen, Gang
    Zhang, Zhenxiang
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2022, 19 (03)
  • [7] Author Correction: Machine learning for predicting survival of colorectal cancer patients
    Lucas Buk Cardoso
    Vanderlei Cunha Parro
    Stela Verzinhasse Peres
    Maria Paula Curado
    Gisele Aparecida Fernandes
    Victor Wünsch Filho
    Tatiana Natasha Toporcov
    Scientific Reports, 13
  • [8] Machine Learning Model for Predicting Postoperative Survival of Patients with Colorectal Cancer
    Osman, Mohamed Hosny
    Mohamed, Reham Hosny
    Sarhan, Hossam Mohamed
    Park, Eun Jung
    Baik, Seung Hyuk
    Lee, Kang Young
    Kang, Jeonghyun
    CANCER RESEARCH AND TREATMENT, 2022, 54 (02): : 517 - 524
  • [9] The application of time-to-event analysis in machine learning prognostic models
    Zi-He Peng
    Zhi-Xin Huang
    Juan-Hua Tian
    Tie Chong
    Zhao-Lun Li
    Journal of Translational Medicine, 22
  • [10] An Introduction to Deep Survival Analysis Models for Predicting Time-to-Event Outcomes
    Chen, George H.
    FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2024, 17 (06): : 921 - 1100