Development and validation of an artificial intelligence prediction model and a survival risk stratification for lung metastasis in colorectal cancer from highly imbalanced data: A multicenter retrospective study

Cited by: 3
Authors
Zhang, Weiyuan [1]
Guan, Xu [2, 3, 5]
Jiao, Shuai [3]
Wang, Guiyu [1, 6]
Wang, Xishan [1, 3, 4]
Affiliations
[1] Harbin Med Univ, Affiliated Hosp 2, Dept Colorectal Canc Surg, Harbin 150000, Peoples R China
[2] Chinese Acad Med Sci & Peking Union Med Coll, Dept Colorectal Surg, Natl Canc Ctr, Natl Clin Res Ctr Canc,Canc Hosp, Beijing 100000, Peoples R China
[3] Shanxi Med Univ, Chinese Acad Med Sci, Shanxi Prov Canc Hosp, Canc Hosp,Dept Colorectal Surg, Taiyuan, Peoples R China
[4] Harbin Med Univ, Affiliated Hosp 2, Dept Colorectal Canc Surg, 246 Xuefu Rd, Harbin, Peoples R China
[5] Chinese Acad Med Sci & Peking Union Med Coll, Natl Canc Ctr, Dept Diagnost Radiol, Natl Clin Res Ctr Canc,Canc Hosp,Dept Colorectal S, 17 Panjiayuan Nanli, Beijing 100021, Peoples R China
[6] Harbin Med Univ, Affiliated Hosp 2, Dept Colorectal Canc Surg, 246 Xuefu Rd, Harbin, Peoples R China
Source
EJSO | 2023 / Vol. 49 / Issue 12
Funding
National Natural Science Foundation of China
Keywords
Lung metastasis; Machine learning; Colorectal cancer; Prediction; Imbalanced data; CURATIVE RESECTION; GUIDELINES; RECURRENCE; NOMOGRAM;
DOI
10.1016/j.ejso.2023.107107
Chinese Library Classification (CLC) number
R73 [Oncology]
Discipline classification code
100214
Abstract
Background: To assist clinicians with diagnosis and optimal treatment decision-making, we aimed to develop and validate an artificial intelligence prediction model for lung metastasis (LM) in colorectal cancer (CRC) patients.
Methods: The clinicopathological characteristics of 46037 CRC patients from the Surveillance, Epidemiology, and End Results (SEER) database and 2779 CRC patients from a multicenter external validation set were collected retrospectively. After feature selection by univariate and multivariate analyses, six machine learning (ML) models, namely logistic regression, K-nearest neighbor, support vector machine, decision tree, random forest, and balanced random forest (BRF), were developed and validated for LM prediction. In addition, LM patients stratified by risk score were used for survival analysis.
Results: LM rates were extremely low, at 2.59% and 4.50% in the development and validation sets, respectively. The BRF model, which uses an imbalanced learning strategy, achieved an area under the receiver operating characteristic curve (AUC) of 0.874 and an average precision (AP) of 0.184, and performed best compared with the other models and the clinical predictor. Patients with LM in the high-risk group had significantly poorer survival (P < 0.001) and failed to benefit from resection (P = 0.125).
Conclusions: We used the BRF algorithm to develop an effective, non-invasive, and practical model for predicting LM in CRC patients from highly imbalanced datasets. In addition, we implemented a novel approach to stratify the survival risk of CRC patients with LM based on the output of the model.
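The abstract names a balanced random forest and reports AUC and AP on data with a very low positive rate. The sketch below is not the authors' code. It is a minimal, standard-library illustration of two ideas, with all function names (`balanced_bootstrap`, `roc_auc`, `average_precision`) and the toy data being assumptions made for this example: the per-tree balanced resampling usually associated with BRF, and the two ranking metrics. A random ranking has an expected AP roughly equal to the positive rate, so an AP of 0.184 is best read against LM rates of 2.59% and 4.50% rather than against 1.0.

```python
import random


def balanced_bootstrap(labels, rng):
    """Indices for one tree: a bootstrap of the minority class plus an
    equally sized sample, with replacement, from the majority class.
    (Hypothetical helper; a sketch of the usual BRF resampling step.)"""
    pos = [i for i, y in enumerate(labels) if y == 1]
    neg = [i for i, y in enumerate(labels) if y == 0]
    minority, majority = (pos, neg) if len(pos) <= len(neg) else (neg, pos)
    n = len(minority)
    return ([rng.choice(minority) for _ in range(n)]
            + [rng.choice(majority) for _ in range(n)])


def roc_auc(labels, scores):
    """AUC as the probability that a random positive outranks a random
    negative, counting ties as one half. O(n_pos * n_neg), sketch only."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def average_precision(labels, scores):
    """Mean of the precision values at the rank of each positive.
    Assumes at least one positive and no tied scores."""
    ranked = sorted(zip(scores, labels), key=lambda t: -t[0])
    hits, total = 0, 0.0
    for rank, (_, y) in enumerate(ranked, start=1):
        if y == 1:
            hits += 1
            total += hits / rank
    return total / hits


# Toy data (made up): 2 positives among 8 cases.
labels = [0, 0, 1, 0, 1, 0, 0, 0]
scores = [0.10, 0.20, 0.90, 0.30, 0.40, 0.80, 0.05, 0.15]

sample = balanced_bootstrap(labels, random.Random(0))
print("balanced sample indices:", sample)
print("AUC:", roc_auc(labels, scores))
print("AP:", average_precision(labels, scores))
```

In a full BRF, each tree would be fit on one such balanced sample and the tree outputs averaged; the resulting scores would then be evaluated on the untouched, still imbalanced validation data.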
Pages: 9
Related papers
50 in total
  • [21] Development and validation of a risk prediction model for post-polypectomy colorectal cancer in the USA: a prospective cohort study
    Knudsen, Markus Dines
    Wang, Kai
    Wang, Liang
    Polychronidis, Georgios
    Berstad, Paula
    Wu, Kana
    He, Xiaosheng
    Hang, Dong
    Fang, Zhe
    Ogino, Shuji
    Chan, Andrew T.
    Giovannucci, Edward
    Wang, Molin
    Song, Mingyang
    ECLINICALMEDICINE, 2023, 62
  • [22] Development and internal validation of a risk prediction model for stress urinary incontinence throughout pregnancy: A multicenter retrospective longitudinal study in Indonesia
    Liang, Surui
    Huang, Shijie
    Andarini, Esti
    Wang, Ying
    Li, Yan
    Cai, Wenzhi
    NEUROUROLOGY AND URODYNAMICS, 2024, 43 (02) : 354 - 363
  • [23] Development and validation of a novel diagnostic model for assessing lung cancer metastasis in a Chinese population based on multicenter real-world data
    Yao, Yiyong
    Yan, Cunling
    Zhang, Wei
    Wu, San-Gang
    Guan, Jie
    Zeng, Gang
    Du, Qiang
    Huang, Chun
    Zhang, Hui
    Wang, Huiling
    Hou, Yanfeng
    Li, Zhiyan
    Wang, Lixin
    Zheng, Yijie
    Li, Xun
    CANCER MANAGEMENT AND RESEARCH, 2019, 11 : 9213 - 9223
  • [24] Derivation and External Validation of a Risk Prediction Model for Pulmonary Embolism in Patients With Lung Cancer: A Large Retrospective Cohort Study
    Zhu, Ning
    Zhang, Lei
    Gong, Shengping
    Luo, Zhuanbo
    He, Lei
    Wang, Linfeng
    Qiu, Feng
    Huang, Weina
    Cao, Chao
    CLINICAL AND APPLIED THROMBOSIS-HEMOSTASIS, 2023, 29
  • [25] Development and Validation of a Computed Tomography-Based Model for Noninvasive Prediction of the T Stage in Gastric Cancer: Multicenter Retrospective Study
    Tao, Jin
    Liu, Dan
    Hu, Fu-Bi
    Zhang, Xiao
    Yin, Hongkun
    Zhang, Huiling
    Zhang, Kai
    Huang, Zixing
    Yang, Kun
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2024, 26
  • [26] Development and validation of an artificial intelligence model for predicting de novo distant bone metastasis in breast cancer: a dual-center study
    Zhang, Wen-hai
    Tan, Yang
    Huang, Zhen
    Tan, Qi-xing
    Zhang, Yue-mei
    Wei, Chang-yuan
    BMC WOMENS HEALTH, 2024, 24 (01)
  • [27] Development of a prediction model for lung metastasis by using postoperative circulating tumor DNA binding clinicopathological risk factors in patients with colorectal cancer
    Han, Bohan
    Li, Weiyao
    Li, Jianxia
    Hu, Huabin
    Zhang, Jianwei
    Xie, Xiaoyu
    Mo, Qinli
    Deng, Yanhong
    JOURNAL OF CLINICAL ONCOLOGY, 2024, 42 (16)
  • [28] Survival Benefit of Palliative Local Treatments and Efficacy of Different Pharmacotherapies in Colorectal Cancer With Lung Metastasis: Results From a Large Retrospective Study
    Wang, Zhenghang
    Wang, Xicheng
    Yuan, Jiajia
    Zhang, Xiaotian
    Zhou, Jun
    Lu, Ming
    Liu, Dan
    Li, Jian
    Shen, Lin
    CLINICAL COLORECTAL CANCER, 2018, 17 (02) : E233 - E255
  • [29] Development and Validation of a Prognostic Model to Predict Overall Survival for Lung Adenocarcinoma: A Population-Based Study From the SEER Database and the Chinese Multicenter Lung Cancer Database
    Wang, Zhiqiang
    Hu, Fan
    Chang, Ruijie
    Yu, Xiaoyue
    Xu, Chen
    Liu, Yujie
    Wang, Rongxi
    Chen, Hui
    Liu, Shangbin
    Xia, Danni
    Chen, Yingjie
    Ge, Xin
    Zhou, Tian
    Zhang, Shuixiu
    Pang, Haoyue
    Fang, Xueni
    Zhang, Yushuang
    Li, Jin
    Hu, Kaiwen
    Cai, Yong
    TECHNOLOGY IN CANCER RESEARCH & TREATMENT, 2022, 21
  • [30] Development and validation of a risk stratification model for stroke recurrence after acute ischemic stroke in young adults: A nomogram-based, multicenter retrospective study
    Li, Zhongzhen
    Tian, Tian
    Yan, Yujia
    Yu, Yue
    Liu, Jun
    Zhang, Shusheng
    Zhang, Guobin
    Yin, Shaoya
    CLINICAL NEUROLOGY AND NEUROSURGERY, 2025, 249