Development and validation of machine learning models for predicting venous thromboembolism in colorectal cancer patients: A cohort study in China

被引:0
|
作者
Hu, Zuhai [1 ]
Li, Xiaosheng [2 ]
Yuan, Yuliang [1 ]
Xu, Qianjie [3 ]
Zhang, Wei [2 ]
Lei, Haike [1 ]
机构
[1] Chongqing Univ Canc Hosp, Chongqing Canc Multi Big Data Applicat Engn Res Ct, Chongqing 400030, Peoples R China
[2] Chongqing Univ Canc Hosp, Chongqing Key Lab Translat Res Canc Metastasis & I, Chongqing 400030, Peoples R China
[3] Chongqing Med Univ, Sch Publ Hlth, Dept Hlth Stat, Chongqing 400016, Peoples R China
关键词
Colorectal cancer; VTE; Machine learning; Bootstrap; RISK-ASSESSMENT MODELS; DIAGNOSIS;
D O I
10.1016/j.ijmedinf.2024.105770
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Background: With advancements in healthcare, traditional VTE risk assessment tools are increasingly insufficient to meet the demands of high-quality care, underscoring the need for innovative and specialized assessment methods. Objective: Owing to the remarkable success of machine learning in supervised learning and disease prediction, our objective is to develop a reliable and efficient model for assessing VTE risk by leveraging the fundamental data and clinical characteristics of colorectal cancer patients within our medical facility. Methods: Six commonly used machine learning algorithms were utilized in our study to predict the occurrence of VTE in patients with rectal cancer. In the modeling process, LASSO regression was employed to identify and exclude variables not associated with VTE. Additionally, hyperparameter tuning was conducted via 5-fold cross- validation to mitigate overfitting, and 200 bootstrap samples were used to adjust the apparent performance on the training set. The selection of the VTE assessment model was determined by a thorough evaluation of performance criteria, such as the AUC, ACC and F1 score. Results: The RF model exhibits consistent and efficient performance. Specifically, in the internally validation dataset, where generalizability was adjusted, the RF model achieved the highest scores across multiple metrics: AD-AUC (0.895), AD-ACC (0.871), AD-F1 (0.311), AD-MCC (0.316), AD-Precision (0.241), AD-Specificity (0.888). For external validation on unseen colon cancer data, the RF model also performed best in terms of ACC (0.728), F1 (0.292), MCC (0.225), Precision (0.192), and Specificity (0.740), with a suboptimal AUC of 0.745 and a Sensitivity (Recall) of 0.615. Additionally, the RF model demonstrates strong performance not only on the original dataset but also on datasets processed via alternative imbalance handling techniques. Conclusions: Our research successfully established and validated a risk assessment model for assessing the risk of VTE in colorectal cancer patients.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Development and Validation of a Risk Score for Prediction of Venous Thromboembolism in Patients With Lung Cancer
    Li, Zilun
    Zhang, Guolong
    Zhang, Mengping
    Mei, Jie
    Weng, Huiwen
    Peng, Zhenwei
    CLINICAL AND APPLIED THROMBOSIS-HEMOSTASIS, 2020, 26
  • [42] Machine learning models for prediction of postoperative venous thromboembolism in gynecological malignant tumor patients
    Lin, Bijuan
    Chen, Fang
    Wu, Meiying
    Li, Chaojing
    Lin, Lanzhi
    JOURNAL OF OBSTETRICS AND GYNAECOLOGY RESEARCH, 2024, 50 (07) : 1175 - 1181
  • [43] Machine Learning-Based Predictive Models for Patients with Venous Thromboembolism: A Systematic Review
    Danilatou, Vasiliki
    Dimopoulos, Dimitrios
    Kostoulas, Theodoros
    Douketis, James
    THROMBOSIS AND HAEMOSTASIS, 2024, 124 (11) : 1040 - 1052
  • [44] Comparing different venous thromboembolism risk assessment machine learning models in Chinese patients
    Wang, Xin
    Yang, Yu-Qing
    Liu, Si-Hua
    Hong, Xin-Yu
    Sun, Xue-Feng
    Shi, Ju-hong
    JOURNAL OF EVALUATION IN CLINICAL PRACTICE, 2020, 26 (01) : 26 - 34
  • [45] Venous thromboembolism in patients with pemphigus: A cohort study
    Leshem, Yael Anne
    Atzmony, Lihi
    Dudkiewicz, Israel
    Hodak, Emmilia
    Mimouni, Daniel
    JOURNAL OF THE AMERICAN ACADEMY OF DERMATOLOGY, 2017, 77 (02) : 256 - 260
  • [46] Risk Factors of Venous Thromboembolism in Inpatients With Colorectal Cancer in China
    Chen, Fengjiao
    Sun, Yixin
    Zhang, Cui
    Li, Lei
    Du, Yu
    Zhou, Mingyan
    Cheng, Wen
    CLINICAL AND APPLIED THROMBOSIS-HEMOSTASIS, 2023, 29
  • [47] Validation of clinical risk assessment scores for venous thromboembolism in patients with cancer: a population-based cohort study
    Lanting, Vincent
    Vago, Emese
    Horvath-Puho, Erzsebet
    Mulder, Frits
    Di Nisio, Marcello
    Kamphuisen, Pieter W.
    Pedersen, Lars
    van Es, Nick
    Sorensen, Henrik T.
    JOURNAL OF THROMBOSIS AND HAEMOSTASIS, 2025, 23 (02) : 600 - 609
  • [48] Evaluation of risk factors and assessment models for predicting venous thromboembolism in lung cancer patients
    Rupa-Matysek, Joanna
    Lembicz, Marta
    Rogowska, Eliza Katarzyna
    Gil, Lidia
    Komarnicki, Mieczyslaw
    Batura-Gabryel, Halina
    MEDICAL ONCOLOGY, 2018, 35 (05)
  • [49] Evaluation of risk factors and assessment models for predicting venous thromboembolism in lung cancer patients
    Joanna Rupa-Matysek
    Marta Lembicz
    Eliza Katarzyna Rogowska
    Lidia Gil
    Mieczysław Komarnicki
    Halina Batura-Gabryel
    Medical Oncology, 2018, 35
  • [50] Models for predicting venous thromboembolism in ambulatory patients with lung cancer: a systematic review protocol
    Yan, Ann-Rong
    Samarawickrema, Indira
    Naunton, Mark
    Peterson, Gregory M.
    Yip, Desmond
    Mortazavi, Reza
    BMJ OPEN, 2021, 11 (12):