Development and validation of an interpretable machine learning model for predicting the risk of hepatocellular carcinoma in patients with chronic hepatitis B: a case-control study

被引:0
|
作者
Wu, Linghong [1 ]
Liu, Zengjing [2 ]
Huang, Hongyuan [1 ]
Pan, Dongmei [2 ]
Fu, Cuiping [3 ]
Lu, Yao [3 ]
Zhou, Min [4 ]
Huang, Kaiyong [1 ]
Huang, Tianren [5 ]
Yang, Li [1 ]
机构
[1] Guangxi Med Univ, Sch Publ Hlth, Dept Occupat Hlth & Environm Hlth, Nanning 530021, Guangxi, Peoples R China
[2] Guangxi Med Univ, Affiliated Hosp 4, Med Records Data Ctr, Liuzhou 545000, Guangxi, Peoples R China
[3] Guangxi Med Univ, Affiliated Hosp 4, Med Dept, Liuzhou 545000, Guangxi, Peoples R China
[4] Guangxi Med Univ, Affiliated Hosp 4, Gen Surg, Liuzhou 545000, Guangxi, Peoples R China
[5] Guangxi Med Univ, Sch Publ Hlth, Dept Epidemiol & Biostat, Nanning 530021, Guangxi, Peoples R China
关键词
Chronic hepatitis B; Hepatocellular carcinoma; Machine learning; Predictive model; GAMMA-GLUTAMYL-TRANSFERASE; CLASSIFICATION;
D O I
10.1186/s12876-025-03697-2
中图分类号
R57 [消化系及腹部疾病];
学科分类号
摘要
BackgroundThe aim of this study was to develop and internally validate an interpretable machine learning (ML) model for predicting the risk of hepatocellular carcinoma (HCC) in patients with chronic hepatitis B (CHB) infection.MethodsWe retrospectively collected clinical data from patients with HCC and CHB treated at the Fourth Affiliated Hospital of Guangxi Medical University from January 2022 to December 2022, including demographics, comorbidities, and laboratory parameters. The datasets were randomly divided into a training set (361 cases) and a validation set (155 cases) in a 7:3 ratio. Variables were screened using Least Absolute Shrinkage and Selection Operator (LASSO) and multifactor logistic regression. The prediction model of HCC risk in CHB patients was constructed based on five machine learning models, including logistic regression (LR), K-nearest neighbour (KNN), support vector machine (SVM), random forest (RF) and artificial neural network (ANN). Receiver operating characteristic (ROC) curve, calibration curve and decision curve analysis (DCA) were used to evaluate the predictive performance of the model in terms of identification, calibration and clinical application. The SHapley Additive exPlanation (SHAP) method was used to rank the importance of the features and explain the final model.ResultsAmong the five ML models constructed, the RF model has the best performance, and the RF model predicts the risk of HCC in patients with CHB in the training set [AUC: 0.996, 95% confidence interval (CI) (0.991-0.999)] and internal validation set [AUC: 0.993, 95% CI (0.986-1.000)]. It has high AUC, specificity, sensitivity, F1 score and low Brier score. Calibration showed good agreement between observed and predicted risks. The model yielded higher positive net benefits in DCA than when all participants were considered to be at high or low risk, indicating good clinical utility. In addition, the SHAP plot of the RF showed that age, basophil/lymphocyte ratio (BLR), D-Dimer, aspartate aminotransferase/alanine aminotransferase (AST/ALT), gamma-glutamyltransferase (GGT) and alpha-fetoprotein (AFP) can help identify patients with CHB who are at high or low risk of developing HCC.ConclusionML models can be used as a tool to predict the risk of HCC in patients with CHB. The RF model has the best predictive performance and helps clinicians to identify high-risk patients and intervene early to reduce or delay the occurrence of HCC. However, the model needs to be further improved through large sample studies.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Development and validation of an interpretable machine learning model for predicting in-hospital mortality for ischemic stroke patients in ICU
    Luo, Xiao
    Li, Binghan
    Zhu, Ronghui
    Tai, Yaoyong
    Wang, Zongyu
    He, Qian
    Zhao, Yanfang
    Bi, Xiaoying
    Wu, Cheng
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2025, 198
  • [42] Development and validation of an ensemble machine-learning model for predicting early mortality among patients with bone metastases of hepatocellular carcinoma
    Long, Ze
    Yi, Min
    Qin, Yong
    Ye, Qianwen
    Che, Xiaotong
    Wang, Shengjie
    Lei, Mingxing
    FRONTIERS IN ONCOLOGY, 2023, 13
  • [43] Hepatocellular Carcinoma Development in Patients with Diabetes Mellitus: A Nationwide Case-Control Study
    Tateishi, Ryosuke
    Matsumura, Takeshi
    Uchino, Koji
    Fujiwara, Naoto
    Senokuchi, Takafumi
    Kon, Kazuyoshi
    Sasako, Takahiro
    Taniai, Makiko
    Kawaguchi, Takumi
    Inoue, Hiroshi
    Watada, Hirotaka
    Kubota, Naoto
    Shimano, Hitoshi
    Kaneko, Shuichi
    Hashimoto, Etsuko
    Watanabe, Sumio
    Shiota, Goshi
    Ueki, Kojiro
    Kashiwabara, Kosuke
    Matsuyama, Yutaka
    Tanaka, Hideo
    Kasuga, Masato
    Araki, Eiichi
    Koike, Kazuhiko
    HEPATOLOGY, 2018, 68 : 528A - 528A
  • [44] A case-control study on the relationship between hepatitis B virus DNA level and risk of hepatocellular carcinoma in Qidong,China
    Liu, Taotao
    Fang, Ying
    Xiong, Hui
    Chen, Taoyang
    Ni, Zhengpin
    Shen, Xizhong
    GASTROENTEROLOGY, 2007, 132 (04) : A760 - A760
  • [45] A case-control study of the relationship between hepatitis B virus DNA level and risk of hepatocellular carcinoma in Qidong, China
    Liu, Tao-Tao
    Fang, Ying
    Xiong, Hui
    Chen, Tao Yang
    Ni, Zheng-Pin
    Luo, Jian-Feng
    Zhao, Nai-Qing
    Shen, Xi-Zhong
    WORLD JOURNAL OF GASTROENTEROLOGY, 2008, 14 (19) : 3059 - 3063
  • [46] Reply to 'Consider more about risk scores for predicting hepatocellular carcinoma in patients with chronic hepatitis B'
    Kamalapirat, Thanachote
    Ungtrakul, Teerapat
    JOURNAL OF VIRAL HEPATITIS, 2021, 28 (10) : 1501 - 1501
  • [47] Past exposure to hepatitis B virus (HBV) as a risk factor for hepatocellular carcinoma (HCC) in Japanese patients with chronic liver disease (CLD): A case-control study.
    Okada, S
    Okusaka, T
    Ishii, H
    Ikeda, M
    Nakasuka, H
    Nagahama, H
    Yoshimori, M
    Sato, T
    GASTROENTEROLOGY, 1997, 112 (04) : A1352 - A1352
  • [48] Predicting Hepatocellular Carcinoma Risk in Chronic Hepatitis B Patients Receiving Finite Periods of Antiviral Therapy
    Lin, Chih-Lang
    Wu, Szu-Yuan
    Lai, Ming-Wei
    Hsu, Chao-Wei
    Chen, Wan-Ming
    Jao, An-Tzu
    Chien, Cheng-Hung
    Hu, Ching-Chih
    Chien, Rong-Nan
    Yeh, Chau-Ting
    CANCERS, 2023, 15 (13)
  • [50] Anti-hepatitis B core positivity as a risk factor for hepatocellular carcinoma in alcoholic cirrhosis: A case-control study
    Kwon, Oh Sang
    Jung, Young Kul
    Bae, Kyoung Suk
    Kim, Ju Hyun
    Kim, Sang Gyune
    Kim, Young Seok
    Lee, Jung Il
    Lee, Jin Woo
    Kim, Yun Soo
    ALCOHOL, 2012, 46 (06) : 537 - 541