Machine Learning Algorithms Outperform Conventional Regression Models in Predicting Development of Hepatocellular Carcinoma

Cited by: 202
Authors
Singal, Amit G. [1, 2, 3]
Mukherjee, Ashin [4]
Elmunzer, B. Joseph [5]
Higgins, Peter D. R. [5]
Lok, Anna S. [5]
Zhu, Ji [1, 4]
Marrero, Jorge A. [1]
Waljee, Akbar K. [5, 6]
Affiliations
[1] UT Southwestern Med Ctr, Dept Internal Med, Dallas, TX USA
[2] Univ Texas Southwestern, Dept Clin Sci, Dallas, TX USA
[3] UT Southwestern Med Ctr, Harold C Simmons Canc Ctr, Dallas, TX USA
[4] Univ Michigan, Dept Stat, Ann Arbor, MI 48109 USA
[5] Univ Michigan, Dept Internal Med, Ann Arbor, MI 48109 USA
[6] Vet Affairs Ctr Clin Management Res, Ann Arbor, MI USA
Source
AMERICAN JOURNAL OF GASTROENTEROLOGY | 2013 / Vol. 108 / Issue 11
Keywords
ALPHA-FETOPROTEIN; HEPATITIS-C; SURVEILLANCE; CURVE;
DOI
10.1038/ajg.2013.332
Chinese Library Classification number
R57 [Diseases of the digestive system and abdomen];
Abstract
OBJECTIVES: Predictive models for hepatocellular carcinoma (HCC) have been limited by modest accuracy and lack of validation. Machine-learning algorithms offer a novel methodology, which may improve HCC risk prognostication among patients with cirrhosis. Our study's aim was to develop and compare predictive models for HCC development among cirrhotic patients, using conventional regression analysis and machine-learning algorithms.
METHODS: We enrolled 442 patients with Child A or B cirrhosis at the University of Michigan between January 2004 and September 2006 (UM cohort) and prospectively followed them until HCC development, liver transplantation, death, or study termination. Regression analysis and machine-learning algorithms were used to construct predictive models for HCC development, which were tested on an independent validation cohort from the Hepatitis C Antiviral Long-term Treatment against Cirrhosis (HALT-C) Trial. Both models were also compared with the previously published HALT-C model. Discrimination was assessed using receiver operating characteristic curve analysis, and diagnostic accuracy was assessed with net reclassification improvement and integrated discrimination improvement statistics.
RESULTS: After a median follow-up of 3.5 years, 41 patients developed HCC. The UM regression model had a c-statistic of 0.61 (95% confidence interval (CI) 0.56-0.67), whereas the machine-learning algorithm had a c-statistic of 0.64 (95% CI 0.60-0.69) in the validation cohort. The HALT-C model had a c-statistic of 0.60 (95% CI 0.50-0.70) in the validation cohort and was outperformed by the machine-learning algorithm. The machine-learning algorithm had significantly better diagnostic accuracy as assessed by net reclassification improvement (P<0.001) and integrated discrimination improvement (P=0.04).
CONCLUSIONS: Machine-learning algorithms improve the accuracy of risk stratification for patients with cirrhosis and can be used to accurately identify patients at high risk of developing HCC.
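The abstract names three comparison statistics (c-statistic, net reclassification improvement, integrated discrimination improvement) without showing how they are computed. The following is a minimal sketch using common textbook definitions, not the authors' code: the function names are hypothetical, the NRI shown is the category-free variant (the abstract does not say which variant was used), and the toy risks and outcomes are invented purely for illustration.

```python
from itertools import product


def c_statistic(scores, labels):
    """Chance that a random case outranks a random non-case (ties count 1/2)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one case and one non-case")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p, n in product(pos, neg))
    return wins / (len(pos) * len(neg))


def category_free_nri(old, new, labels):
    """Net share of cases moved up plus net share of non-cases moved down."""
    def net_up(pairs):
        up = sum(1 for o, n in pairs if n > o)
        down = sum(1 for o, n in pairs if n < o)
        return (up - down) / len(pairs)

    cases = [(o, n) for o, n, y in zip(old, new, labels) if y == 1]
    controls = [(o, n) for o, n, y in zip(old, new, labels) if y == 0]
    return net_up(cases) - net_up(controls)


def idi(old, new, labels):
    """Change in discrimination slope (mean risk in cases minus non-cases)."""
    def slope(risks):
        ev = [r for r, y in zip(risks, labels) if y == 1]
        ne = [r for r, y in zip(risks, labels) if y == 0]
        return sum(ev) / len(ev) - sum(ne) / len(ne)

    return slope(new) - slope(old)


# Invented toy data: 1 = developed HCC, 0 = did not.
labels = [1, 1, 1, 0, 0, 0, 0, 0]
old_risk = [0.30, 0.20, 0.10, 0.25, 0.15, 0.10, 0.05, 0.05]  # hypothetical "regression" risks
new_risk = [0.40, 0.30, 0.10, 0.20, 0.15, 0.05, 0.05, 0.10]  # hypothetical "machine-learning" risks

if __name__ == "__main__":
    print("c-statistic (old):", round(c_statistic(old_risk, labels), 3))
    print("c-statistic (new):", round(c_statistic(new_risk, labels), 3))
    print("category-free NRI:", round(category_free_nri(old_risk, new_risk, labels), 3))
    print("IDI:", round(idi(old_risk, new_risk, labels), 3))
```

A real analysis would also need confidence intervals and P values (for example by bootstrapping), which this sketch leaves out.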
Pages: 1723-1730
Page count: 8
Related papers
50 in total
  • [1] Machine Learning Algorithms Outperform Conventional Regression Models in Identifying Risk Factors for Hepatocellular Carcinoma in Patients With Cirrhosis
    Singal, Amit G.
    Waljee, Akbar K.
    Mukherjee, Ashin
    Higgins, Peter D.
    Zhu, Ji
    Marrero, Jorge A.
    GASTROENTEROLOGY, 2012, 142 (05) : S984 - S984
  • [2] Machine learning algorithms are comparable to conventional regression models in predicting distant metastasis of follicular thyroid carcinoma
    Mao, Yaqian
    Lan, Huiyu
    Lin, Wei
    Liang, Jixing
    Huang, Huibin
    Li, Liantao
    Wen, Junping
    Chen, Gang
    CLINICAL ENDOCRINOLOGY, 2023, 98 (01) : 98 - 109
  • [3] Development of prognostic models for advanced multiple hepatocellular carcinoma based on Cox regression, deep learning and machine learning algorithms
    Shen, Jie
    Zhou, Yu
    Pei, Junpeng
    Yang, Dashuai
    Zhao, Kailiang
    Ding, Youming
    FRONTIERS IN MEDICINE, 2024, 11
  • [4] Novel machine learning models outperform risk scores in predicting hepatocellular carcinoma in patients with chronic viral hepatitis
    Wong, Grace Lai-Hung
    Hui, Vicki Wing-Ki
    Tan, Qingxiong
    Xu, Jingwen
    Lee, Hye Won
    Yip, Terry Cheuk-Fung
    Yang, Baoyao
    Tse, Yee-Kit
    Yin, Chong
    Lyu, Fei
    Lai, Jimmy Che-To
    Lui, Grace Chung-Yan
    Chan, Henry Lik-Yuen
    Yuen, Pong-Chi
    Wong, Vincent Wai-Sun
    JHEP REPORTS, 2022, 4 (03)
  • [5] Machine Learning Algorithms are Superior to Conventional Regression Models in Predicting Risk Stratification of COVID-19 Patients
    Ye, Jiru
    Hua, Meng
    Zhu, Feng
    RISK MANAGEMENT AND HEALTHCARE POLICY, 2021, 14 : 3159 - 3166
  • [6] Comparing Machine Learning Algorithms And Regression Models For Predicting Functional Outcome In The Stratis Registry
    Jumaa, Mouhammad A.
    Zoghi, Zeinab
    Zaidi, Syed F.
    Mueller-Kronast, Nils
    Zaidat, Osama
    Castonguay, Alicia C.
    STROKE, 2022, 53
  • [7] Machine learning models in electronic health records can outperform conventional survival models for predicting patient mortality in coronary artery disease
    Steele, Andrew J.
    Denaxas, Spiros C.
    Shah, Anoop D.
    Hemingway, Harry
    Luscombe, Nicholas M.
    PLOS ONE, 2018, 13 (08):
  • [8] Selecting Machine Learning Algorithms using Regression Models
    Doan, Tri
    Kalita, Jugal
    2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2015, : 1498 - 1505
  • [9] Predicting survival of advanced laryngeal squamous cell carcinoma: comparison of machine learning models and Cox regression models
    Zhang, Yi-Fan
    Shen, Yu-Jie
    Huang, Qiang
    Wu, Chun-Ping
    Zhou, Liang
    Ren, Heng-Lei
    SCIENTIFIC REPORTS, 2023, 13 (01)