Explainable machine learning identifies a polygenic risk score as a key predictor of pancreatic cancer risk in the UK Biobank

被引:2
|
作者
Peduzzi, Giulia [1 ]
Felici, Alessio [1 ]
Pellungrini, Roberto
Campa, Daniele [1 ,2 ]
机构
[1] Univ Pisa, Dept Biol, Via Luca Ghini 13, I-56126 Pisa, Italy
[2] Scuola Normale Super Pisa, Classe Sci, Piazza Cavalieri 7, I-56126 Pisa, Italy
关键词
Pancreatic cancer; Risk prediction; Explainable artificial intelligence; Polygenic Risk Score; GENOME-WIDE ASSOCIATION; SUSCEPTIBILITY LOCI; BREAST-CANCER; VARIANTS; DISEASE; GENES; MODEL;
D O I
10.1016/j.dld.2024.11.010
中图分类号
R57 [消化系及腹部疾病];
学科分类号
摘要
Background: Predicting the risk of developing pancreatic ductal adenocarcinoma (PDAC) is of paramount importance, given its high mortality rate. Current PDAC risk prediction models rely on a limited number of variables, do not include genetics, and have a modest accuracy. Aim: This study aimed to develop an interpretable PDAC risk prediction model, based on machine learning (ML). Methods: Five ML models (Adaptive Boosting, eXtreme Gradient Boosting, CatBoost, Deep Forest and Random Forest) built on 56 exposome variables and a polygenic risk score (PRS) were tested in 654 PDAC cases and 1,308 controls of the UK Biobank. Additionally, SHapley Additive exPlanation (SHAP) and Global model Interpretation via the Recursive Partitioning (Girp) were employed to explain the models. Results: All models provided similar performance, but based on recall the best was CatBoost (77.10 %). SHAP highlighted age and the PRS as primary contributors across all models. Girp developed rules to discern cases from controls, identifying age, PRS, and pancreatitis in most of the rules. Conclusion: The predictive models tested have exhibited good performance, indicating their potential application in the clinical field in the near future, with the PRS playing a key role in identifying high-risk individuals as demonstrated by the explainers. (c) 2024 Published by Elsevier Ltd on behalf of Editrice Gastroenterologica Italiana S.r.l.
引用
收藏
页码:915 / 922
页数:8
相关论文
共 50 条
  • [1] Assessments of dietary intake and polygenic risk score in associations with colorectal cancer risk: evidence from the UK Biobank
    Tung Hoang
    Sooyoung Cho
    Ji-Yeob Choi
    Daehee Kang
    Aesun Shin
    BMC Cancer, 23 (1)
  • [2] High polygenic risk score is a risk factor associated with colorectal cancer based on data from the UK Biobank
    Yang, Mei
    Narasimhan, Vagheesh M.
    Zhan, F. Benjamin
    PLOS ONE, 2023, 18 (11):
  • [3] Healthy Diet, Polygenic Risk Score, and Upper Gastrointestinal Cancer Risk: A Prospective Study from UK Biobank
    Liu, Wenmin
    Wang, Tianpei
    Zhu, Meng
    Jin, Guangfu
    NUTRIENTS, 2023, 15 (06)
  • [4] Assessments of dietary intake and polygenic risk score in associations with colorectal cancer risk: evidence from the UK Biobank
    Hoang, Tung
    Cho, Sooyoung
    Choi, Ji-Yeob
    Kang, Daehee
    Shin, Aesun
    BMC CANCER, 2023, 23 (01)
  • [5] Sleep and Risk of Pancreatic Cancer in the UK Biobank
    Freeman, Joshua R.
    Saint-Maurice, Pedro F.
    Zhang, Ting
    Matthews, Charles E.
    Stolzenberg-Solomon, Rachael Z.
    CANCER EPIDEMIOLOGY BIOMARKERS & PREVENTION, 2024, 33 (04) : 624 - 627
  • [6] Predicting Pancreatic Cancer in the UK Biobank Cohort Using Polygenic Risk Scores and Diabetes Mellitus
    Sharma, Shreya
    Tapper, William J.
    Collins, Andrew
    Hamady, Zaed Z. R.
    GASTROENTEROLOGY, 2022, 162 (06) : 1665 - +
  • [7] The interaction between systemic inflammatory markers and polygenic risk score in breast cancer risk: A cohort study in the UK Biobank
    Yang, Zixuan
    Zhang, Yanyu
    Song, Mengjie
    Huang, Xiaoxi
    Lin, Yuxiang
    Yang, Haomin
    CANCER EPIDEMIOLOGY, 2023, 87
  • [8] Venous Thromboembolism Polygenic Risk Score Associates With Pulmonary Hypertension in the UK Biobank
    Clapham, Katharine R.
    Mesbah Uddin, Md
    Honigberg, Michael C.
    Gilliland, Thomas
    Ruan, Yunfeng
    Natarajan, Pradeep
    CIRCULATION-GENOMIC AND PRECISION MEDICINE, 2022, 15 (06): : 605 - 607
  • [9] A systematic evaluation of the performance and properties of the UK Biobank Polygenic Risk Score (PRS) Release
    Thompson, Deborah J.
    Wells, Daniel
    Selzam, Saskia
    Peneva, Iliana
    Moore, Rachel
    Sharp, Kevin
    Tarran, William A.
    Beard, Edward J.
    Riveros-Mckay, Fernando
    Giner-Delgado, Carla
    Palmer, Duncan
    Seth, Priyanka
    Harrison, James
    Futema, Marta
    McVean, Gil
    Plagnol, Vincent
    Donnelly, Peter
    Weale, Michael E.
    PLOS ONE, 2024, 19 (09):
  • [10] Assessing the Value of Incorporating a Polygenic Risk Score with Nongenetic Factors for Predicting Breast Cancer Diagnosis in the UK Biobank
    Collister, Jennifer A.
    Liu, Xiaonan
    Littlejohns, Thomas J.
    Cuzick, Jack
    Clifton, Lei
    Hunter, David J.
    CANCER EPIDEMIOLOGY BIOMARKERS & PREVENTION, 2024, 33 (06) : 812 - 820