Explainable machine learning identifies a polygenic risk score as a key predictor of pancreatic cancer risk in the UK Biobank

被引:2
|
作者
Peduzzi, Giulia [1 ]
Felici, Alessio [1 ]
Pellungrini, Roberto
Campa, Daniele [1 ,2 ]
机构
[1] Univ Pisa, Dept Biol, Via Luca Ghini 13, I-56126 Pisa, Italy
[2] Scuola Normale Super Pisa, Classe Sci, Piazza Cavalieri 7, I-56126 Pisa, Italy
关键词
Pancreatic cancer; Risk prediction; Explainable artificial intelligence; Polygenic Risk Score; GENOME-WIDE ASSOCIATION; SUSCEPTIBILITY LOCI; BREAST-CANCER; VARIANTS; DISEASE; GENES; MODEL;
D O I
10.1016/j.dld.2024.11.010
中图分类号
R57 [消化系及腹部疾病];
学科分类号
摘要
Background: Predicting the risk of developing pancreatic ductal adenocarcinoma (PDAC) is of paramount importance, given its high mortality rate. Current PDAC risk prediction models rely on a limited number of variables, do not include genetics, and have a modest accuracy. Aim: This study aimed to develop an interpretable PDAC risk prediction model, based on machine learning (ML). Methods: Five ML models (Adaptive Boosting, eXtreme Gradient Boosting, CatBoost, Deep Forest and Random Forest) built on 56 exposome variables and a polygenic risk score (PRS) were tested in 654 PDAC cases and 1,308 controls of the UK Biobank. Additionally, SHapley Additive exPlanation (SHAP) and Global model Interpretation via the Recursive Partitioning (Girp) were employed to explain the models. Results: All models provided similar performance, but based on recall the best was CatBoost (77.10 %). SHAP highlighted age and the PRS as primary contributors across all models. Girp developed rules to discern cases from controls, identifying age, PRS, and pancreatitis in most of the rules. Conclusion: The predictive models tested have exhibited good performance, indicating their potential application in the clinical field in the near future, with the PRS playing a key role in identifying high-risk individuals as demonstrated by the explainers. (c) 2024 Published by Elsevier Ltd on behalf of Editrice Gastroenterologica Italiana S.r.l.
引用
收藏
页码:915 / 922
页数:8
相关论文
共 50 条
  • [41] Development of a Polygenic Risk Score for Metabolic Dysfunction-Associated Steatotic Liver Disease Prediction in UK Biobank
    Giardoglou, Panagiota
    Gavra, Ioanna
    Amanatidou, Athina I.
    Kalafati, Ioanna Panagiota
    Symianakis, Panagiotis
    Kafyra, Maria
    Moulos, Panagiotis
    Dedoussis, George V.
    GENES, 2025, 16 (01)
  • [42] Associations of air pollution with obesity and body fat percentage, and modification by polygenic risk score for BMI in the UK Biobank
    Furlong, Melissa A.
    Klimentidis, Yann C.
    ENVIRONMENTAL RESEARCH, 2020, 185
  • [43] Breast Cancer Polygenic Risk Score and Contralateral Breast Cancer Risk
    Kramer, Iris
    Hooning, Maartje J.
    Mavaddat, Nasim
    Hauptmann, Michael
    Keeman, Renske
    Steyerberg, Ewout W.
    Giardiello, Daniele
    Antoniou, Antonis C.
    Pharoah, Paul D. P.
    Canisius, Sander
    Abu-Ful, Zumuruda
    Andrulis, Irene L.
    Anton-Culver, Hoda
    Aronson, Kristan J.
    Augustinsson, Annelie
    Becher, Heiko
    Beckmann, Matthias W.
    Behrens, Sabine
    Benitez, Javier
    Bermisheva, Marina
    Bogdanova, Natalia, V
    Bojesen, Stig E.
    Bolla, Manjeet K.
    Bonanni, Bernardo
    Brauch, Hiltrud
    Bremer, Michael
    Brucker, Sara Y.
    Burwinkel, Barbara
    Castelao, Jose E.
    Chan, Tsun L.
    Chang-Claude, Jenny
    Chanock, Stephen J.
    Chenevix-Trench, Georgia
    Choi, Ji-Yeob
    Clarke, Christine L.
    Collee, J. Margriet
    Couch, Fergus J.
    Cox, Angela
    Cross, Simon S.
    Czene, Kamila
    Daly, Mary B.
    Devilee, Peter
    Dork, Thilo
    dos-Santos-Silva, Isabel
    Dunning, Alison M.
    Dwek, Miriam
    Eccles, Diana M.
    Evans, D. Gareth
    Fasching, Peter A.
    Flyger, Henrik
    AMERICAN JOURNAL OF HUMAN GENETICS, 2020, 107 (05) : 837 - 848
  • [44] Polygenic risk score is a predictor of adenomatous polyps at screening colonoscopy
    Northcutt, Michael J.
    Shi, Zhuqing
    Zijlstra, Michael
    Shah, Ayush
    Zheng, Siqun
    Yen, Eugene F.
    Khan, Omar
    Beig, Mohammad Imran
    Imas, Polina
    Vanderloo, Adam
    Ansari, Obaid
    Xu, Jianfeng
    Goldstein, Jay L.
    BMC GASTROENTEROLOGY, 2021, 21 (01)
  • [45] Complement polygenic risk score as a predictor of Alzheimer's Disease
    Keat, Samuel
    Morales, Atahualpa Castillo
    Sims, Rebecca
    Williams, Julie
    Morgan, B. Paul
    Carpanini, Sarah
    EUROPEAN JOURNAL OF IMMUNOLOGY, 2024, 54 : 186 - 186
  • [46] Polygenic risk score identifies associations between sleep duration and diseases determined from an electronic medical record biobank
    Dashti, Hassan S.
    Redline, Susan
    Saxena, Richa
    SLEEP, 2019, 42 (03)
  • [47] Polygenic risk score is a predictor of adenomatous polyps at screening colonoscopy
    Michael J. Northcutt
    Zhuqing Shi
    Michael Zijlstra
    Ayush Shah
    Siqun Zheng
    Eugene F. Yen
    Omar Khan
    Mohammad Imran Beig
    Polina Imas
    Adam Vanderloo
    Obaid Ansari
    Jianfeng Xu
    Jay L. Goldstein
    BMC Gastroenterology, 21
  • [48] Physical activity, polygenic risk score, and colorectal cancer risk
    Chen, Xuechen
    Guo, Feng
    Chang-Claude, Jenny
    Hoffmeister, Michael
    Brenner, Hermann
    CANCER MEDICINE, 2023, 12 (04): : 4655 - 4666
  • [49] Evaluation of polygenic risk score for risk prediction of gastric cancer
    Xiao-Yu Wang
    Li-Li Wang
    Lin Xu
    Shu-Zhen Liang
    Meng-Chao Yu
    Qiu-Yue Zhang
    Quan-Jiang Dong
    World Journal of Gastrointestinal Oncology, 2023, 15 (02) : 276 - 285
  • [50] A polygenic risk score predicts breast cancer risk in Latinas
    Shieh, Yiwey
    Fejerman, Laura
    Sawyer, Sarah D.
    Hu, Donglei
    Huntsman, Scott
    John, Esther M.
    Kushi, Lawrence H.
    Torres-Mejia, Gabriela
    Weitzel, Jeffrey N.
    Haiman, Christopher A.
    Ziv, Elad
    Neuhausen, Susan L.
    CANCER RESEARCH, 2019, 79 (13)