Discovering and Overcoming the Bias in Neoantigen Identification by Unified Machine Learning Models

被引:0
|
作者
Zhang, Ziting
Wu, Wenxu
Wei, Lei
Wang, Xiaowo [1 ]
机构
[1] Tsinghua Univ, Minist Educ, Key Lab Bioinformat, Beijing, Peoples R China
来源
RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY, RECOMB 2024 | 2024年 / 14758卷
关键词
neoantigen identification; data bias; machine learning; attention mechanism;
D O I
10.1007/978-1-0716-3989-4_28
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Neoantigens, formed by genetic mutations in tumor cells, are abnormal peptides that can trigger immune responses. Precisely identifying neoantigens from vast mutations is the key to tumor immunotherapy design. There are three main steps in the neoantigen immune process, i.e., binding with MHCs, extracellular presentation, and induction of immunogenicity. Various machine learning methods have been developed to predict the probability of one of the three events, but the overall accuracy of neoantigen identification remains far from satisfactory. To gain a systematic understanding of the key factors of neoantigen identification, we developed a unified transformer-based machine learning framework ImmuBPI that comprised three tasks and achieved state-of-the-art performance. Through cross-task model interpretation, we have discovered an underestimation of data bias for immunogenicity prediction, which has led to skewed discriminatory boundaries of current machine learning models. We designed a mutual information-based debiasing strategy that performed well on mutation variants immunogenicity prediction, a task where current methods fell short. Clustering immunogenic peptides with debiased representations uncovers unique preferences for biophysical properties, such as hydrophobicity and polarity. These observations serve as an important complement to the past understanding that accurately predicting neoantigen is constrained by limited data, highlighting the necessity of bias control. We expect this study will provide novel and insightful perspectives for neoantigen prediction methods and benefit future neoantigen-mediated immunotherapy designs.
引用
收藏
页码:348 / 351
页数:4
相关论文
共 50 条
  • [31] Sociodemographic bias in clinical machine learning models: a scoping review of algorithmic bias instances and mechanisms
    Colacci, Michael
    Huang, Yu Qing
    Postill, Gemma
    Zhelnov, Pavel
    Fennelly, Orna
    Verma, Amol
    Straus, Sharon
    Tricco, Andrea C.
    JOURNAL OF CLINICAL EPIDEMIOLOGY, 2025, 178
  • [32] Preface: Overcoming opacity in machine learning
    Zednik, Carlos
    Boelsen, Hannes
    AISB Convention 2021: Communication and Conversations, 2021,
  • [33] Unified virtual ADME/Tox using a hierarchy of machine learning models.
    Lanza, G
    Mydlowec, W
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2002, 224 : U341 - U341
  • [34] A story of the artificial ant: Discovering the correct bias for learning
    Kuschchu, I
    CEC: 2003 CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-4, PROCEEDINGS, 2003, : 2777 - 2784
  • [35] Women Also Snowboard: Overcoming Bias in Captioning Models
    Hendricks, Lisa Anne
    Burns, Kaylee
    Saenko, Kate
    Darrell, Trevor
    Rohrbach, Anna
    COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 : 793 - 811
  • [36] Risk Identification Questionnaire for Detecting Unintended Bias in the Machine Learning Development Lifecycle
    Lee, Michelle Seng Ah
    Singh, Jatinder
    AIES '21: PROCEEDINGS OF THE 2021 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY, 2021, : 704 - 714
  • [37] Machine Learning: Discovering the Future of Medical Imaging
    Erickson, Bradley J.
    JOURNAL OF DIGITAL IMAGING, 2017, 30 (04) : 391 - 391
  • [38] Machine Learning: Discovering the Future of Medical Imaging
    Bradley J. Erickson
    Journal of Digital Imaging, 2017, 30 : 391 - 391
  • [39] Assessment of impact of racial demographics on bias in machine learning models for IED Detection
    Hanson, E.
    Robertson, H.
    Sandridge, K.
    Bagherzadeh, N.
    Chan, A.
    Kleinschmidt, D.
    Arslan, A.
    Alday, P.
    Westover, M. B.
    Donoghue, J.
    Pathmanathan, J.
    EPILEPSIA, 2023, 64 : 478 - 479
  • [40] Predicting Prenatal Depression and Assessing Model Bias Using Machine Learning Models
    Huang, Yongchao
    Alvernaz, Suzanne
    Kim, Sage J.
    Maki, Pauline
    Dai, Yang
    Bernabe, Beatriz Penalver
    BIOLOGICAL PSYCHIATRY: GLOBAL OPEN SCIENCE, 2024, 4 (06):