Discovering and Overcoming the Bias in Neoantigen Identification by Unified Machine Learning Models

Cited by: 0

Authors:

Zhang, Ziting

Wu, Wenxu

Wei, Lei

Wang, Xiaowo [1]

Affiliations:

[1] Tsinghua Univ, Minist Educ, Key Lab Bioinformat, Beijing, Peoples R China

Source:

RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY, RECOMB 2024 | 2024 / Vol. 14758

Keywords:

neoantigen identification; data bias; machine learning; attention mechanism

DOI:

10.1007/978-1-0716-3989-4_28

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods]

Discipline classification code:

081202

Abstract:

Neoantigens, formed by genetic mutations in tumor cells, are abnormal peptides that can trigger immune responses. Precisely identifying neoantigens from vast mutations is the key to tumor immunotherapy design. There are three main steps in the neoantigen immune process, i.e., binding with MHCs, extracellular presentation, and induction of immunogenicity. Various machine learning methods have been developed to predict the probability of one of the three events, but the overall accuracy of neoantigen identification remains far from satisfactory. To gain a systematic understanding of the key factors of neoantigen identification, we developed a unified transformer-based machine learning framework ImmuBPI that comprised three tasks and achieved state-of-the-art performance. Through cross-task model interpretation, we have discovered an underestimation of data bias for immunogenicity prediction, which has led to skewed discriminatory boundaries of current machine learning models. We designed a mutual information-based debiasing strategy that performed well on mutation variants immunogenicity prediction, a task where current methods fell short. Clustering immunogenic peptides with debiased representations uncovers unique preferences for biophysical properties, such as hydrophobicity and polarity. These observations serve as an important complement to the past understanding that accurately predicting neoantigen is constrained by limited data, highlighting the necessity of bias control. We expect this study will provide novel and insightful perspectives for neoantigen prediction methods and benefit future neoantigen-mediated immunotherapy designs.


Pages: 348-351

Page count: 4
