Interaction-Based Inductive Bias in Graph Neural Networks: Enhancing Protein-Ligand Binding Affinity Predictions From 3D Structures

被引:5
|
作者
Yang, Ziduo [1 ]
Zhong, Weihe [2 ]
Lv, Qiujie [1 ]
Dong, Tiejun [1 ]
Chen, Guanxing [1 ]
Chen, Calvin Yu-Chian [3 ,4 ,5 ,6 ]
机构
[1] Sun Yat Sen Univ, Sch Intelligent Syst Engn, Artificial Intelligence Med Res Ctr, Shenzhen Campus, Shenzhen 518107, Peoples R China
[2] Chinese Acad Sci, Shanghai Inst Mat Med, Zhongshan Inst Drug Discovery, Zhongshan 528400, Peoples R China
[3] Peking Univ, Shenzhen Grad Sch, AI Sci AI4S Preferred Program, Shenzhen 518055, Peoples R China
[4] Peking Univ, Shenzhen Grad Sch, Sch Elect & Comp Engn, Shenzhen 518055, Peoples R China
[5] China Med Univ Hosp, Dept Med Res, Taichung 404, Taiwan
[6] Asia Univ, Dept Bioinformat & Med Engn, Taichung 413, Taiwan
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Proteins; Programmable logic arrays; Three-dimensional displays; Predictive models; Graph neural networks; Data models; Convolution; Protein-ligand binding affinity; graph neural networks; inductive bias; drug-target interaction; structure-based virtual screening; SCORING FUNCTIONS; ACCURATE DOCKING; DATABASE; GLIDE;
D O I
10.1109/TPAMI.2024.3400515
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Inductive bias in machine learning (ML) is the set of assumptions describing how a model makes predictions. Different ML-based methods for protein-ligand binding affinity (PLA) prediction have different inductive biases, leading to different levels of generalization capability and interpretability. Intuitively, the inductive bias of an ML-based model for PLA prediction should fit in with biological mechanisms relevant for binding to achieve good predictions with meaningful reasons. To this end, we propose an interaction-based inductive bias to restrict neural networks to functions relevant for binding with two assumptions: 1) A protein-ligand complex can be naturally expressed as a heterogeneous graph with covalent and non-covalent interactions; 2) The predicted PLA is the sum of pairwise atom-atom affinities determined by non-covalent interactions. The interaction-based inductive bias is embodied by an explainable heterogeneous interaction graph neural network (EHIGN) for explicitly modeling pairwise atom-atom interactions to predict PLA from 3D structures. Extensive experiments demonstrate that EHIGN achieves better generalization capability than other state-of-the-art ML-based baselines in PLA prediction and structure-based virtual screening. More importantly, comprehensive analyses of distance-affinity, pose-affinity, and substructure-affinity relations suggest that the interaction-based inductive bias can guide the model to learn atomic interactions that are consistent with physical reality. As a case study to demonstrate practical usefulness, our method is tested for predicting the efficacy of Nirmatrelvir against SARS-CoV-2 variants. EHIGN successfully recognizes the changes in the efficacy of Nirmatrelvir for different SARS-CoV-2 variants with meaningful reasons.
引用
收藏
页码:8191 / 8208
页数:18
相关论文
共 28 条
  • [21] Performance of HADDOCK and a simple contact-based protein-ligand binding affinity predictor in the D3R Grand Challenge 2
    Kurkcuoglu, Zeynep
    Koukos, Panagiotis I.
    Citro, Nevia
    Trellet, Mikael E.
    Rodrigues, J. P. G. L. M.
    Moreira, Irina S.
    Roel-Touris, Jorge
    Melquiond, Adrien S. J.
    Geng, Cunliang
    Schaarschmidt, Jorg
    Xue, Li C.
    Vangone, Anna
    Bonvin, A. M. J. J.
    JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2018, 32 (01) : 175 - 185
  • [22] BgN-Score and BsN-Score: Bagging and boosting based ensemble neural networks scoring functions for accurate binding affinity prediction of protein-ligand complexes
    Hossam M Ashtawy
    Nihar R Mahapatra
    BMC Bioinformatics, 16
  • [23] BgN-Score and BsN-Score: Bagging and boosting based ensemble neural networks scoring functions for accurate binding affinity prediction of protein-ligand complexes
    Ashtawy, Hossam M.
    Mahapatra, Nihar R.
    BMC BIOINFORMATICS, 2015, 16
  • [24] Sfcnn: a novel scoring function based on 3D convolutional neural network for accurate and stable protein–ligand affinity prediction
    Yu Wang
    Zhengxiao Wei
    Lei Xi
    BMC Bioinformatics, 23
  • [25] Improving binding mode and binding affinity predictions of docking by ligand-based search of protein conformations: evaluation in D3R grand challenge 2015
    Xu, Xianjin
    Yan, Chengfei
    Zou, Xiaoqin
    JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2017, 31 (08) : 689 - 699
  • [26] Improving binding mode and binding affinity predictions of docking by ligand-based search of protein conformations: evaluation in D3R grand challenge 2015
    Xianjin Xu
    Chengfei Yan
    Xiaoqin Zou
    Journal of Computer-Aided Molecular Design, 2017, 31 : 689 - 699
  • [27] Using 3D point cloud and graph-based neural networks to improve the estimation of pulmonary function tests from chest CT
    Jia, Jingnan
    Yu, Bo
    Mody, Prerak
    Ninaber, Maarten K.
    Schouffoer, Anne A.
    de Vries-Bouwstra, Jeska K.
    Kroft, Lucia J.M.
    Staring, Marius
    Stoel, Berend C.
    Computers in Biology and Medicine, 2024, 182
  • [28] Representing Protein 3D Structures in Spherical Coordinates - Two Applications: 1. Detection of Invaginations, Protrusions, and Potential Ligand Binding Sites; and 2. Separation of Protein Hydrophilic Outer Layer from the Hydrophobic Core
    Reyes, Vicente M.
    JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2009, 26 (06): : 874 - 875