Improving the generalizability of protein-ligand binding predictions with AI-Bind

被引:45
|
作者
Chatterjee, Ayan [1 ]
Walters, Robin [2 ]
Shafi, Zohair [2 ]
Ahmed, Omair Shafi [2 ]
Sebek, Michael [1 ,3 ]
Gysi, Deisy [1 ,3 ,4 ]
Yu, Rose [5 ]
Eliassi-Rad, Tina [1 ,2 ,6 ,7 ]
Barabasi, Albert-Laszlo [1 ,3 ,8 ]
Menichetti, Giulia [1 ,3 ,9 ]
机构
[1] Northeastern Univ, Network Sci Inst, Boston, MA 02115 USA
[2] Northeastern Univ, Khoury Coll Comp Sci, Boston, MA USA
[3] Northeastern Univ, Dept Phys, Boston, MA 02115 USA
[4] Harvard Med Sch, Brigham & Womens Hosp, Dept Med, Boston, MA USA
[5] Univ Calif San Diego, Dept Comp Sci & Engn, San Diego, CA USA
[6] Santa Fe Inst, Santa Fe, NM USA
[7] Northeastern Univ, Inst Experiential AI, Boston, MA USA
[8] Cent European Univ, Dept Network & Data Sci, Budapest, Hungary
[9] Harvard Med Sch, Brigham & Womens Hosp, Dept Med, Channing Div Network Med, Boston, MA 02115 USA
基金
美国国家卫生研究院;
关键词
NF-KAPPA-B; CHEMISTRY; SEQUENCE; DOCKING; NETWORK;
D O I
10.1038/s41467-023-37572-z
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Identifying novel drug-target interactions is a critical and rate-limiting step in drug discovery. While deep learning models have been proposed to accelerate the identification process, here we show that state-of-the-art models fail to generalize to novel (i.e., never-before-seen) structures. We unveil the mechanisms responsible for this shortcoming, demonstrating how models rely on shortcuts that leverage the topology of the protein-ligand bipartite network, rather than learning the node features. Here we introduce AI-Bind, a pipeline that combines network-based sampling strategies with unsupervised pre-training to improve binding predictions for novel proteins and ligands. We validate AI-Bind predictions via docking simulations and comparison with recent experimental evidence, and step up the process of interpreting machine learning prediction of protein-ligand binding by identifying potential active binding sites on the amino acid sequence. AI-Bind is a high-throughput approach to identify drug-target combinations with the potential of becoming a powerful tool in drug discovery. State-of-the-art machine learning models in drug discovery fail to reliably predict the binding properties of poorly annotated proteins and small molecules. Here, the authors present AI-Bind, a machine learning pipeline to improve generalizability and interpretability of binding predictions.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Improving the generalizability of protein-ligand binding predictions with AI-Bind
    Ayan Chatterjee
    Robin Walters
    Zohair Shafi
    Omair Shafi Ahmed
    Michael Sebek
    Deisy Gysi
    Rose Yu
    Tina Eliassi-Rad
    Albert-László Barabási
    Giulia Menichetti
    Nature Communications, 14
  • [2] Quantitative predictions of protein-ligand binding affinities
    Mobley, David L.
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2009, 237
  • [3] Recent advances in AI-driven protein-ligand interaction predictions
    Sim, Jaemin
    Kim, Dongwoo
    Kim, Bomin
    Choi, Jieun
    Lee, Juyong
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 2025, 92
  • [4] Enhancing Generalizability in Protein-Ligand Binding Affinity Prediction with Multimodal Contrastive Learning
    Luo, Ding
    Liu, Dandan
    Qu, Xiaoyang
    Dong, Lina
    Wang, Binju
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2024, 64 (06) : 1892 - 1906
  • [5] Enhancing Protein-Ligand Binding Affinity Predictions Using Neural Network Potentials
    Zariquiey, Francesc Sabanes
    Galvelis, Raimondas
    Gallicchio, Emilio
    Chodera, John D.
    Markland, Thomas E.
    De Fabritiis, Gianni
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2024, 64 (05) : 1481 - 1485
  • [6] FunFOLDQA: A Quality Assessment Tool for Protein-Ligand Binding Site Residue Predictions
    Roche, Daniel B.
    Buenavista, Maria T.
    McGuffin, Liam J.
    PLOS ONE, 2012, 7 (05):
  • [7] Exploring, assessing and improving energy landscapes for protein-ligand binding.
    Brooks, CL
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2004, 227 : U899 - U899
  • [8] Improving the scoring of protein-ligand binding affinity by including the desolvation energy
    Li, Yan
    Wang, Renxiao
    Holloway, M.
    Gao, Ying-Duo
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2016, 251
  • [9] Improving protein-ligand interactions.
    Wandless, TJ
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2000, 219 : U137 - U137
  • [10] Protein-ligand binding affinity predictions by implicit solvent simulations: A tool for lead optimization?
    Michel, Julien
    Verdonk, Marcel L.
    Essex, Jonathan W.
    JOURNAL OF MEDICINAL CHEMISTRY, 2006, 49 (25) : 7427 - 7439