Improving the generalizability of protein-ligand binding predictions with AI-Bind

被引:45
|
作者
Chatterjee, Ayan [1 ]
Walters, Robin [2 ]
Shafi, Zohair [2 ]
Ahmed, Omair Shafi [2 ]
Sebek, Michael [1 ,3 ]
Gysi, Deisy [1 ,3 ,4 ]
Yu, Rose [5 ]
Eliassi-Rad, Tina [1 ,2 ,6 ,7 ]
Barabasi, Albert-Laszlo [1 ,3 ,8 ]
Menichetti, Giulia [1 ,3 ,9 ]
机构
[1] Northeastern Univ, Network Sci Inst, Boston, MA 02115 USA
[2] Northeastern Univ, Khoury Coll Comp Sci, Boston, MA USA
[3] Northeastern Univ, Dept Phys, Boston, MA 02115 USA
[4] Harvard Med Sch, Brigham & Womens Hosp, Dept Med, Boston, MA USA
[5] Univ Calif San Diego, Dept Comp Sci & Engn, San Diego, CA USA
[6] Santa Fe Inst, Santa Fe, NM USA
[7] Northeastern Univ, Inst Experiential AI, Boston, MA USA
[8] Cent European Univ, Dept Network & Data Sci, Budapest, Hungary
[9] Harvard Med Sch, Brigham & Womens Hosp, Dept Med, Channing Div Network Med, Boston, MA 02115 USA
基金
美国国家卫生研究院;
关键词
NF-KAPPA-B; CHEMISTRY; SEQUENCE; DOCKING; NETWORK;
D O I
10.1038/s41467-023-37572-z
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Identifying novel drug-target interactions is a critical and rate-limiting step in drug discovery. While deep learning models have been proposed to accelerate the identification process, here we show that state-of-the-art models fail to generalize to novel (i.e., never-before-seen) structures. We unveil the mechanisms responsible for this shortcoming, demonstrating how models rely on shortcuts that leverage the topology of the protein-ligand bipartite network, rather than learning the node features. Here we introduce AI-Bind, a pipeline that combines network-based sampling strategies with unsupervised pre-training to improve binding predictions for novel proteins and ligands. We validate AI-Bind predictions via docking simulations and comparison with recent experimental evidence, and step up the process of interpreting machine learning prediction of protein-ligand binding by identifying potential active binding sites on the amino acid sequence. AI-Bind is a high-throughput approach to identify drug-target combinations with the potential of becoming a powerful tool in drug discovery. State-of-the-art machine learning models in drug discovery fail to reliably predict the binding properties of poorly annotated proteins and small molecules. Here, the authors present AI-Bind, a machine learning pipeline to improve generalizability and interpretability of binding predictions.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Protonation and pK changes in protein-ligand binding
    Onufriev, Alexey V.
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2013, 246
  • [32] Rate Constants and Mechanisms of Protein-Ligand Binding
    Pang, Xiaodong
    Zhou, Huan-Xiang
    ANNUAL REVIEW OF BIOPHYSICS, VOL 46, 2017, 46 : 105 - 130
  • [33] Influence of Dimehylsulfoxide on Protein-Ligand Binding Affinities
    Cubrilovic, Dragana
    Zenobi, Renato
    ANALYTICAL CHEMISTRY, 2013, 85 (05) : 2724 - 2730
  • [34] Origin of high affinity protein-ligand binding
    Sharp, Kim A.
    Harpole, Kyle
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2012, 244
  • [35] A protein-ligand binding assay with proteomic potential
    Fitzgerald, Michael C.
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2009, 238 : 727 - 727
  • [36] Exploring protein-ligand recognition with binding MOAD
    Smith, RD
    Hu, LG
    Falkner, JA
    Benson, ML
    Nerothin, JP
    Carlson, HA
    JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2006, 24 (06): : 414 - 425
  • [37] Protonation and pK changes in protein-ligand binding
    Onufriev, Alexey V.
    Alexov, Emil
    QUARTERLY REVIEWS OF BIOPHYSICS, 2013, 46 (02) : 181 - 209
  • [38] Rigidity Strengthening: A Mechanism for Protein-Ligand Binding
    Nguyen, Duc D.
    Xiao, Tian
    Wang, Menglun
    Wei, Guo-Wei
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2017, 57 (07) : 1715 - 1721
  • [39] Predicting protein-ligand binding affinity with gnina
    Francoeur, Paul
    Koes, David
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2019, 258
  • [40] EvoDock: Optimization of protein-ligand binding interfaces
    Edich, M.
    Friedrichs, M.
    ACTA CRYSTALLOGRAPHICA A-FOUNDATION AND ADVANCES, 2021, 77 : C695 - C695