Accurate prediction of protein-ligand interactions by combining physical energy functions and graph-neural networks

被引:0
|
作者
Hong, Yiyu [1 ]
Ha, Junsu [1 ]
Sim, Jaemin [2 ]
Lim, Chae Jo [3 ]
Oh, Kwang-Seok [3 ]
Chandrasekaran, Ramakrishnan [1 ]
Kim, Bomin [6 ]
Choi, Jieun [6 ]
Ko, Junsu [1 ]
Shin, Woong-Hee [1 ,4 ]
Lee, Juyong [1 ,2 ,5 ,6 ]
机构
[1] Arontier Co, 241 Gangnam Daero, Seoul 06735, South Korea
[2] Seoul Natl Univ, Grad Sch Convergence Sci & Technol, Dept Mol Med & Biopharmaceut Sci, Seoul 08826, South Korea
[3] Korea Res Inst Chem Technol, Data Convergence Drug Res Ctr, Daejeon 34114, South Korea
[4] Korea Univ, Dept Med, Coll Med, Seoul 02841, South Korea
[5] Seoul Natl Univ, Res Inst Pharmaceut Sci, Coll Pharm, Seoul 08826, South Korea
[6] Seoul Natl Univ, Coll Pharm, Seoul 08826, South Korea
来源
JOURNAL OF CHEMINFORMATICS | 2024年 / 16卷 / 01期
基金
新加坡国家研究基金会;
关键词
Virtual screening; Protein-ligand binding prediction; Hit discovery; Protein-ligand docking; Graph neural network; Protein-ligand binding pose prediction; Deep-learning; Physics-based scoring function; EMPIRICAL SCORING FUNCTIONS; DRUG DISCOVERY; DOCKING; MODEL;
D O I
10.1186/s13321-024-00912-2
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
We introduce an advanced model for predicting protein-ligand interactions. Our approach combines the strengths of graph neural networks with physics-based scoring methods. Existing structure-based machine-learning models for protein-ligand binding prediction often fall short in practical virtual screening scenarios, hindered by the intricacies of binding poses, the chemical diversity of drug-like molecules, and the scarcity of crystallographic data for protein-ligand complexes. To overcome the limitations of existing machine learning-based prediction models, we propose a novel approach that fuses three independent neural network models. One classification model is designed to perform binary prediction of a given protein-ligand complex pose. The other two regression models are trained to predict the binding affinity and root-mean-square deviation of a ligand conformation from an input complex structure. We trained the model to account for both deviations in experimental and predicted binding affinities and pose prediction uncertainties. By effectively integrating the outputs of the triplet neural networks with a physics-based scoring function, our model showed a significantly improved performance in hit identification. The benchmark results with three independent decoy sets demonstrate that our model outperformed existing models in forward screening. Our model achieved top 1% enrichment factors of 32.7 and 23.1 with the CASF2016 and DUD-E benchmark sets, respectively. The benchmark results using the LIT-PCBA set further confirmed its higher average enrichment factors, emphasizing the model's efficiency and generalizability. The model's efficiency was further validated by identifying 23 active compounds from 63 candidates in experimental screening for autotaxin inhibitors, demonstrating its practical applicability in hit discovery.Scientific contributionOur work introduces a novel training strategy for a protein-ligand binding affinity prediction model by integrating the outputs of three independent sub-models and utilizing expertly crafted decoy sets. The model showcases exceptional performance across multiple benchmarks. The high enrichment factors in the LIT-PCBA benchmark demonstrate its potential to accelerate hit discovery.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Ensemble Neural Networks Scoring Functions for Accurate Binding Affinity Prediction of Protein-Ligand Complexes
    Ashtawy, Hossam M.
    Mahapatra, Nihar R.
    PATTERN RECOGNITION IN BIOINFORMATICS, PRIB 2014, 2014, 8626 : 129 - 130
  • [2] Decoding the protein-ligand interactions using parallel graph neural networks
    Knutson, Carter
    Bontha, Mridula
    Bilbrey, Jenna A.
    Kumar, Neeraj
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [3] Scoring Functions for Prediction of Protein-Ligand Interactions
    Wang, Jui-Chih
    Lin, Jung-Hsin
    CURRENT PHARMACEUTICAL DESIGN, 2013, 19 (12) : 2174 - 2182
  • [4] Accurate prediction of hydrogen bond networks in protein-ligand complexes
    Detering, Carsten
    Lippert, Tobias
    Rarey, Matthias
    Claussen, Holger
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2012, 244
  • [5] Quantum Graph Neural Networks Based Protein-Ligand Classification
    Ganguly, Srinjoy
    Chandilkar, Vaishnavi
    Jain, Prateek
    Bertel, Luis Gerardo Ayala
    ARTIFICIAL INTELLIGENCE AND KNOWLEDGE PROCESSING, AIKP 2023, 2024, 2127 : 146 - 159
  • [6] Comparative evaluation of five scoring functions for accurate prediction of protein-ligand binding energy.
    Puvanendrampillai, D
    Marsden, PM
    Mitchell, JBO
    Glen, RC
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2004, 227 : U1018 - U1018
  • [7] Structure-aware Interactive Graph Neural Networks for the Prediction of Protein-Ligand Binding Affinity
    Li, Shuangli
    Zhou, Jingbo
    Xu, Tong
    Huang, Liang
    Wang, Fan
    Xiong, Haoyi
    Huang, Weili
    Dou, Dejing
    Xiong, Hui
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 975 - 985
  • [8] Learning characteristics of graph neural networks predicting protein-ligand affinities
    Mastropietro, Andrea
    Pasculli, Giuseppe
    Bajorath, Juergen
    NATURE MACHINE INTELLIGENCE, 2023, 5 (12) : 1427 - 1436
  • [9] Enhancing protein-ligand binding affinity prediction through sequential fusion of graph and convolutional neural networks
    Yang, Yimin
    Zhang, Ruiqin
    Lin, Zijing
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 2024, 45 (32) : 2929 - 2940
  • [10] DeepBindGCN: Integrating Molecular Vector Representation with Graph Convolutional Neural Networks for Protein-Ligand Interaction Prediction
    Zhang, Haiping
    Saravanan, Konda Mani
    Zhang, John Z. H.
    MOLECULES, 2023, 28 (12):