Harnessing pre-trained models for accurate prediction of protein-ligand binding affinity

Authors
Li, Jiashan [1]
Gong, Xinqi [1]
Affiliation
[1] Renmin Univ China, Inst Math Sci, Sch Math, 59 Zhongguancun St, Beijing 100872, Peoples R China
Source
BMC BIOINFORMATICS | 2025 / Vol. 26 / Issue 01
Keywords
Binding affinity; Binding site prediction; Molecular representation; Molecular pre-training; SCORING FUNCTIONS; DOCKING; GLIDE
DOI
10.1186/s12859-025-06064-w
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
Background: The binding between proteins and ligands plays a crucial role in the field of drug discovery. However, this area currently faces numerous challenges. On one hand, existing methods are constrained by the limited availability of labeled data, often performing inadequately when addressing complex protein-ligand interactions. On the other hand, many models struggle to effectively capture the flexible variations and relative spatial relationships between proteins and ligands. These issues not only significantly hinder the advancement of protein-ligand binding research but also adversely affect the accuracy and efficiency of drug discovery. Therefore, in response to these challenges, our study aims to enhance predictive capabilities through innovative approaches, providing more reliable support for drug discovery efforts.
Methods: This study leverages a pre-trained model with spatial awareness to enhance the prediction of protein-ligand binding affinity. By perturbing the structures of small molecules in a manner consistent with physical constraints and employing self-supervised tasks, we improve the representation of small molecule structures, allowing for better adaptation to affinity predictions. Meanwhile, our approach enables the identification of potential binding sites on proteins.
Results: Our model demonstrates a significantly higher correlation coefficient in binding affinity predictions. Extensive evaluation on the PDBBind v2019 refined set, CASF, and Merck FEP benchmarks confirms the model's robustness and strong generalization across diverse datasets. Additionally, the model achieves over 95% in classification ROC for binding site identification, underscoring its high accuracy in pinpointing protein-ligand interaction regions.
Conclusion: This research presents a novel approach that not only enhances the accuracy of binding affinity predictions but also facilitates the identification of binding sites, showcasing the potential of pre-trained models in computational drug design. Data and code are available at https://github.com/MIALAB-RUC/SableBind.
Pages: 21