BgN-Score and BsN-Score: Bagging and boosting based ensemble neural networks scoring functions for accurate binding affinity prediction of protein-ligand complexes

被引:49
|
作者
Ashtawy, Hossam M. [1 ]
Mahapatra, Nihar R. [1 ]
机构
[1] Michigan State Univ, Dept Elect & Comp Engn, E Lansing, MI 48824 USA
来源
BMC BIOINFORMATICS | 2015年 / 16卷
基金
美国国家科学基金会;
关键词
MOLECULAR DOCKING; RECOGNITION; VALIDATION; DISCOVERY;
D O I
10.1186/1471-2105-16-S4-S8
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Accurately predicting the binding affinities of large sets of protein-ligand complexes is a key challenge in computational biomolecular science, with applications in drug discovery, chemical biology, and structural biology. Since a scoring function (SF) is used to score, rank, and identify drug leads, the fidelity with which it predicts the affinity of a ligand candidate for a protein's binding site has a significant bearing on the accuracy of virtual screening. Despite intense efforts in developing conventional SFs, which are either force-field based, knowledge-based, or empirical, their limited predictive power has been a major roadblock toward cost-effective drug discovery. Therefore, in this work, we present novel SFs employing a large ensemble of neural networks (NN) in conjunction with a diverse set of physicochemical and geometrical features characterizing protein-ligand complexes to predict binding affinity. Results: We assess the scoring accuracies of two new ensemble NN SFs based on bagging (BgN-Score) and boosting (BsN-Score), as well as those of conventional SFs in the context of the 2007 PDBbind benchmark that encompasses a diverse set of high-quality protein families. We find that BgN-Score and BsN-Score have more than 25% better Pearson's correlation coefficient (0.804 and 0.816 vs. 0.644) between predicted and measured binding affinities compared to that achieved by a state-of-the-art conventional SF. In addition, these ensemble NN SFs are also at least 19% more accurate (0.804 and 0.816 vs. 0.675) than SFs based on a single neural network that has been traditionally used in drug discovery applications. We further find that ensemble models based on NNs surpass SFs based on the decision-tree ensemble technique Random Forests. Conclusions: Ensemble neural networks SFs, BgN-Score and BsN-Score, are the most accurate in predicting binding affinity of protein-ligand complexes among the considered SFs. Moreover, their accuracies are even higher when they are used to predict binding affinities of protein-ligand complexes that are related to their training sets.
引用
收藏
页数:12
相关论文
共 40 条
  • [1] BgN-Score and BsN-Score: Bagging and boosting based ensemble neural networks scoring functions for accurate binding affinity prediction of protein-ligand complexes
    Hossam M Ashtawy
    Nihar R Mahapatra
    BMC Bioinformatics, 16
  • [2] Ensemble Neural Networks Scoring Functions for Accurate Binding Affinity Prediction of Protein-Ligand Complexes
    Ashtawy, Hossam M.
    Mahapatra, Nihar R.
    PATTERN RECOGNITION IN BIOINFORMATICS, PRIB 2014, 2014, 8626 : 129 - 130
  • [3] AK-Score: Accurate Protein-Ligand Binding Affinity Prediction Using an Ensemble of 3D-Convolutional Neural Networks
    Kwon, Yongbeom
    Shin, Woong-Hee
    Ko, Junsu
    Lee, Juyong
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2020, 21 (22) : 1 - 16
  • [4] SFCscore: Scoring functions for affinity prediction of protein-ligand complexes
    Sotriffer, Christoph A.
    Sanschagrin, Paul
    Matter, Hans
    Klebe, Gerhard
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2008, 73 (02) : 395 - 419
  • [5] Empirical Scoring Functions for Affinity Prediction of Protein-ligand Complexes
    Pason, Lukas P.
    Sotriffer, Christoph A.
    MOLECULAR INFORMATICS, 2016, 35 (11-12) : 541 - 548
  • [6] EISA-Score: Element Interactive Surface Area Score for Protein-Ligand Binding Affinity Prediction
    Rana, Md Masud
    Nguyen, Duc Duy
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2022, 62 (18) : 4329 - 4341
  • [7] Neural networks prediction of the protein-ligand binding affinity with circular fingerprints
    Yin, Zuode
    Song, Wei
    Li, Baiyi
    Wang, Fengfei
    Xie, Liangxu
    Xu, Xiaojun
    TECHNOLOGY AND HEALTH CARE, 2023, 31 : S487 - S495
  • [8] Accurate prediction of dynamic protein-ligand binding using P-score ranking
    Ibrahim, Peter E. G. F.
    Zuccotto, Fabio
    Zachariae, Ulrich
    Gilbert, Ian
    Bodkin, Mike
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 2024, 45 (20) : 1762 - 1778
  • [9] Comparative evaluation of five scoring functions for accurate prediction of protein-ligand binding energy.
    Puvanendrampillai, D
    Marsden, PM
    Mitchell, JBO
    Glen, RC
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2004, 227 : U1018 - U1018
  • [10] Scoring Functions for Protein-Ligand Binding Affinity Prediction Using Structure-based Deep Learning: A Review
    Meli, Rocco
    Morris, Garrett M.
    Biggin, Philip C.
    FRONTIERS IN BIOINFORMATICS, 2022, 2