FINDSITE-metal: Integrating evolutionary information and machine learning for structure-based metal-binding site prediction at the proteome level

被引:79
|
作者
Brylinski, Michal [1 ]
Skolnick, Jeffrey [1 ]
机构
[1] Georgia Inst Technol, Ctr Study Syst Biol, Atlanta, GA 30318 USA
基金
美国国家卫生研究院;
关键词
metalloproteins; metal binding residue prediction; protein threading; protein structure prediction; human proteome; machine learning; SCORING FUNCTION; 3; DOMAINS; SEQUENCE; PROTEINS; IDENTIFICATION; ALIGNMENT; GEOMETRY; RESIDUES; IONS; RECOGNITION;
D O I
10.1002/prot.22913
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The rapid accumulation of gene sequences, many of which are hypothetical proteins with unknown function, has stimulated the development of accurate computational tools for protein function prediction with evolution/structure-based approaches showing considerable promise. In this article, we present FINDSITE-metal, a new threading-based method designed specifically to detect metal-binding sites in modeled protein structures. Comprehensive benchmarks using different quality protein structures show that weakly homologous protein models provide sufficient structural information for quite accurate annotation by FINDSITE-metal. Combining structure/evolutionary information with machine learning results in highly accurate metal-binding annotations; for protein models constructed by TASSER, whose average C alpha RMSD from the native structure is 8.9 angstrom, 59.5% (71.9%) of the best of top five predicted metal locations are within 4 angstrom (8 angstrom) from a bound metal in the crystal structure. For most of the targets, multiple metal-binding sites are detected with the best predicted binding site at rank 1 and within the top two ranks in 65.6% and 83.1% of the cases, respectively. Furthermore, for iron, copper, zinc, calcium, and magnesium ions, the binding metal can be predicted with high, typically 70% to 90%, accuracy. FINDSITE-metal also provides a set of confidence indexes that help assess the reliability of predictions. Finally, we describe the proteome-wide application of FINDSITE-metal that quantifies the metal-binding complement of the human proteome. FINDSITE-metal is freely available to the academic community at http://cssb.biology.gatech.edu/findsite-metal/.
引用
收藏
页码:735 / 751
页数:17
相关论文
共 15 条
  • [1] Co-evolution-based prediction of metal-binding sites in proteomes by machine learning
    Cheng, Yao
    Wang, Haobo
    Xu, Hua
    Liu, Yuan
    Ma, Bin
    Chen, Xuemin
    Zeng, Xin
    Wang, Xianghe
    Wang, Bo
    Shiau, Carina
    Ovchinnikov, Sergey
    Su, Xiao-Dong
    Wang, Chu
    NATURE CHEMICAL BIOLOGY, 2023, 19 (05) : 548 - +
  • [2] Co-evolution-based prediction of metal-binding sites in proteomes by machine learning
    Yao Cheng
    Haobo Wang
    Hua Xu
    Yuan Liu
    Bin Ma
    Xuemin Chen
    Xin Zeng
    Xianghe Wang
    Bo Wang
    Carina Shiau
    Sergey Ovchinnikov
    Xiao-Dong Su
    Chu Wang
    Nature Chemical Biology, 2023, 19 : 548 - 555
  • [3] mFASD: a structure-based algorithm for discriminating different types of metal-binding sites
    He, Wei
    Liang, Zhi
    Teng, Maikun
    Niu, Liwen
    BIOINFORMATICS, 2015, 31 (12) : 1938 - 1944
  • [4] A Comprehensive Review of Computation-Based Metal-Binding Prediction Approaches at the Residue Level
    Ye, Nan
    Zhou, Feng
    Liang, Xingchen
    Chai, Haiting
    Fan, Jianwei
    Li, Bo
    Zhang, Jian
    BIOMED RESEARCH INTERNATIONAL, 2022, 2022
  • [5] ScanNet: an interpretable geometric deep learning model for structure-based protein binding site prediction
    Jérôme Tubiana
    Dina Schneidman-Duhovny
    Haim J. Wolfson
    Nature Methods, 2022, 19 (6) : 730 - 739
  • [6] ScanNet: an interpretable geometric deep learning model for structure-based protein binding site prediction
    Tubiana, Jerome
    Schneidman-Duhovny, Dina
    Wolfson, Haim J.
    NATURE METHODS, 2022, 19 (06) : 730 - +
  • [7] Prediction of peptide binding to MHC using machine learning with sequence and structure-based feature sets
    Aranha, Michelle P.
    Spooner, Catherine
    Demerdash, Omar
    Czejdo, Bogdan
    Smith, Jeremy C.
    Mitchell, Julie C.
    BIOCHIMICA ET BIOPHYSICA ACTA-GENERAL SUBJECTS, 2020, 1864 (04):
  • [8] MetalPrognosis: A Biological Language Model-Based Approach for Disease-Associated Mutations in Metal-Binding Site Prediction
    Jia, Runchang
    He, Zhijie
    Wang, Cong
    Guo, Xudong
    Li, Fuyi
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2024, 21 (06) : 2340 - 2348
  • [9] Machine-learning scoring functions to improve structure-based binding affinity prediction and virtual screening
    Ain, Qurrat Ul
    Aleksandrova, Antoniya
    Roessler, Florian D.
    Ballester, Pedro J.
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE, 2015, 5 (06) : 405 - 424
  • [10] DNABind: A hybrid algorithm for structure-based prediction of DNA-binding residues by combining machine learning- and template-based approaches
    Liu, Rong
    Hu, Jianjun
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2013, 81 (11) : 1885 - 1899