Analysis of deep learning methods for blind protein contact prediction in CASP12

被引:61
|
作者
Wang, Sheng [1 ]
Sun, Siqi [1 ]
Xu, Jinbo [1 ]
机构
[1] Toyota Technol Inst, 6045 S Kenwood Ave, Chicago, IL 60637 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
protein folding; protein contact prediction; CASP; deep learning; coevolution analysis; RESIDUE-RESIDUE CONTACTS; SEQUENCE; EVOLUTIONARY;
D O I
10.1002/prot.25377
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Here we present the results of protein contact prediction achieved in CASP12 by our RaptorX-Contact server, which is an early implementation of our deep learning method for contact prediction. On a set of 38 free-modeling target domains with a median family size of around 58 effective sequences, our server obtained an average top L/5 long- and medium-range contact accuracy of 47% and 44%, respectively (L=length). A complete implementation has an average accuracy of 59% and 57%, respectively. Our deep learning method formulates contact prediction as a pixel-level image labeling problem and simultaneously predicts all residue pairs of a protein using a combination of two deep residual neural networks, taking as input the residue conservation information, predicted secondary structure and solvent accessibility, contact potential, and coevolution information. Our approach differs from existing methods mainly in (1) formulating contact prediction as a pixel-level image labeling problem instead of an image-level classification problem; (2) simultaneously predicting all contacts of an individual protein to make effective use of contact occurrence patterns; and (3) integrating both one-dimensional and two-dimensional deep convolutional neural networks to effectively learn complex sequence-structure relationship including high-order residue correlation. This paper discusses the RaptorX-Contact pipeline, both contact prediction and contact-based folding results, and finally the strength and weakness of our method.
引用
收藏
页码:67 / 77
页数:11
相关论文
共 50 条
  • [21] Deep learning methods in protein structure prediction
    Torrisi, Mirko
    Pollastri, Gianluca
    Le, Quan
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2020, 18 : 1301 - 1310
  • [22] Deep learning methods for protein function prediction
    Boadu, Frimpong
    Lee, Ahhyun
    Cheng, Jianlin
    PROTEOMICS, 2025, 25 (1-2)
  • [23] Simultaneous refinement of inaccurate local regions and overall structure in the CASP12 protein model refinement experiment
    Lee, Gyu Rie
    Heo, Lim
    Seok, Chaok
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2018, 86 : 168 - 176
  • [24] Improving deep learning-based protein distance prediction in CASP14
    Guo, Zhiye
    Wu, Tianqi
    Liu, Jian
    Hou, Jie
    Cheng, Jianlin
    BIOINFORMATICS, 2021, 37 (19) : 3190 - 3196
  • [25] Protein structure model refinement in CASP12 using short and long molecular dynamics simulations in implicit solvent
    Terashi, Genki
    Kihara, Daisuke
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2018, 86 : 189 - 201
  • [26] Combining pairwise structural similarity and deep learning interface contact prediction to estimate protein complex model accuracy in CASP15
    Roy, Raj S.
    Liu, Jian
    Giri, Nabin
    Guo, Zhiye
    Cheng, Jianlin
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2023, 91 (12) : 1889 - 1902
  • [27] Definition and classification of evaluation units for tertiary structure prediction in CASP12 facilitated through semi-automated metrics
    Abriata, Luciano A.
    Kinch, Lisa N.
    Tamo, Giorgio E.
    Monastyrskyy, Bohdan
    Kryshtafovych, Andriy
    Dal Peraro, Matteo
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2018, 86 : 16 - 26
  • [28] Template-based and free modeling of I-TASSER and QUARK pipelines using predicted contact maps in CASP12
    Zhang, Chengxin
    Mortuza, S. M.
    He, Baoji
    Wang, Yanting
    Zhang, Yang
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2018, 86 : 136 - 151
  • [29] Deep learning methods for protein torsion angle prediction
    Haiou Li
    Jie Hou
    Badri Adhikari
    Qiang Lyu
    Jianlin Cheng
    BMC Bioinformatics, 18
  • [30] Protein tertiary structure prediction and refinement using deep learning and Rosetta in CASP14
    Anishchenko, Ivan
    Baek, Minkyung
    Park, Hahnbeom
    Hiranuma, Naozumi
    Kim, David E.
    Dauparas, Justas
    Mansoor, Sanaa
    Humphreys, Ian R.
    Baker, David
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2021, 89 (12) : 1722 - 1733