Co-evolution Transformer for Protein Contact Prediction

Cited by: 0
Authors
Zhang, He [1]
Ju, Fusong [2]
Zhu, Jianwei [2]
He, Liang [2]
Shao, Bin [2]
Zheng, Nanning [1]
Liu, Tie-Yan [2]
Affiliations
[1] Xi'an Jiaotong University, Xi'an, People's Republic of China
[2] Microsoft Research Asia, Beijing, People's Republic of China
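Funding
National Key Research and Development Program of China;
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract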
Proteins are the main machinery of life, and protein functions are largely determined by their 3D structures. The pairwise proximity between the amino acids of a protein, known as the inter-residue contact map, characterizes the structural information of a protein well. Protein contact prediction (PCP) is an essential building block of many protein-structure-related applications. The prevalent approach to contact prediction estimates inter-residue contacts using hand-crafted co-evolutionary features derived from multiple sequence alignments (MSAs). To mitigate the information loss caused by hand-crafted features, some recently proposed methods try to learn residue co-evolutions directly from MSAs. These methods generally derive co-evolutionary features by aggregating the learned residue representations from individual sequences with equal weights, which is inconsistent with the premise that residue co-evolutions reflect the collective covariation patterns of numerous homologous proteins. Moreover, non-homologous residues and gaps commonly exist in MSAs. When features from all homologs are aggregated equally, the non-homologous information may cause misestimation of the residue co-evolutions. To overcome these issues, we propose an attention-based architecture, Co-evolution Transformer (CoT), for PCP. CoT jointly considers the information from all homologous sequences in the MSA to better capture global co-evolutionary patterns. To mitigate the influence of non-homologous information, CoT selectively aggregates the features from different homologs by assigning smaller weights to non-homologous sequences or residue pairs. Extensive experiments on two rigorous benchmark datasets demonstrate the effectiveness of CoT. In particular, CoT achieves a 51.6% top-L long-range precision score for the Free Modeling (FM) domains on the CASP14 benchmark, which outperforms the winning group of the CASP14 contact prediction challenge by 9.8%.
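The abstract reports results as top-L long-range precision on a contact map without spelling out the metric. The following is a minimal sketch of how such a score is commonly computed, not the authors' evaluation code. It assumes the conventional definitions, none of which are stated in the abstract: two residues are in contact when their distance is below 8 angstroms, a pair is long-range when its sequence separation is at least 24, and L is the sequence length. The function names and parameters are hypothetical.

```python
import numpy as np


def contact_map(coords, threshold=8.0):
    """Binary contact map from per-residue 3D coordinates.

    Assumption: residues i and j are in contact when their Euclidean
    distance is below `threshold` angstroms (8.0 is the usual convention).
    """
    coords = np.asarray(coords, dtype=float)
    diff = coords[:, None, :] - coords[None, :, :]
    dist = np.linalg.norm(diff, axis=-1)
    return dist < threshold


def top_l_long_range_precision(pred, true_contacts, min_sep=24):
    """Fraction of the L highest-scoring long-range pairs that are true contacts.

    pred          : (L, L) array of predicted contact scores
    true_contacts : (L, L) boolean array of native contacts
    min_sep       : minimum sequence separation |i - j| for a long-range pair
                    (24 is assumed here, following common practice)
    """
    pred = np.asarray(pred, dtype=float)
    true_contacts = np.asarray(true_contacts, dtype=bool)
    L = pred.shape[0]

    # Upper-triangle index pairs with j - i >= min_sep.
    i, j = np.triu_indices(L, k=min_sep)
    if i.size == 0:
        return float("nan")  # sequence too short to have long-range pairs

    # Keep the L best-scoring pairs (or all of them if fewer than L exist).
    k = min(L, i.size)
    order = np.argsort(-pred[i, j], kind="stable")[:k]
    return float(true_contacts[i[order], j[order]].mean())
```

Under these assumptions, a score of 51.6% would mean that roughly half of the L most confident long-range predictions for a domain are native contacts.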
Pages: 12