Tertiary alphabet for the observable protein structural universe

被引:42
|
作者
Mackenzie, Craig O. [1 ]
Zhou, Jianfu [2 ]
Grigoryan, Gevorg [1 ,2 ,3 ]
机构
[1] Dartmouth Coll, Inst Quantitat Biomed Sci, Hanover, NH 03755 USA
[2] Dartmouth Coll, Dept Comp Sci, Hanover, NH 03755 USA
[3] Dartmouth Coll, Dept Biol Sci, Hanover, NH 03755 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
tertiary motif; structural degeneracy; protein structural universe; sequence-structure relationships; structural modularity; COMPUTATIONAL DESIGN; STRUCTURE LIBRARY; ROTAMER LIBRARY; LOCAL-STRUCTURE; COILED-COIL; AMINO-ACIDS; BETA-SHEETS; SEQUENCE; MOTIFS; MODEL;
D O I
10.1073/pnas.1607178113
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Here, we systematically decompose the known protein structural universe into its basic elements, which we dub tertiary structural motifs (TERMs). A TERM is a compact backbone fragment that captures the secondary, tertiary, and quaternary environments around a given residue, comprising one or more disjoint segments (three on average). We seek the set of universal TERMs that capture all structure in the Protein Data Bank (PDB), finding remarkable degeneracy. Only similar to 600 TERMs are sufficient to describe 50% of the PDB at sub-Angstrom resolution. However, more rare geometries also exist, and the overall structural coverage grows logarithmically with the number of TERMs. We go on to show that universal TERMs provide an effective mapping between sequence and structure. We demonstrate that TERM-based statistics alone are sufficient to recapitulate close-to-native sequences given either NMR or X-ray backbones. Furthermore, sequence variability predicted from TERM data agrees closely with evolutionary variation. Finally, locations of TERMs in protein chains can be predicted from sequence alone based on sequence signatures emergent from TERM instances in the PDB. For multisegment motifs, this method identifies spatially adjacent fragments that are not contiguous in sequence-a major bottleneck in structure prediction. Although all TERMs recur in diverse proteins, some appear specialized for certain functions, such as interface formation, metal coordination, or even water binding. Structural biology has benefited greatly from previously observed degeneracies in structure. The decomposition of the known structural universe into a finite set of compact TERMs offers exciting opportunities toward better understanding, design, and prediction of protein structure.
引用
收藏
页码:E7438 / E7447
页数:10
相关论文
共 50 条
  • [41] Ab initio estimates of the size of the observable universe
    Page, Don N.
    JOURNAL OF COSMOLOGY AND ASTROPARTICLE PHYSICS, 2011, (09):
  • [42] Could the observable universe be inside of a black hole?
    Doran, R
    Crawford, P
    ASTROPHYSICS AND SPACE SCIENCE, 1998, 261 (1-4) : 301 - 302
  • [43] OBSERVABLE DEVIATIONS FROM HOMOGENEITY IN AN INHOMOGENEOUS UNIVERSE
    Giblin, John T., Jr.
    Mertens, James B.
    Starkman, Glenn D.
    ASTROPHYSICAL JOURNAL, 2016, 833 (02):
  • [44] Could the Observable Universe Be Inside of a Black Hole?
    Rosa Doran
    Paulo Crawford
    Astrophysics and Space Science, 1998, 261 : 301 - 302
  • [45] SAFlex: A structural alphabet extension to integrate protein structural flexibility and missing data information
    Allam, Ikram
    Flatters, Delphine
    Caumes, Geraldine
    Regad, Leslie
    Delos, Vincent
    Nuel, Gregory
    Camproux, Anne-Claude
    PLOS ONE, 2018, 13 (07):
  • [46] A structural-alphabet-based strategy for finding structural motifs across protein families
    Wu, Chih Yuan
    Chen, Yao Chi
    Lim, Carmay
    NUCLEIC ACIDS RESEARCH, 2010, 38 (14) : e150 - e150
  • [47] Trends in structural coverage of the protein universe and the impact of the Protein Structure Initiative
    Khafizov, Kamil
    Madrid-Aliste, Carlos
    Almo, Steven C.
    Fiser, Andras
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2014, 111 (10) : 3733 - 3738
  • [48] The impact of structural diversity and parameterization on maps of the protein universe
    Daniel Asarnow
    Rahul Singh
    BMC Proceedings, 7 (Suppl 7)
  • [49] Exploration of the Protein Universe with High Throughput Structural Biology
    Wilson, Ian A.
    ACTA CRYSTALLOGRAPHICA A-FOUNDATION AND ADVANCES, 2011, 67 : C8 - C9
  • [50] STIMULATED CREATION OF QUANTA DURING INFLATION AND THE OBSERVABLE UNIVERSE
    Agullo, Ivan
    Parker, Leonard
    INTERNATIONAL JOURNAL OF MODERN PHYSICS D, 2011, 20 (14): : 2861 - 2866