A survey on algorithms for protein contact prediction

被引:0
|
作者
Zhang H. [1 ,2 ]
Gao Y. [3 ]
Deng M. [3 ,4 ,5 ]
Zheng W. [6 ]
Bu D. [1 ]
机构
[1] Institute of Computing Technology, Chinese Academy of Sciences, Beijing
[2] University of Chinese Academy of Sciences, Beijing
[3] Centre for Quantitative Biology, Peking University, Beijing
[4] School of Mathematical Sciences, Peking University, Beijing
[5] Center for Statistical Sciences, Peking University, Beijing
[6] Institute of Theoretical Physics, Chinese Academy of Sciences, Beijing
来源
Gao, Yujuan (lacus2009@163.com) | 1600年 / Science Press卷 / 54期
基金
中国国家自然科学基金;
关键词
Co-evolution; Graphical model; Machine learning; Protein contact prediction; Protein tertiary structure prediction;
D O I
10.7544/issn1000-1239.2017.20151076
中图分类号
学科分类号
摘要
Proteins are large molecules consisting of a linear sequence of amino acids. In the natural environment, a protein spontaneously folds into specific tertiary structure to perform its biological functionality. The main factors that drive proteins to fold are interactions between residues, including hydrophobic interaction, Van der Waals' force and electrostatic interactions. The interactions between residues usually lead to residue-residue contacts, and the prediction of residue-residue contacts should greatly facilitate understanding of protein structures and functionalities. A great variety of techniques have been proposed for residue-residue contacts prediction, including machine learning, statistical models, and linear programing. It should be pointed out that most of these techniques are based on the biological insight of co-evolution, i.e., during the evolutionary history of proteins, a residue's mutation usually leads its contacting partner to mutate accordingly. In this review, we summarize the state-of-art algorithms in this field with emphasis on the construction of statistical models based on biological insights. We also present the evaluation of these algorithms using CASP (critical assessment of techniques for protein structure prediction) targets as well as popular benchmark datasets, and describe the trends in the field of protein contact prediction. © 2017, Science Press. All right reserved.
引用
收藏
页码:1 / 19
页数:18
相关论文
共 49 条
  • [1] Lodish H.F., Berk A., Zipursky S.L., Et al., Molecular Cell Biology, (2000)
  • [2] Petsko G.A., Ringe D., Protein Structure and Function, (2004)
  • [3] Wuthrich K., The way to NMR structures of proteins, Nature Structural & Molecular Biology, 8, 11, pp. 923-925, (2001)
  • [4] Kendrew J.C., Bodo G., Dintzis H.M., Et al., A three-dimensional model of the myoglobin molecule obtained by X-ray analysis, Nature, 181, 4610, pp. 662-666, (1958)
  • [5] Taylor K.A., Glaeser R.M., Electron diffraction of frozen, hydrated protein crystals, Science, 186, 4168, pp. 1036-1037, (1974)
  • [6] Marks D.S., Hopf T.A., Sander C., Protein structure prediction from sequence variation, Nature Biotechnology, 30, 11, pp. 1072-1080, (2012)
  • [7] Anfinsen C.B., Principles that govern the folding of protein chains, Science, 181, 4096, pp. 223-230, (1973)
  • [8] Kim D.E., Dimaio F., Wang R.Y., Et al., One contact for every twelve residues allows robust and accurate topology-level protein structure modeling, Proteins: Structure, Function, and Bioinformatics, 82, pp. 208-218, (2014)
  • [9] Haile J.M., Molecular Dynamics Simulation, (1992)
  • [10] Anzai Y., Pattern Recognition and Machine Learning, (2012)