Bayesian segmentation of protein secondary structure

被引:93
|
作者
Schmidler, SC
Liu, JS
Brutlag, DL
机构
[1] Stanford Univ, Sch Med, Sect Med Informat, Stanford, CA 94305 USA
[2] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
[3] Stanford Univ, Sch Med, Dept Biochem, Stanford, CA 94305 USA
关键词
protein secondary structure prediction; Bayesian methods; probabilistic modeling;
D O I
10.1089/10665270050081496
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
We present a novel method for predicting the secondary structure of a protein from its amino acid sequence, Most existing methods predict each position in turn based on a local window of residues, sliding this window along the length of the sequence. In contrast, we develop a probabilistic model of protein sequence/structure relationships in terms of structural segments, and formulate secondary structure prediction as a general Bayesian inference problem, A distinctive feature of our approach is the ability to develop explicit probabilistic models for alpha-helices, beta-strands, and other classes of secondary structure, incorporating experimentally and empirically observed aspects of protein structure such as helical capping signals, side chain correlations, and segment length distributions. Our model is Markovian in the segments, permitting efficient exact calculation of the posterior probability distribution over all possible segmentations of the sequence using dynamic programming. The optimal segmentation is computed and compared to a predictor based on marginal posterior modes, and the latter is shown to provide significant improvement in predictive accuracy. The marginalization procedure provides exact secondary structure probabilities at each sequence position, which are shown to be reliable estimates of prediction uncertainty. We apply this model to a database of 452 nonhomologous structures, achieving accuracies as high as the best currently available methods. We conclude by discussing an extension of this framework to model nonlocal interactions in protein structures, providing a possible direction for future improvements in secondary structure prediction accuracy.
引用
收藏
页码:233 / 248
页数:16
相关论文
共 50 条
  • [1] An architecture combining Bayesian segmentation and neural network ensembles for protein secondary structure prediction
    Bidargaddi, NP
    Chetty, M
    Kamruzzaman, J
    Proceedings of the 2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, 2005, : 498 - 505
  • [2] Bayesian segmentation using residue proximity for secondary structure and contact prediction
    Bidargaddi, N. P.
    Chetty, M.
    Kamruzzaman, J.
    PROCEEDINGS OF THE 2006 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2006, : 79 - +
  • [3] Bayesian neural networks for prediction of protein secondary structure
    Shao, JL
    Xu, D
    Wang, LZ
    Wang, YF
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2005, 3584 : 544 - 551
  • [4] Protein secondary structure prediction with Bayesian learning method
    Wang, PL
    Zhang, D
    14TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2002, : 252 - 257
  • [5] Bayesian Model of Protein Primary Sequence for Secondary Structure Prediction
    Li, Qiwei
    Dahl, David B.
    Vannucci, Marina
    Joo, Hyun
    Tsai, Jerry W.
    PLOS ONE, 2014, 9 (10):
  • [6] A dynamic Bayesian network approach to protein secondary structure prediction
    Xin-Qiu Yao
    Huaiqiu Zhu
    Zhen-Su She
    BMC Bioinformatics, 9
  • [7] A dynamic Bayesian network approach to protein secondary structure prediction
    Yao, Xin-Qiu
    Zhu, Huaiqiu
    She, Zhen-Su
    BMC BIOINFORMATICS, 2008, 9 (1)
  • [8] Bayesian protein secondary structure prediction with near-optimal segmentations
    Aydin, Zafer
    Altunbasak, Yucel
    Erdogan, Hakan
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2007, 55 (07) : 3512 - 3525
  • [9] Bayesian network multi-classifiers for protein secondary structure prediction
    Robes, V
    Larrañaga, P
    Peña, JM
    Menasalvas, E
    Pérez, MS
    Herves, V
    Wasilewska, A
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2004, 31 (02) : 117 - 136
  • [10] PSSNet-An Accurate Super-Secondary Structure for Protein Segmentation
    Petrovsky, Denis V.
    Rudnev, Vladimir R.
    Nikolsky, Kirill S.
    Kulikova, Liudmila I.
    Malsagova, Kristina M.
    Kopylov, Arthur T.
    Kaysheva, Anna L.
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2022, 23 (23)