Bayesian segmentation of protein secondary structure

被引:93
|
作者
Schmidler, SC
Liu, JS
Brutlag, DL
机构
[1] Stanford Univ, Sch Med, Sect Med Informat, Stanford, CA 94305 USA
[2] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
[3] Stanford Univ, Sch Med, Dept Biochem, Stanford, CA 94305 USA
关键词
protein secondary structure prediction; Bayesian methods; probabilistic modeling;
D O I
10.1089/10665270050081496
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
We present a novel method for predicting the secondary structure of a protein from its amino acid sequence, Most existing methods predict each position in turn based on a local window of residues, sliding this window along the length of the sequence. In contrast, we develop a probabilistic model of protein sequence/structure relationships in terms of structural segments, and formulate secondary structure prediction as a general Bayesian inference problem, A distinctive feature of our approach is the ability to develop explicit probabilistic models for alpha-helices, beta-strands, and other classes of secondary structure, incorporating experimentally and empirically observed aspects of protein structure such as helical capping signals, side chain correlations, and segment length distributions. Our model is Markovian in the segments, permitting efficient exact calculation of the posterior probability distribution over all possible segmentations of the sequence using dynamic programming. The optimal segmentation is computed and compared to a predictor based on marginal posterior modes, and the latter is shown to provide significant improvement in predictive accuracy. The marginalization procedure provides exact secondary structure probabilities at each sequence position, which are shown to be reliable estimates of prediction uncertainty. We apply this model to a database of 452 nonhomologous structures, achieving accuracies as high as the best currently available methods. We conclude by discussing an extension of this framework to model nonlocal interactions in protein structures, providing a possible direction for future improvements in secondary structure prediction accuracy.
引用
收藏
页码:233 / 248
页数:16
相关论文
共 50 条
  • [41] An Approach to Developing Benchmark Datasets for Protein Secondary Structure Segmentation from Cryo-EM Density Maps
    Nguyen, Thu
    Mu, Yongcheng
    Sun, Jiangwen
    He, Jing
    14TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, BCB 2023, 2023,
  • [42] Protein secondary structure pattern discovery and its application in secondary structure prediction
    Li, MH
    Wang, XL
    Lin, L
    Guan, Y
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1435 - 1440
  • [43] An Approach for RNA Secondary Structure Prediction Based on Bayesian Network
    Wu, Tianhua
    Deng, Zhidong
    Song, Dandan
    CIBCB: 2009 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2009, : 24 - 30
  • [44] Hierarchical junction trees as the secondary structure for inference in Bayesian networks
    Wu, Dan
    Wu, Libing
    SNPD 2007: EIGHTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING, AND PARALLEL/DISTRIBUTED COMPUTING, VOL 3, PROCEEDINGS, 2007, : 706 - +
  • [45] Prediction of secondary structure of proteins on the basis of Bayesian recognition procedures
    Beletskiy, Boris A.
    Vasilyev, Sergey V.
    Gupal, Anatoliy M.
    Journal of Automation and Information Sciences, 2007, 39 (02) : 1 - 9
  • [46] Bayesian contour segmentation
    Feldman, J
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 1999, 40 (04) : S780 - S780
  • [47] HMM in Predicting Protein Secondary Structure
    Huang Jing
    WuhanUniversityJournalofNaturalSciences, 2003, (S1) : 307 - 310
  • [48] PROTEIN SECONDARY STRUCTURE - ANALYSIS AND PREDICTION
    HIDER, RC
    HODGES, SJ
    BIOCHEMICAL EDUCATION, 1984, 12 (01): : 9 - 18
  • [49] Binary coding of the secondary protein structure
    Stambuk, N
    Konjevoda, P
    PERIODICUM BIOLOGORUM, 2005, 107 (04) : 393 - 396
  • [50] SECONDARY STRUCTURE OF UNFOLDED PROTEIN CHAIN
    FINKELSTEIN, AV
    BIOORGANICHESKAYA KHIMIYA, 1978, 4 (03): : 346 - 348