Using Maximum Entropy Model to Predict Protein Secondary Structure with Single Sequence

被引:21
|
作者
Ding, Yong-Sheng [1 ,2 ]
Zhang, Tong-Liang [1 ]
Gu, Quan [1 ]
Zhao, Pei-Ying [1 ]
Chou, Kuo-Chen [3 ]
机构
[1] Donghua Univ, Coll Informat Sci & Technol, Shanghai 201620, Peoples R China
[2] Minist Educ, Engn Res Ctr Digitized Text & Fash Technol, Shanghai 201620, Peoples R China
[3] Gordon Life Sci Inst, San Diego, CA 92130 USA
来源
PROTEIN AND PEPTIDE LETTERS | 2009年 / 16卷 / 05期
关键词
Protein secondary structure; single-sequence prediction method; protein structural classes; maximum entropy model; AMINO-ACID-COMPOSITION; SUPPORT VECTOR MACHINES; FUNCTIONAL DOMAIN COMPOSITION; SUBCELLULAR LOCATION PREDICTION; NEURAL-NETWORK METHOD; BETA-TURN TYPES; MEMBRANE-PROTEINS; EVOLUTIONARY INFORMATION; HYBRIDIZATION SPACE; PATTERN-RECOGNITION;
D O I
10.2174/092986609788167833
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Prediction of protein secondary structure is somewhat reminiscent of the efforts by many previous investigators but yet still worthy of revisiting it owing to its importance in protein science. Several studies indicate that the knowledge of protein structural classes can provide useful information towards the determination of protein secondary structure. Particularly, the performance of prediction algorithms developed recently have been improved rapidly by incorporating homologous multiple sequences alignment information. Unfortunately, this kind of information is not available for a significant amount of proteins. In view of this, it is necessary to develop the method based on the query protein sequence alone, the so-called single-sequence method. Here, we propose a novel single-sequence approach which is featured by that various kinds of contextual information are taken into account, and that a maximum entropy model classifier is used as the prediction engine. As a demonstration, cross-validation tests have been performed by the new method on datasets containing proteins from different structural classes, and the results thus obtained are quite promising, indicating that the new method may become an useful tool in protein science or at least play a complementary role to the existing protein secondary structure prediction methods.
引用
收藏
页码:552 / 560
页数:9
相关论文
共 50 条
  • [1] Protein secondary structure predict based on the path with the maximum weight
    Luo, Liang
    Shao, Zehui
    2009 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INTELLIGENT SYSTEMS, PROCEEDINGS, VOL 3, 2009, : 299 - +
  • [2] A seqlet-based maximum entropy Markov approach for protein secondary structure prediction
    Dong, QW
    Wang, XL
    Lin, L
    Guan, Y
    SCIENCE IN CHINA SERIES C-LIFE SCIENCES, 2005, 48 (04): : 394 - 405
  • [4] A seqlet-based maximum entropy Markov approach for protein secondary structure prediction
    Qiwen Dong
    Xiaolong Wang
    Lei Lin
    Yi Guan
    Science in China Series C: Life Sciences, 2005, 48 : 394 - 405
  • [5] Bayesian Model of Protein Primary Sequence for Secondary Structure Prediction
    Li, Qiwei
    Dahl, David B.
    Vannucci, Marina
    Joo, Hyun
    Tsai, Jerry W.
    PLOS ONE, 2014, 9 (10):
  • [6] Predict the Suitable Places to Run in the Urban Area of Beijing by Using the Maximum Entropy Model
    Song, Liuyi
    Zhang, An
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2021, 10 (08)
  • [7] Single-sequence protein structure prediction using a language model and deep learning
    Ratul Chowdhury
    Nazim Bouatta
    Surojit Biswas
    Christina Floristean
    Anant Kharkar
    Koushik Roy
    Charlotte Rochereau
    Gustaf Ahdritz
    Joanna Zhang
    George M. Church
    Peter K. Sorger
    Mohammed AlQuraishi
    Nature Biotechnology, 2022, 40 : 1617 - 1623
  • [8] Protein secondary structure prediction for a single-sequence using hidden semi-Markov models
    Aydin, Zafer
    Altunbasak, Yucel
    Borodovsky, Mark
    BMC BIOINFORMATICS, 2006, 7 (1)
  • [9] Protein secondary structure prediction for a single-sequence using hidden semi-Markov models
    Zafer Aydin
    Yucel Altunbasak
    Mark Borodovsky
    BMC Bioinformatics, 7
  • [10] Protein secondary structure and sequence characters
    Nakata, K
    PROGRESS IN BIOPHYSICS & MOLECULAR BIOLOGY, 1996, 65 : PA438 - PA438