Predicting protein disorder by analyzing amino acid sequence

Cited by: 5
Authors
Yang, Jack Y. [1 ]
Yang, Mary Qu [2 ]
Affiliations
[1] Harvard Univ, Sch Med, Cambridge, MA 02115 USA
[2] Natl Human Genome Res Inst, Natl Inst Hlth, Bethesda, MD 20852 USA
Keywords
Feature Selection; Protein Data Bank; Class Label; Test Instance; Window Length;
DOI
10.1186/1471-2164-9-S2-S8
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Discipline classification codes
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
Background: Many protein regions and some entire proteins have no definite tertiary structure, presenting instead as dynamic, disordered ensembles under different physicochemical circumstances. These proteins and regions are known as Intrinsically Unstructured Proteins (IUP). IUP have been associated with a wide range of protein functions, along with roles in diseases characterized by protein misfolding and aggregation. Results: Identifying IUP is an important task in structural and functional genomics. We extract useful features from sequences and develop machine learning algorithms for this task. We compare our IUP predictor with PONDRs (mainly neural-network-based predictors), DisEMBL (also based on neural networks) and GlobPlot (based on disorder propensity). Conclusion: We find that augmenting the feature set with features derived from physicochemical properties of amino acids (such as hydrophobicity and complexity) and using an ensemble method both prove beneficial. The IUP predictor is a viable alternative software tool for identifying intrinsically unstructured regions and proteins.
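The abstract mentions features derived from physicochemical properties such as hydrophobicity and complexity, and the keywords mention a window length. As an illustration only, the sketch below computes two such per-residue features over a sliding window. It is not the authors' feature set or code: the Kyte-Doolittle hydropathy scale, Shannon entropy as the complexity measure, the function name `window_features`, the default window of 5, and the score of 0.0 for unknown residues are all assumptions made for this example.

```python
import math
from collections import Counter

# Kyte-Doolittle hydropathy scale (an assumed choice; the record does not
# say which hydrophobicity scale the authors used).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def window_features(seq, window=5):
    """Return one (mean hydropathy, Shannon entropy) pair per residue.

    Each pair is computed over a window centered on the residue and
    truncated at the sequence ends. Hypothetical helper for illustration.
    """
    half = window // 2
    features = []
    for i in range(len(seq)):
        win = seq[max(0, i - half): i + half + 1]
        # Unknown residues (e.g. "X") contribute 0.0; this is an assumption.
        hydropathy = sum(KYTE_DOOLITTLE.get(a, 0.0) for a in win) / len(win)
        counts = Counter(win)
        # Shannon entropy in bits: low values indicate low-complexity windows.
        entropy = -sum(
            (c / len(win)) * math.log2(c / len(win)) for c in counts.values()
        )
        features.append((hydropathy, entropy))
    return features
```

Feature vectors of this kind could then be passed to any classifier, one instance per residue; low hydropathy and low entropy are commonly associated with disorder, but the thresholds and the learning method are left open here.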
Pages: 8