Predicting protein disorder by analyzing amino acid sequence

被引:5
|
作者
Yang, Jack Y. [1 ]
Yang, Mary Qu [2 ]
机构
[1] Harvard Univ, Sch Med, Cambridge, MA 02115 USA
[2] Natl Human Genome Res Inst, Natl Inst Hlth, Bethesda, MD 20852 USA
关键词
Feature Selection; Protein Data Bank; Class Label; Test Instance; Window Length;
D O I
10.1186/1471-2164-9-S2-S8
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Many protein regions and some entire proteins have no definite tertiary structure, presenting instead as dynamic, disorder ensembles under different physiochemical circumstances. These proteins and regions are known as Intrinsically Unstructured Proteins (IUP). IUP have been associated with a wide range of protein functions, along with roles in diseases characterized by protein misfolding and aggregation. Results: Identifying IUP is important task in structural and functional genomics. We exact useful features from sequences and develop machine learning algorithms for the above task. We compare our IUP predictor with PONDRs (mainly neural-network-based predictors), disEMBL (also based on neural networks) and Globplot (based on disorder propensity). Conclusion: We find that augmenting features derived from physiochemical properties of amino acids (such as hydrophobicity, complexity etc.) and using ensemble method proved beneficial. The IUP predictor is a viable alternative software tool for identifying IUP protein regions and proteins.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Predicting Protein Subcellular Localization: A Multiobjective PSO-based Feature Subset Selection from Amino Acid Sequence of Protein
    Mandal, Monalisa
    Mukhopadhyay, Anirban
    2014 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY (ICIT), 2014, : 251 - 255
  • [42] Predicting protein-protein interactions with pseudo amino acid composition
    Cai, Yu-Dong
    Zhou, Guo-Ping
    BMEI 2008: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS, VOL 1, 2008, : 158 - +
  • [43] Exploiting Amino Acid Composition for Predicting Protein-Protein Interactions
    Roy, Sushmita
    Martinez, Diego
    Platero, Harriett
    Lane, Terran
    Werner-Washburne, Margaret
    PLOS ONE, 2009, 4 (11):
  • [44] Predicting protein subcellular location: exploiting amino acid based sequence of feature spaces and fusion of diverse classifiers
    Khan, Asifullah
    Majid, Abdul
    Choi, Tae-Sun
    AMINO ACIDS, 2010, 38 (01) : 347 - 350
  • [45] Predicting protein subcellular location: exploiting amino acid based sequence of feature spaces and fusion of diverse classifiers
    Asifullah Khan
    Abdul Majid
    Tae-Sun Choi
    Amino Acids, 2010, 38 : 347 - 350
  • [46] ANALYZING AND PREDICTING PROTEIN INTERACTIONS
    OLSON, AJ
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1993, 205 : 112 - AGFD
  • [47] STUDIES ON AMINO ACID SEQUENCE OF TOBACCO MOSAIC VIRUS PROTEIN .6. AMINO ACID SEQUENCE OF TRYPTIC PEPTIDE 6
    TSUNG, CM
    YOUNG, JD
    FUNATSU, G
    ARCHIVES OF BIOCHEMISTRY AND BIOPHYSICS, 1964, 105 (01) : 42 - &
  • [48] A novel conjoint triad auto covariance (CTAC) coding method for predicting protein-protein interaction based on amino acid sequence
    Wang Xue
    Wang Rujing
    Wei Yuanyuan
    Gui Yuanmiao
    MATHEMATICAL BIOSCIENCES, 2019, 313 : 41 - 47
  • [49] Predicting post-absorptive protein and amino acid metabolism
    Hanigan, Mark D.
    White, Robin R.
    Apelo, Sebastian I. Arriola
    Aguilar, Michelle
    Estes, Kari A.
    Myers, Adelyn
    REVISTA BRASILEIRA DE ZOOTECNIA-BRAZILIAN JOURNAL OF ANIMAL SCIENCE, 2018, 47
  • [50] SIFT: predicting amino acid changes that affect protein function
    Ng, PC
    Henikoff, S
    NUCLEIC ACIDS RESEARCH, 2003, 31 (13) : 3812 - 3814