A novel approach for protein subcellular location prediction using amino acid exposure

Cited by: 10
Authors
Mer, Arvind Singh [1]
Andrade-Navarro, Miguel A. [1]
Affiliation
[1] Max Delbruck Ctr Mol Med, D-13125 Berlin, Germany
Source
BMC BIOINFORMATICS | 2013 / Vol. 14
Keywords
SOLVENT ACCESSIBILITY; SECONDARY STRUCTURE; LOCALIZATION; SEQUENCE;
DOI
10.1186/1471-2105-14-342
Chinese Library Classification
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: Proteins perform their functions in associated cellular locations. Therefore, the study of protein function can be facilitated by predictions of protein location. Protein location can be predicted either from the sequence of a protein alone, by identification of targeting peptide sequences and motifs, or by homology to proteins of known location. A third, complementary approach exploits the differences in amino acid composition of proteins associated with different cellular locations, and can be useful if motif and homology information are missing. Here we expand this approach by taking into account amino acid composition at different levels of amino acid exposure.
Results: Our method has two stages. In stage one, we trained multiple Support Vector Machines (SVMs) to score eukaryotic protein sequences for membership in each of three categories (nuclear, cytoplasmic and extracellular) plus an extra category, nucleocytoplasmic, which accounts for the fact that a large number of proteins shuttle between the nucleus and the cytoplasm. In stage two, we use an artificial neural network (ANN) to propose a category from the scores given to the four locations in stage one. The method reaches an accuracy of 68% when using 3D-derived values of amino acid exposure as input. Calibration of the method using predicted values of amino acid exposure allows classifying proteins without 3D information with an accuracy of 62%, and discerning proteins in different locations even if they share high levels of identity.
Conclusions: In this study we explored the relationship between residue exposure and protein subcellular location. We developed a new algorithm for subcellular location prediction that uses residue exposure signatures. Our algorithm uses a novel approach to address the multiclass classification problem.
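The idea of "amino acid composition at different levels of amino acid exposure" can be illustrated with a small feature-extraction sketch. This is not the authors' implementation: the function name `exposure_composition`, the use of relative exposure values in the range 0 to 1, and the bin thresholds 0.2 and 0.5 are all assumptions made for illustration, since the abstract does not state how exposure levels are defined. The sketch assigns each residue to an exposure bin and computes a 20-value amino acid composition per bin, giving one feature vector that a classifier such as an SVM could take as input.

```python
from collections import Counter

# The 20 standard amino acids, in a fixed order so feature positions are stable.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def exposure_composition(sequence, exposure, thresholds=(0.2, 0.5)):
    """Return amino acid composition per exposure bin as one flat list.

    sequence   -- protein sequence in one-letter code
    exposure   -- one relative exposure value per residue (assumed 0..1)
    thresholds -- hypothetical bin boundaries; not taken from the paper
    """
    if len(sequence) != len(exposure):
        raise ValueError("sequence and exposure must have the same length")

    n_bins = len(thresholds) + 1
    counts = [Counter() for _ in range(n_bins)]

    for residue, value in zip(sequence, exposure):
        # The bin index is the number of thresholds the value reaches.
        bin_index = sum(value >= t for t in thresholds)
        counts[bin_index][residue] += 1

    features = []
    for bin_counts in counts:
        # Normalise within the bin; an empty bin contributes zeros.
        total = sum(bin_counts[aa] for aa in AMINO_ACIDS)
        for aa in AMINO_ACIDS:
            features.append(bin_counts[aa] / total if total else 0.0)
    return features


if __name__ == "__main__":
    # Toy input: two buried residues and two exposed ones.
    vector = exposure_composition("ACAK", [0.1, 0.1, 0.6, 0.9])
    print(len(vector))  # 3 bins x 20 amino acids
```

With three bins the vector has 60 entries. Normalising within each bin keeps the buried and exposed compositions comparable across proteins of different length, which is one possible design choice among several.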
Pages: 13