A novel approach for protein subcellular location prediction using amino acid exposure

被引:10
|
作者
Mer, Arvind Singh [1 ]
Andrade-Navarro, Miguel A. [1 ]
机构
[1] Max Delbruck Ctr Mol Med, D-13125 Berlin, Germany
来源
BMC BIOINFORMATICS | 2013年 / 14卷
关键词
SOLVENT ACCESSIBILITY; SECONDARY STRUCTURE; LOCALIZATION; SEQUENCE;
D O I
10.1186/1471-2105-14-342
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Proteins perform their functions in associated cellular locations. Therefore, the study of protein function can be facilitated by predictions of protein location. Protein location can be predicted either from the sequence of a protein alone by identification of targeting peptide sequences and motifs, or by homology to proteins of known location. A third approach, which is complementary, exploits the differences in amino acid composition of proteins associated to different cellular locations, and can be useful if motif and homology information are missing. Here we expand this approach taking into account amino acid composition at different levels of amino acid exposure. Results: Our method has two stages. For stage one, we trained multiple Support Vector Machines (SVMs) to score eukaryotic protein sequences for membership to each of three categories: nuclear, cytoplasmic and extracellular, plus extra category nucleocytoplasmic, accounting for the fact that a large number of proteins shuttles between those two locations. In stage two we use an artificial neural network (ANN) to propose a category from the scores given to the four locations in stage one. The method reaches an accuracy of 68% when using as input 3D-derived values of amino acid exposure. Calibration of the method using predicted values of amino acid exposure allows classifying proteins without 3D-information with an accuracy of 62% and discerning proteins in different locations even if they shared high levels of identity. Conclusions: In this study we explored the relationship between residue exposure and protein subcellular location. We developed a new algorithm for subcellular location prediction that uses residue exposure signatures. Our algorithm uses a novel approach to address the multiclass classification problem.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Prediction of protein subcellular location using a combined feature of sequence
    Gao, QB
    Wang, ZZ
    Yan, C
    Du, YH
    FEBS LETTERS, 2005, 579 (16): : 3444 - 3448
  • [22] Prediction of protein subcellular locations by support vector machines using compositions of amino acids and amino acid pairs
    Park, KJ
    Kanehisa, M
    BIOINFORMATICS, 2003, 19 (13) : 1656 - 1663
  • [23] Using nonlinear energy operator index as pseudo amino acid compositions for predicting protein subcellular location
    Guo, Xiaoli
    Chen, Xiaoming
    Qiu, Yihong
    Huang, Zhende
    Zhu, Yisheng
    2009 3RD INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING, VOLS 1-11, 2009, : 784 - 787
  • [24] Prediction of Subcellular Localization of Apoptosis Protein Using Chou’s Pseudo Amino Acid Composition
    Hao Lin
    Hao Wang
    Hui Ding
    Ying-Li Chen
    Qian-Zhong Li
    Acta Biotheoretica, 2009, 57 : 321 - 330
  • [25] Prediction of Subcellular Localization of Apoptosis Protein Using Chou's Pseudo Amino Acid Composition
    Lin, Hao
    Wang, Hao
    Ding, Hui
    Chen, Ying-Li
    Li, Qian-Zhong
    ACTA BIOTHEORETICA, 2009, 57 (03) : 321 - 330
  • [26] Prediction of the subcellular location of prokaryotic proteins based on a new representation of the amino acid composition
    Feng, ZP
    BIOPOLYMERS, 2001, 58 (05) : 491 - 499
  • [27] Protein location prediction using atomic composition and global features of the amino acid sequence
    Cherian, Betsy Sheena
    Nair, Achuthsankar S.
    BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2010, 391 (04) : 1670 - 1674
  • [28] Prediction of subcellular location of mycobacterial protein using feature selection techniques
    Lin, Hao
    Ding, Hui
    Guo, Feng-Biao
    Huang, Jian
    MOLECULAR DIVERSITY, 2010, 14 (04) : 667 - 671
  • [29] Prediction of subcellular location of mycobacterial protein using feature selection techniques
    Hao Lin
    Hui Ding
    Feng-Biao Guo
    Jian Huang
    Molecular Diversity, 2010, 14 : 667 - 671
  • [30] Prediction of Eukaryotic Protein Subcellular Location Using a Novel Feature Extraction Method and Support Vector Machine
    Zhang ShaowuPan QuanWu YonghongCheng YongmeiSchool of Life SciencesNorthwestern Polytechnic UniversityXian ChinaSchool of Automatic ControlNorthwestern Polytechnic UniversityXian Chin
    西北工业大学学报, 2005, (06) : 798 - 803