A novel approach for protein subcellular location prediction using amino acid exposure

被引:10
|
作者
Mer, Arvind Singh [1 ]
Andrade-Navarro, Miguel A. [1 ]
机构
[1] Max Delbruck Ctr Mol Med, D-13125 Berlin, Germany
来源
BMC BIOINFORMATICS | 2013年 / 14卷
关键词
SOLVENT ACCESSIBILITY; SECONDARY STRUCTURE; LOCALIZATION; SEQUENCE;
D O I
10.1186/1471-2105-14-342
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Proteins perform their functions in associated cellular locations. Therefore, the study of protein function can be facilitated by predictions of protein location. Protein location can be predicted either from the sequence of a protein alone by identification of targeting peptide sequences and motifs, or by homology to proteins of known location. A third approach, which is complementary, exploits the differences in amino acid composition of proteins associated to different cellular locations, and can be useful if motif and homology information are missing. Here we expand this approach taking into account amino acid composition at different levels of amino acid exposure. Results: Our method has two stages. For stage one, we trained multiple Support Vector Machines (SVMs) to score eukaryotic protein sequences for membership to each of three categories: nuclear, cytoplasmic and extracellular, plus extra category nucleocytoplasmic, accounting for the fact that a large number of proteins shuttles between those two locations. In stage two we use an artificial neural network (ANN) to propose a category from the scores given to the four locations in stage one. The method reaches an accuracy of 68% when using as input 3D-derived values of amino acid exposure. Calibration of the method using predicted values of amino acid exposure allows classifying proteins without 3D-information with an accuracy of 62% and discerning proteins in different locations even if they shared high levels of identity. Conclusions: In this study we explored the relationship between residue exposure and protein subcellular location. We developed a new algorithm for subcellular location prediction that uses residue exposure signatures. Our algorithm uses a novel approach to address the multiclass classification problem.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Multilabel Learning for Protein Subcellular Location Prediction
    Li, Guo-Zheng
    Wang, Xiao
    Hu, Xiaohua
    Liu, Jia-Ming
    Zhao, Rui-Wei
    IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2012, 11 (03) : 237 - 243
  • [32] Recent progress in protein subcellular location prediction
    Chou, Kuo-Chen
    Shen, Hong-Bin
    ANALYTICAL BIOCHEMISTRY, 2007, 370 (01) : 1 - 16
  • [33] Multitask Learning for Protein Subcellular Location Prediction
    Xu, Qian
    Pan, Sinno Jialin
    Xue, Hannah Hong
    Yang, Qiang
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2011, 8 (03) : 748 - 759
  • [34] Subcellular location prediction of proteins using support vector machines with alignment of block sequences utilizing amino acid composition
    Takeyuki Tamura
    Tatsuya Akutsu
    BMC Bioinformatics, 8
  • [35] Subcellular location prediction of proteins using support vector machines with alignment of block sequences utilizing amino acid composition
    Tamura, Takeyuki
    Akutsu, Tatsuya
    BMC BIOINFORMATICS, 2007, 8 (1)
  • [36] Prediction of Protein Subcellular Location Using the Information Entropy and the Auto Covariance Transformation
    Guo, Tingwei
    Wang, Guodong
    Zhang, Zili
    Fan, Zichuan
    2018 INTERNATIONAL CONFERENCE ON ALGORITHMS, COMPUTING AND ARTIFICIAL INTELLIGENCE (ACAI 2018), 2018,
  • [37] Protein Subcellular Location: The Gap Between Prediction and Experimentation
    Erhui Xiong
    Chenyu Zheng
    Xiaolin Wu
    Wei Wang
    Plant Molecular Biology Reporter, 2016, 34 : 52 - 61
  • [38] Protein Subcellular Location: The Gap Between Prediction and Experimentation
    Xiong, Erhui
    Zheng, Chenyu
    Wu, Xiaolin
    Wang, Wei
    PLANT MOLECULAR BIOLOGY REPORTER, 2016, 34 (01) : 52 - 61
  • [39] iAPSL-IF: Identification of Apoptosis Protein Subcellular Location Using Integrative Features Captured from Amino Acid Sequences
    Tang, Yadong
    Xie, Lu
    Chen, Lanming
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2018, 19 (04)
  • [40] Subcellular Protein Localization by Using a Genetically Encoded Fluorescent Amino Acid
    Charbon, Godefroid
    Brustad, Eric
    Scott, Kevin A.
    Wang, Jiangyun
    Lobner-Olesen, Anders
    Schultz, Peter G.
    Jacobs-Wagner, Christine
    Chapman, Eli
    CHEMBIOCHEM, 2011, 12 (12) : 1818 - 1821