Protein solvent accessibility prediction using support vector machines and sequence conservations

Cited by: 0
Authors
Ogul, Hasan [1]
Mumcuoglu, Erkan U.
Affiliations
[1] Baskent Univ, Dept Comp Engn, TR-06530 Ankara, Turkey
[2] Middle E Tech Univ, Informat Syst & Hlth Informat, TR-06531 Ankara, Turkey
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
A two-stage method is developed for predicting protein solvent accessibility from a single sequence, using only the amino acid sequence. The first stage classifies each residue in a protein sequence as exposed or buried using a support vector machine (SVM). The SVM features are physicochemical properties of the residue to be predicted, together with information from its neighboring residues. In the second stage, the SVM-based predictions are refined using pairwise conserved patterns called maximal unique matches (MUMs), which are identified with an efficient data structure, the suffix tree. The baseline predictions, SVM-based predictions, and MUM-based refinements are tested on a nonredundant protein data set, and approximately 73% prediction accuracy is achieved for a solvent accessibility threshold that yields an even distribution between the buried and exposed classes. The results demonstrate that the new method achieves slightly better accuracy than recent single-sequence prediction methods.
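The two building blocks named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and several details are assumptions: the Kyte-Doolittle hydropathy scale stands in for the unspecified physicochemical properties, the window size and zero padding are arbitrary, and the MUM search is a brute-force scan rather than a suffix tree. The function names `window_features` and `maximal_unique_matches` are hypothetical. The SVM training step and the rule used to refine predictions from MUMs are not described in the abstract, so they are left out.

```python
# Kyte-Doolittle hydropathy values; assumed here as a stand-in for the
# physicochemical properties, which the abstract does not enumerate.
HYDROPATHY = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def window_features(sequence, half_window=1):
    """Build one feature vector per residue from the residue itself and
    its neighbors; positions beyond the sequence ends are padded with 0.0."""
    features = []
    for center in range(len(sequence)):
        row = []
        for pos in range(center - half_window, center + half_window + 1):
            if 0 <= pos < len(sequence):
                row.append(HYDROPATHY.get(sequence[pos], 0.0))
            else:
                row.append(0.0)
        features.append(row)
    return features


def occurrences(text, pattern):
    """Return start positions of all (possibly overlapping) occurrences."""
    positions = []
    start = text.find(pattern)
    while start != -1:
        positions.append(start)
        start = text.find(pattern, start + 1)
    return positions


def maximal_unique_matches(a, b, min_len=3):
    """Brute-force MUM search: substrings that occur exactly once in each
    sequence and cannot be extended left or right while still matching.
    Returns (start_in_a, start_in_b, length) tuples."""
    mums = []
    for i in range(len(a)):
        for length in range(min_len, len(a) - i + 1):
            sub = a[i:i + length]
            pos_b = occurrences(b, sub)
            if not pos_b:
                break  # longer substrings starting at i cannot match either
            if len(pos_b) != 1 or len(occurrences(a, sub)) != 1:
                continue
            j = pos_b[0]
            left = i > 0 and j > 0 and a[i - 1] == b[j - 1]
            end_a, end_b = i + length, j + length
            right = end_a < len(a) and end_b < len(b) and a[end_a] == b[end_b]
            if not left and not right:
                mums.append((i, j, length))
    return mums


if __name__ == "__main__":
    print(window_features("AKV"))
    print(maximal_unique_matches("MKVLAAGIVG", "TTVLAAGQQ"))
```

The per-residue vectors from `window_features` would be the input to an SVM classifier, and the aligned positions reported by `maximal_unique_matches` mark where two sequences share a conserved segment. The brute-force scan is only practical for short sequences; the suffix tree mentioned in the abstract is what makes the search efficient.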
Pages: 141-148
Page count: 8