Improving the accuracy of transmembrane protein topology prediction using evolutionary information

被引:329
作者
Jones, David T. [1 ]
机构
[1] UCL, Dept Comp Sci, London WC1E 6BT, England
关键词
D O I
10.1093/bioinformatics/btl677
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Many important biological processes such as cell signaling, transport of membrane-impermeable molecules, cell-cell communication, cell recognition and cell adhesion are mediated by membrane proteins. Unfortunately, as these proteins are not water soluble, it is extremely hard to experimentally determine their structure. Therefore, improved methods for predicting the structure of these proteins are vital in biological research. In order to improve transmembrane topology prediction, we evaluate the combined use of both integrated signal peptide prediction and evolutionary information in a single algorithm. Results: A new method (MEMSAT3) for predicting transmembrane protein topology from sequence profiles is described and benchmarked with full cross-validation on a standard data set of 184 transmembrane proteins. The method is found to predict both the correct topology and the locations of transmembrane segments for 80% of the test set. This compares with accuracies of 62-72% for other popular methods on the same benchmark. By using a second neural network specifically to discriminate transmembrane from globular proteins, a very low overall false positive rate (0.5%) can also be achieved in detecting transmembrane proteins. Availability: An implementation of the described method is available both as a web server (http://www.psipred.net) and as downloadable source code from http://bioinf.cs.ucl.ac.uk/memsat. Both the server and source code files are free to non-commercial users. Benchmark and training data are also available from http://bioinf.cs.ucl.ac.uk/memsat. Contact: dtj@cs.ucl.ac.uk.
引用
收藏
页码:538 / 544
页数:7
相关论文
共 26 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   PONGO: a web server for multiple predictions of all-alpha transmembrane proteins [J].
Amico, Mauro ;
Finelli, Michele ;
Rossi, Ivan ;
Zauli, Andrea ;
Elofsson, Arne ;
Viklund, Hakan ;
von Heijne, Gunnar ;
Jones, David ;
Krogh, Anders ;
Fariselli, Piero ;
Martelli, Pier Luigi ;
Casadio, Rita .
NUCLEIC ACIDS RESEARCH, 2006, 34 :W169-W172
[3]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[4]   Solving the membrane protein folding problem [J].
Bowie, JU .
NATURE, 2005, 438 (7068) :581-589
[5]   Transmembrane helix predictions revisited [J].
Chen, CP ;
Kernytsky, A ;
Rost, B .
PROTEIN SCIENCE, 2002, 11 (12) :2774-2791
[6]  
DONNELLY D, 1993, PROTEIN SCI, V2, P55
[7]   Progress in structure prediction of α-helical membrane proteins [J].
Fleishman, Sarel J. ;
Ben-Tal, Nir .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 2006, 16 (04) :496-504
[8]   A MODEL RECOGNITION APPROACH TO THE PREDICTION OF ALL-HELICAL MEMBRANE-PROTEIN STRUCTURE AND TOPOLOGY [J].
JONES, DT ;
TAYLOR, WR ;
THORTON, JM .
BIOCHEMISTRY, 1994, 33 (10) :3038-3049
[9]   Protein secondary structure prediction based on position-specific scoring matrices [J].
Jones, DT .
JOURNAL OF MOLECULAR BIOLOGY, 1999, 292 (02) :195-202
[10]   Do transmembrane protein superfolds exist? [J].
Jones, DT .
FEBS LETTERS, 1998, 423 (03) :281-285