Multiple structure alignment and consensus identification for proteins

Cited by: 27

Authors:

Ilinkin, Ivaylo [1]

Ye, Jieping [2]

Janardan, Ravi [3]

Affiliations:

[1] Gettysburg Coll, Dept Comp Sci, Gettysburg, PA 17325 USA

[2] Arizona State Univ, Dept Comp Sci & Engn, Tempe, AZ 85287 USA

[3] Univ Minnesota, Dept Comp Sci & Engn, Minneapolis, MN USA

Source:

BMC BIOINFORMATICS | 2010 / Vol. 11

Keywords:

SEQUENCE ALIGNMENT;

DOI:

10.1186/1471-2105-11-71

Chinese Library Classification (CLC) number:

Q5 [Biochemistry];

Discipline classification code:

071010 ; 081704 ;

Abstract:

Background: An algorithm is presented to compute a multiple structure alignment for a set of proteins and to generate a consensus (pseudo) protein which captures common substructures present in the given proteins. The algorithm represents each protein as a sequence of triples of coordinates of the alpha-carbon atoms along the backbone. It then iteratively computes a sequence of transformation matrices (i.e., translations and rotations) to align the proteins in space and generate the consensus. The algorithm is a heuristic in that it computes an approximation to the optimal alignment that minimizes the sum of the pairwise distances between the consensus and the transformed proteins.

Results: Experimental results show that the algorithm converges quite rapidly and generates consensus structures that are visually similar to the input proteins. A comparison with other coordinate-based alignment algorithms (MAMMOTH and MATT) shows that the proposed algorithm is competitive in terms of speed and the sizes of the conserved regions discovered in an extensive benchmark dataset derived from the HOMSTRAD and SABmark databases. The algorithm has been implemented in C++ and can be downloaded from the project's web page. Alternatively, the algorithm can be used via a web server which makes it possible to align protein structures by uploading files from local disk or by downloading protein data from the RCSB Protein Data Bank.

Conclusions: An algorithm is presented to compute a multiple structure alignment for a set of proteins, together with their consensus structure. Experimental results show its effectiveness in terms of the quality of the alignment and computational cost.
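The iterative "transform, then update the consensus" idea in the abstract can be illustrated with a small sketch. This is not the authors' algorithm: it assumes every protein has the same number of alpha-carbon atoms and that residue i in one protein corresponds to residue i in every other (no gaps), which the real method does not require. It substitutes a standard Kabsch superposition (SVD-based least-squares rotation) and simple coordinate averaging, and it minimizes squared distances rather than the sum of distances named in the abstract. The function names `kabsch` and `iterative_consensus` are hypothetical.

```python
import numpy as np

def kabsch(mobile, target):
    """Rigid transform (R, t) that best maps `mobile` onto `target`.

    Both inputs are (n, 3) arrays of alpha-carbon coordinates with an
    assumed one-to-one residue correspondence. The transformed points
    are `mobile @ R.T + t`.
    """
    mobile_center = mobile.mean(axis=0)
    target_center = target.mean(axis=0)
    # Covariance between the centered point sets.
    H = (mobile - mobile_center).T @ (target - target_center)
    U, _, Vt = np.linalg.svd(H)
    # Flip the last axis if needed so R is a proper rotation (det = +1).
    d = 1.0 if np.linalg.det(Vt.T @ U.T) >= 0 else -1.0
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    t = target_center - R @ mobile_center
    return R, t

def iterative_consensus(structures, max_iter=50, tol=1e-8):
    """Alternate between superposing all structures onto the current
    consensus and recomputing the consensus as their mean."""
    consensus = structures[0].copy()  # assumption: seed with the first protein
    prev_cost = np.inf
    aligned, cost = list(structures), np.inf
    for _ in range(max_iter):
        aligned = []
        for s in structures:
            R, t = kabsch(s, consensus)
            aligned.append(s @ R.T + t)
        consensus = np.mean(aligned, axis=0)
        # Sum of squared deviations from the consensus (a stand-in objective).
        cost = sum(((a - consensus) ** 2).sum() for a in aligned)
        if prev_cost - cost < tol:
            break
        prev_cost = cost
    return consensus, aligned, cost
```

Calling `iterative_consensus` on a list of equally sized `(n, 3)` NumPy arrays returns the consensus coordinates, the transformed input structures, and the final cost. Seeding the consensus with the first structure is an arbitrary choice made here for brevity.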

Pages: 8
