Multiple structure alignment and consensus identification for proteins

被引:27
|
作者
Ilinkin, Ivaylo [1 ]
Ye, Jieping [2 ]
Janardan, Ravi [3 ]
机构
[1] Gettysburg Coll, Dept Comp Sci, Gettysburg, PA 17325 USA
[2] Arizona State Univ, Dept Comp Sci & Engn, Tempe, AZ 85287 USA
[3] Univ Minnesota, Dept Comp Sci & Engn, Minneapolis, MN USA
来源
BMC BIOINFORMATICS | 2010年 / 11卷
关键词
SEQUENCE ALIGNMENT;
D O I
10.1186/1471-2105-11-71
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: An algorithm is presented to compute a multiple structure alignment for a set of proteins and to generate a consensus (pseudo) protein which captures common substructures present in the given proteins. The algorithm represents each protein as a sequence of triples of coordinates of the alpha-carbon atoms along the backbone. It then computes iteratively a sequence of transformation matrices (i.e., translations and rotations) to align the proteins in space and generate the consensus. The algorithm is a heuristic in that it computes an approximation to the optimal alignment that minimizes the sum of the pairwise distances between the consensus and the transformed proteins. Results: Experimental results show that the algorithm converges quite rapidly and generates consensus structures that are visually similar to the input proteins. A comparison with other coordinate-based alignment algorithms (MAMMOTH and MATT) shows that the proposed algorithm is competitive in terms of speed and the sizes of the conserved regions discovered in an extensive benchmark dataset derived from the HOMSTRAD and SABmark databases. The algorithm has been implemented in C++ and can be downloaded from the project's web page. Alternatively, the algorithm can be used via a web server which makes it possible to align protein structures by uploading files from local disk or by downloading protein data from the RCSB Protein Data Bank. Conclusions: An algorithm is presented to compute a multiple structure alignment for a set of proteins, together with their consensus structure. Experimental results show its effectiveness in terms of the quality of the alignment and computational cost.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Alignment of multiple proteins with an ensemble of Hidden Markov Models
    Song, Jia
    Liu, Chunmei
    Song, Yinglei
    Qu, Junfeng
    Hura, Gurdeep S.
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2010, 4 (01) : 60 - 71
  • [32] PRALINE™:: a strategy for improved multiple alignment of transmembrane proteins
    Pirovano, Walter
    Feenstra, K. Anton
    Heringa, Jaap
    BIOINFORMATICS, 2008, 24 (04) : 492 - 497
  • [33] Alignment of multiple proteins with an ensemble of hidden markov models
    Song, Jia
    Liu, Chunmei
    Song, Yinglei
    Qu, Junfeng
    Hura, Gurdeep S.
    ICMLA 2007: SIXTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2007, : 594 - +
  • [34] Threading and alignment using composite predicted secondary structure and multiple structure alignment.
    An, YL
    Friesner, RA
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2001, 222 : U402 - U402
  • [35] Potential for dramatic improvement in sequence alignment against structures of remote homologous proteins by extracting structural information from multiple structure alignment
    Zhang, ZD
    Lindstam, M
    Unge, J
    Peterson, C
    Lu, GU
    JOURNAL OF MOLECULAR BIOLOGY, 2003, 332 (01) : 127 - 142
  • [36] MUSTA - A general, efficient, automated method for multiple structure alignment and detection of common motifs: Application to proteins
    Leibowitz, N
    Nussinov, R
    Wolfson, HJ
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2001, 8 (02) : 93 - 121
  • [37] MicroRNA identification based on sequence and structure alignment
    Wang, XW
    Zhang, J
    Gu, J
    He, T
    Zhang, XG
    Li, YD
    Li, F
    BIOINFORMATICS, 2005, 21 (18) : 3610 - 3614
  • [38] MergeAlign: improving multiple sequence alignment performance by dynamic reconstruction of consensus multiple sequence alignments
    Peter W Collingridge
    Steven Kelly
    BMC Bioinformatics, 13
  • [39] MergeAlign: improving multiple sequence alignment performance by dynamic reconstruction of consensus multiple sequence alignments
    Collingridge, Peter W.
    Kelly, Steven
    BMC BIOINFORMATICS, 2012, 13
  • [40] A model of evolution and structure for multiple sequence alignment
    Loeytynoja, Ari
    Goldman, Nick
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2008, 363 (1512) : 3913 - 3919