Multiple structure alignment and consensus identification for proteins

被引:27
|
作者
Ilinkin, Ivaylo [1 ]
Ye, Jieping [2 ]
Janardan, Ravi [3 ]
机构
[1] Gettysburg Coll, Dept Comp Sci, Gettysburg, PA 17325 USA
[2] Arizona State Univ, Dept Comp Sci & Engn, Tempe, AZ 85287 USA
[3] Univ Minnesota, Dept Comp Sci & Engn, Minneapolis, MN USA
来源
BMC BIOINFORMATICS | 2010年 / 11卷
关键词
SEQUENCE ALIGNMENT;
D O I
10.1186/1471-2105-11-71
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: An algorithm is presented to compute a multiple structure alignment for a set of proteins and to generate a consensus (pseudo) protein which captures common substructures present in the given proteins. The algorithm represents each protein as a sequence of triples of coordinates of the alpha-carbon atoms along the backbone. It then computes iteratively a sequence of transformation matrices (i.e., translations and rotations) to align the proteins in space and generate the consensus. The algorithm is a heuristic in that it computes an approximation to the optimal alignment that minimizes the sum of the pairwise distances between the consensus and the transformed proteins. Results: Experimental results show that the algorithm converges quite rapidly and generates consensus structures that are visually similar to the input proteins. A comparison with other coordinate-based alignment algorithms (MAMMOTH and MATT) shows that the proposed algorithm is competitive in terms of speed and the sizes of the conserved regions discovered in an extensive benchmark dataset derived from the HOMSTRAD and SABmark databases. The algorithm has been implemented in C++ and can be downloaded from the project's web page. Alternatively, the algorithm can be used via a web server which makes it possible to align protein structures by uploading files from local disk or by downloading protein data from the RCSB Protein Data Bank. Conclusions: An algorithm is presented to compute a multiple structure alignment for a set of proteins, together with their consensus structure. Experimental results show its effectiveness in terms of the quality of the alignment and computational cost.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Scoring Consensus of Multiple ECG Annotators by Optimal Sequence Alignment
    Haghpanahi, Masoumeh
    Sameni, Reza
    Borkholder, David A.
    2014 36TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2014, : 1855 - 1859
  • [22] Evolving consensus sequence for multiple sequence alignment with a genetic algorithm
    Shyu, C
    Foster, JA
    GENETIC AND EVOLUTIONARY COMPUTATION - GECCO 2003, PT II, PROCEEDINGS, 2003, 2724 : 2313 - 2324
  • [23] Real value prediction of solvent accessibility in proteins using multiple sequence alignment and secondary structure
    Garg, A
    Kaur, H
    Raghava, GPS
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 61 (02) : 318 - 324
  • [24] Bayesian Multiple Protein Structure Alignment
    Wang, Rui
    Schmidler, Scott C.
    RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY, RECOMB2014, 2014, 8394 : 326 - 339
  • [25] MULTIPLE PROTEIN-STRUCTURE ALIGNMENT
    TAYLOR, WR
    FLORES, TP
    ORENGO, CA
    PROTEIN SCIENCE, 1994, 3 (10) : 1858 - 1870
  • [26] WITCH: Improved Multiple Sequence Alignment Through Weighted Consensus Hidden Markov Model Alignment
    Shen, Chengze
    Park, Minhyuk
    Warnow, Tandy
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2022, 29 (08) : 782 - 801
  • [27] An Algorithm of Multiple Sequence Alignment Based on Consensus Sequence Searched by Simulated Annealing and Star Alignment
    Yao, Dengfeng
    Jiang, Minghu
    You, Xu
    Abulizi, Abudoukelimu
    Hou, Renkui
    2015 INTERNATIONAL SYMPOSIUM ON BIOELECTRONICS AND BIOINFORMATICS (ISBB), 2015, : 3 - 6
  • [28] SEQUENCE ALIGNMENT OF CITRATE SYNTHASE PROTEINS USING A MULTIPLE SEQUENCE ALIGNMENT ALGORITHM AND MULTIPLE SCORING MATRICES
    HENNEKE, CM
    DANSON, MJ
    HOUGH, DW
    OSGUTHORPE, DJ
    PROTEIN ENGINEERING, 1989, 2 (08): : 597 - 604
  • [29] The accuracy of several multiple sequence alignment programs for proteins
    Paulo AS Nuin
    Zhouzhi Wang
    Elisabeth RM Tillier
    BMC Bioinformatics, 7
  • [30] The accuracy of several multiple sequence alignment programs for proteins
    Nuin, Paulo A. S.
    Wang, Zhouzhi
    Tillier, Elisabeth R. M.
    BMC BIOINFORMATICS, 2006, 7 (1)