Progressive multiple sequence alignments from triplets

被引:17
|
作者
Kruspe, Matthias
Stadler, Peter F.
机构
[1] Univ Leipzig, Bioinformat Grp, Dept Comp Sci, D-04107 Leipzig, Germany
[2] Univ Leipzig, Bioinformat Grp, Interdisciplinary Ctr Bioinformat, D-04107 Leipzig, Germany
[3] Fraunhofer Inst Zelltherapie & Immunol IZI, D-04103 Leipzig, Germany
[4] Univ Vienna, Inst Theoret Chem, A-1090 Vienna, Austria
[5] Santa Fe Inst, Santa Fe, NM 87501 USA
关键词
D O I
10.1186/1471-2105-8-254
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The quality of progressive sequence alignments strongly depends on the accuracy of the individual pairwise alignment steps since gaps that are introduced at one step cannot be removed at later aggregation steps. Adjacent insertions and deletions necessarily appear in arbitrary order in pairwise alignments and hence form an unavoidable source of errors. Research: Here we present a modified variant of progressive sequence alignments that addresses both issues. Instead of pairwise alignments we use exact dynamic programming to align sequence or profile triples. This avoids a large fractions of the ambiguities arising in pairwise alignments. In the subsequent aggregation steps we follow the logic of the Neighbor- Net algorithm, which constructs a phylogenetic network by step- wisely replacing triples by pairs instead of combining pairs to singletons. To this end the three- way alignments are subdivided into two partial alignments, at which stage all- gap columns are naturally removed. This alleviates the '' once a gap, always a gap '' problem of progressive alignment procedures. Conclusion: The three- way Neighbor- Net based alignment program aln3nn is shown to compare favorably on both protein sequences and nucleic acids sequences to other progressive alignment tools. In the latter case one easily can include scoring terms that consider secondary structure features. Overall, the quality of resulting alignments in general exceeds that of clustalw or other multiple alignments tools even though our software does not included heuristics for context dependent ( mis) match scores.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] AltAVisT: Comparing alternative multiple sequence alignments
    Morgenstern, B
    Goel, S
    Sczyrba, A
    Dress, A
    BIOINFORMATICS, 2003, 19 (03) : 425 - 426
  • [32] CONSEQUENCES : CONstrained SEQUENCE alignmentS With Multiple UserWeights
    West, Elizabeth
    Daling, Kyle
    Miller, Courtney
    Rosales, Wes
    Vukovic, Sasa
    Jagodzinski, Filip
    ACM-BCB'19: PROCEEDINGS OF THE 10TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND HEALTH INFORMATICS, 2019, : 574 - 579
  • [33] Sequence Selection for Multiple Alignments of Transmembrane Proteins
    Nishio, Takuhiro
    Ohta, Teruyuki
    Kaneko, Sunao
    Shimizu, Toshio
    INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2009, 12 (01): : 235 - 242
  • [34] ESPript:: analysis of multiple sequence alignments in PostScript
    Gouet, P
    Courcelle, E
    Stuart, DI
    Métoz, F
    BIOINFORMATICS, 1999, 15 (04) : 305 - 308
  • [35] COFFEE: An objective function for multiple sequence alignments
    Notredame, C
    Holm, L
    Higgins, DG
    BIOINFORMATICS, 1998, 14 (05) : 407 - 422
  • [36] On Using Consistency Consistently in Multiple Sequence Alignments
    Joao, Mario, Jr.
    Senat, Alexandre C.
    Rebello, Vinod E. F.
    2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2022), 2022, : 152 - 161
  • [37] FAST AND SENSITIVE MULTIPLE SEQUENCE ALIGNMENTS ON A MICROCOMPUTER
    HIGGINS, DG
    SHARP, PM
    COMPUTER APPLICATIONS IN THE BIOSCIENCES, 1989, 5 (02): : 151 - 153
  • [38] Measuring the distance between multiple sequence alignments
    Blackburne, Benjamin P.
    Whelan, Simon
    BIOINFORMATICS, 2012, 28 (04) : 495 - 502
  • [39] Identifying subset errors in multiple sequence alignments
    Roy, Aparna
    Taddese, Bruck
    Vohra, Shabana
    Thimmaraju, Phani K.
    Illingworth, Christopher J. R.
    Simpson, Lisa M.
    Mukherjee, Keya
    Reynolds, Christopher A.
    Chintapalli, Sree V.
    JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2014, 32 (03): : 364 - 371
  • [40] State of the art: refinement of multiple sequence alignments
    Chakrabarti, Saikat
    Lanczycki, Christopher J.
    Panchenko, Anna R.
    Przytycka, Teresa M.
    Thiessen, Paul A.
    Bryant, Stephen H.
    BMC BIOINFORMATICS, 2006, 7 (1)