Multiple sequence threading: An analysis of alignment quality and stability

被引:53
|
作者
Taylor, WR
机构
[1] Division of Mathematical Biology, Natl. Institute for Medical Research, Ridgeway, Mill Hill, London
关键词
protein; sequence; structure; threading; alignment;
D O I
10.1006/jmbi.1997.1008
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Methods that compare a protein sequence directly to a structure can be divided into those tl-lat construct a molecular model (threading methods) and those that perform a sequence alignment with the structure encoded as a sequence of structural states (one-dimensional/three-dimensional (1D/3D) matching). The former take into account the internal packing of the molecule but the latter do not. On the other hand, it is simple to include multiple sequence data in a 1D/3D comparison but difficult in a threading method. Here, a protein sequence/structure alignment method is described that uses a combination of matching predicted and observed residue exposure, predicted and observed secondary structure (1D/3D) together with pairwise packing interactions in the core (threading). Using a variety of distantly related and analogous protein structures, the multiple sequence threading (MST) method was compared to a single sequence treading (SST) method (that uses complex potentials of mean-force) and also to a multiple sequence alignment (MSA) program. It was found that the MST method produced alignments that were better than the best that could be obtained with either the SST or MSA method. The method was found to be stable to error in both secondary structure prediction and predicted exposure and also under variation of the key parameters (fully described in an Appendix). The contribution of the pairwise term was found to be small but without it, the correct alignments were less stable and structurally unreasonable deletions were observed when matching against larger structures. Using the parameters derived for alignment, the method was able to recognise related folds in the structure databank with a specificity comparable to other methods. (C) 1997 Academic Press Limited.
引用
收藏
页码:902 / 943
页数:42
相关论文
共 50 条
  • [41] GAP COSTS FOR MULTIPLE SEQUENCE ALIGNMENT
    ALTSCHUL, SF
    JOURNAL OF THEORETICAL BIOLOGY, 1989, 138 (03) : 297 - 309
  • [42] Multiple Sequence Alignment with Genetic Algorithms
    Botta, Marco
    Negro, Guido
    COMPUTATIONAL INTELLIGENCE METHODS FOR BIOINFORMATICS AND BIOSTATISTICS, 2010, 6160 : 206 - 214
  • [43] Parallel progressive multiple sequence alignment
    Pitzer, E
    COMPUTER AIDED SYSTEMS THEORY - EUROCAST 2005, 2005, 3643 : 473 - 482
  • [44] MULTIPLE SEQUENCE ALIGNMENT BY A PAIRWISE ALGORITHM
    TAYLOR, WR
    COMPUTER APPLICATIONS IN THE BIOSCIENCES, 1987, 3 (02): : 81 - 87
  • [45] Multiple sequence alignment using anytime A*
    Zhou, R
    Hansen, EA
    EIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS, 2002, : 975 - 976
  • [46] Multiple sequence alignment for phylogenetic purposes
    Morrison, David A.
    AUSTRALIAN SYSTEMATIC BOTANY, 2006, 19 (06) : 479 - 539
  • [47] A genetic algorithm for multiple sequence alignment
    Horng, JT
    Wu, LC
    Lin, CM
    Yang, BH
    SOFT COMPUTING, 2005, 9 (06) : 407 - 420
  • [48] An Optimized System for Multiple Sequence Alignment
    Yilmaz, Caglar
    Gok, Mustafa
    2009 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS, 2009, : 178 - +
  • [49] A FLEXIBLE MULTIPLE SEQUENCE ALIGNMENT PROGRAM
    MARTINEZ, HM
    NUCLEIC ACIDS RESEARCH, 1988, 16 (05) : 1683 - 1691
  • [50] Heuristics for multiobjective multiple sequence alignment
    Maryam Abbasi
    Luís Paquete
    Francisco B. Pereira
    BioMedical Engineering OnLine, 15