Motif-Aware PRALINE: Improving the alignment of motif regions

Cited by: 7
Authors
Dijkstra, Maurits [1]
Bawono, Punto [1]
Abeln, Sanne [1]
Feenstra, K. Anton [1]
Fokkink, Wan [1]
Heringa, Jaap [1]
Affiliations
[1] Vrije Univ Amsterdam, Dept Comp Sci, Amsterdam, Netherlands
Keywords
SEQUENCE ALIGNMENT; PARACOCCUS-DENITRIFICANS; MULTIPLE; DATABASE; REDUCTASE;
DOI
10.1371/journal.pcbi.1006547
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Discipline classification codes
071010; 081704;
Abstract
Protein or DNA motifs are sequence regions that possess biological importance. These regions are often highly conserved among homologous sequences. Generating multiple sequence alignments (MSAs) in which the conserved sequence motifs are correctly aligned is still difficult, because the contribution of these typically short fragments is overshadowed by the rest of the sequence. Here we extended the PRALINE multiple sequence alignment program with a novel motif-aware MSA algorithm to address this shortcoming. This method can incorporate explicit information about the presence of externally provided sequence motifs, which is then used in the dynamic programming step by boosting the amino acid substitution matrix towards the motif. The strength of the boost is controlled by a parameter, α. Using a benchmark set of alignments, we confirm that a good compromise can be found that improves the matching of motif regions while not significantly reducing the overall alignment quality. By estimating α on an unrelated set of reference alignments, we find there is indeed a strong conservation signal for motifs. A number of typical but difficult MSA use cases are explored to exemplify the problems in correctly aligning functional sequence motifs and how the motif-aware alignment method can be employed to alleviate these problems.
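The idea of boosting substitution scores at motif positions during dynamic programming can be illustrated with a minimal pairwise sketch. This is not PRALINE's implementation: the function name `align`, the simple match/mismatch scoring (a stand-in for an amino acid substitution matrix), the linear gap penalty, and the additive form of the boost are all assumptions made for illustration. Only the role of the boost-strength parameter (named `alpha` here) follows the abstract.

```python
def align(seq_a, seq_b, motif_a, motif_b, alpha=0.0,
          match=1.0, mismatch=-1.0, gap=-2.0):
    """Global (Needleman-Wunsch) alignment with a hypothetical motif boost.

    motif_a / motif_b are lists of booleans marking externally annotated
    motif positions. When both aligned positions are motif positions, the
    pair score is raised by alpha (an assumed additive form of the boost).
    Returns (score, aligned_a, aligned_b).
    """
    n, m = len(seq_a), len(seq_b)
    # score[i][j]: best score for aligning seq_a[:i] with seq_b[:j]
    score = [[0.0] * (m + 1) for _ in range(n + 1)]
    ptr = [[None] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0], ptr[i][0] = i * gap, "up"
    for j in range(1, m + 1):
        score[0][j], ptr[0][j] = j * gap, "left"

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            s = match if seq_a[i - 1] == seq_b[j - 1] else mismatch
            if motif_a[i - 1] and motif_b[j - 1]:
                s += alpha  # boost pairs of motif-annotated positions
            candidates = [
                (score[i - 1][j - 1] + s, "diag"),
                (score[i - 1][j] + gap, "up"),
                (score[i][j - 1] + gap, "left"),
            ]
            # max() keeps the first maximum, so ties prefer the diagonal
            score[i][j], ptr[i][j] = max(candidates, key=lambda c: c[0])

    # Trace back from the bottom-right corner to recover the alignment
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        move = ptr[i][j]
        if move == "diag":
            out_a.append(seq_a[i - 1]); out_b.append(seq_b[j - 1])
            i, j = i - 1, j - 1
        elif move == "up":
            out_a.append(seq_a[i - 1]); out_b.append("-")
            i -= 1
        else:
            out_a.append("-"); out_b.append(seq_b[j - 1])
            j -= 1
    return score[n][m], "".join(reversed(out_a)), "".join(reversed(out_b))


# Toy input: the motif residue "A" sits at different offsets in each sequence.
motifs_a, motifs_b = [True, False], [False, True]
print(align("AB", "BA", motifs_a, motifs_b, alpha=0.0))
print(align("AB", "BA", motifs_a, motifs_b, alpha=5.0))
```

In this toy case, with `alpha=0.0` the ungapped alignment scores best and the motif residues stay misaligned, while a large enough `alpha` makes it worthwhile to open gaps so that the two motif positions line up. This mirrors the trade-off the abstract describes between matching motif regions and overall alignment quality.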
Pages: 19
Related papers
50 in total
  • [31] DNA motif alignment by evolving a population of Markov chains
    Bi, Chengpeng
    BMC BIOINFORMATICS, 2009, 10
  • [32] Using Catalytic Site Motif Alignment to Assign Function
    Dodge, Gregory James
    Bobo, Daniel Paul
    Bernstein, Herbert J.
    Craig, Paul A.
    FASEB JOURNAL, 2011, 25
  • [33] Building multiple alignment using iterative dynamic improvement of the initial motif alignment
    Nikolaev, VK
    Leontovich, AM
    Drachev, VA
    Brodsky, LI
    BIOCHEMISTRY-MOSCOW, 1997, 62 (06) : 578 - 582
  • [34] Repeat Motif-containing Regions within Thyroglobulin
    Lee, Jaemin
    Arvan, Peter
    JOURNAL OF BIOLOGICAL CHEMISTRY, 2011, 286 (30) : 26327 - 26333
  • [35] Finding optimal structural motif in protein structures by network alignment
    Chang, Lu-Lu
    International Journal of Applied Mathematics and Statistics, 2013, 46 (16): : 239 - 244
  • [36] An Approximation Algorithm for Alignment of Multiple Sequences using Motif Discovery
    Laxmi Parida
    Aris Floratos
    Isidore Rigoutsos
    Journal of Combinatorial Optimization, 1999, 3 : 247 - 275
  • [37] An approximation algorithm for alignment of multiple sequences using motif discovery
    Parida, L
    Floratos, A
    Rigoutsos, I
    JOURNAL OF COMBINATORIAL OPTIMIZATION, 1999, 3 (2-3) : 247 - 275
  • [38] Automated multiple structure alignment and detection of a common substructural motif
    Leibowitz, N
    Fligelman, ZY
    Nussinov, R
    Wolfson, HJ
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2001, 43 (03) : 235 - 245
  • [39] RNAMotifScanX: a graph alignment approach for RNA structural motif identification
    Zhong, Cuncong
    Zhang, Shaojie
    RNA, 2015, 21 (03) : 333 - 346
  • [40] Improving efficiency in sparse learning with the feedforward inhibitory motif
    Xu, Zihan
    Skorheim, Steven
    Tu, Ming
    Berisha, Visar
    Yu, Shimeng
    Seo, Jae-sun
    Bazhenov, Maxim
    Cao, Yu
    NEUROCOMPUTING, 2017, 267 : 141 - 151