Motif-Aware PRALINE: Improving the alignment of motif regions

被引:7
|
作者
Dijkstra, Maurits [1 ]
Bawono, Punto [1 ]
Abeln, Sanne [1 ]
Feenstra, K. Anton [1 ]
Fokkink, Wan [1 ]
Heringa, Jaap [1 ]
机构
[1] Vrije Univ Amsterdam, Dept Comp Sci, Amsterdam, Netherlands
关键词
SEQUENCE ALIGNMENT; PARACOCCUS-DENITRIFICANS; MULTIPLE; DATABASE; REDUCTASE;
D O I
10.1371/journal.pcbi.1006547
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Protein or DNA motifs are sequence regions which possess biological importance. These regions are often highly conserved among homologous sequences. The generation of multiple sequence alignments (MSAs) with a correct alignment of the conserved sequence motifs is still difficult to achieve, due to the fact that the contribution of these typically short fragments is overshadowed by the rest of the sequence. Here we extended the PRALINE multiple sequence alignment program with a novel motif-aware MSA algorithm in order to address this shortcoming. This method can incorporate explicit information about the presence of externally provided sequence motifs, which is then used in the dynamic programming step by boosting the amino acid substitution matrix towards the motif. The strength of the boost is controlled by a parameter, a. Using a benchmark set of alignments we confirm that a good compromise can be found that improves the matching of motif regions while not significantly reducing the overall alignment quality. By estimating a on an unrelated set of reference alignments we find there is indeed a strong conservation signal for motifs. A number of typical but difficult MSA use cases are explored to exemplify the problems in correctly aligning functional sequence motifs and how the motif-aware alignment method can be employed to alleviate these problems.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] motif2vec: Motif Aware Node Representation Learning for Heterogeneous Networks
    Dareddy, Manoj Reddy
    Das, Mahashweta
    Yang, Hao
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 1052 - 1059
  • [22] Network alignment and motif discovery in dynamic networks
    Cinaglia, Pietro
    Cannataro, Mario
    NETWORK MODELING AND ANALYSIS IN HEALTH INFORMATICS AND BIOINFORMATICS, 2022, 11 (01):
  • [23] Motif Alignment for Time Series Data Augmentation
    Bahri, Omar
    Li, Peiyu
    Boubrahimi, Soukaina Filali
    Hamdi, Shah Muhammad
    BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, DAWAK 2023, 2023, 14148 : 42 - 48
  • [24] MAFin: motif detection in multiple alignment files
    Patsakis, Michail
    Provatas, Kimonas
    Baltoumas, Fotis A.
    Chantzi, Nikol
    Mouratidis, Ioannis
    Pavlopoulos, Georgios A.
    Georgakopoulos-Soares, Ilias
    BIOINFORMATICS, 2025, 41 (04)
  • [25] Network alignment and motif discovery in dynamic networks
    Pietro Cinaglia
    Mario Cannataro
    Network Modeling Analysis in Health Informatics and Bioinformatics, 2022, 11
  • [26] PairK: Pairwise k-mer alignment for quantifying protein motif conservation in disordered regions
    Halpin, Jackson C.
    Keating, Amy E.
    PROTEIN SCIENCE, 2025, 34 (01)
  • [27] Constrained RNA structural alignment:: Algorithms and application to motif detection in the untranslated regions of Trypanosoma brucei mRNAs
    Khaladkar, Mugdha
    Bellofatto, Vivian
    Wang, Jason T. L.
    Patel, Vandanaben
    Nakayama, Marvin K.
    PROCEEDINGS OF THE 7TH IEEE INTERNATIONAL SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, VOLS I AND II, 2007, : 334 - +
  • [28] Exploring Motif Composition of Eukaryotic Promoter Regions
    Stojanovic, Nikola
    Singh, Abanish
    ADVANCES IN COMPUTATIONAL BIOLOGY, 2010, 680 : 27 - 34
  • [29] DNA motif alignment by evolving a population of Markov chains
    Chengpeng Bi
    BMC Bioinformatics, 10 (Suppl 1)
  • [30] Local graph alignment and motif search in biological networks
    Berg, J
    Lässig, M
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (41) : 14689 - 14694