Motif-Aware PRALINE: Improving the alignment of motif regions

被引:7
|
作者
Dijkstra, Maurits [1 ]
Bawono, Punto [1 ]
Abeln, Sanne [1 ]
Feenstra, K. Anton [1 ]
Fokkink, Wan [1 ]
Heringa, Jaap [1 ]
机构
[1] Vrije Univ Amsterdam, Dept Comp Sci, Amsterdam, Netherlands
关键词
SEQUENCE ALIGNMENT; PARACOCCUS-DENITRIFICANS; MULTIPLE; DATABASE; REDUCTASE;
D O I
10.1371/journal.pcbi.1006547
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Protein or DNA motifs are sequence regions which possess biological importance. These regions are often highly conserved among homologous sequences. The generation of multiple sequence alignments (MSAs) with a correct alignment of the conserved sequence motifs is still difficult to achieve, due to the fact that the contribution of these typically short fragments is overshadowed by the rest of the sequence. Here we extended the PRALINE multiple sequence alignment program with a novel motif-aware MSA algorithm in order to address this shortcoming. This method can incorporate explicit information about the presence of externally provided sequence motifs, which is then used in the dynamic programming step by boosting the amino acid substitution matrix towards the motif. The strength of the boost is controlled by a parameter, a. Using a benchmark set of alignments we confirm that a good compromise can be found that improves the matching of motif regions while not significantly reducing the overall alignment quality. By estimating a on an unrelated set of reference alignments we find there is indeed a strong conservation signal for motifs. A number of typical but difficult MSA use cases are explored to exemplify the problems in correctly aligning functional sequence motifs and how the motif-aware alignment method can be employed to alleviate these problems.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Motif-aware Sequential Recommendation
    Cui, Zeyu
    Cai, Yinjiang
    Wu, Shu
    Ma, Xibo
    Wang, Liang
    SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 1738 - 1742
  • [2] Motif-aware diffusion network inference
    Qi Tan
    Yang Liu
    Jiming Liu
    International Journal of Data Science and Analytics, 2020, 9 : 375 - 387
  • [3] Motif-aware diffusion network inference
    Tan, Qi
    Liu, Yang
    Liu, Jiming
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2020, 9 (04) : 375 - 387
  • [4] Scalable Motif-aware Graph Clustering
    Tsourkakis, Charalampos E.
    Pachocki, Jakub
    Mitzenmacher, Michael
    PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'17), 2017, : 1451 - 1460
  • [5] Motif-aware curriculum learning for node classification
    Cai, Xiaosha
    Chen, Man-Sheng
    Wang, Chang-Dong
    Zhang, Haizhang
    NEURAL NETWORKS, 2025, 184
  • [6] Motif-Aware Adversarial Graph Representation Learning
    Zhao, Ming
    Zhang, Yinglong
    Xia, Xuewen
    Xu, Xing
    IEEE ACCESS, 2022, 10 : 8617 - 8626
  • [7] Community Detection by Motif-Aware Label Propagation
    Li, Pei-Zhen
    Huang, Ling
    Wang, Chang-Dong
    Lai, Jian-Huang
    Huang, Dong
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2020, 14 (02)
  • [8] Disentangled Motif-aware Graph Learning for Phrase Grounding
    Mu, Zongshen
    Tang, Siliang
    Tan, Jie
    Yu, Qiang
    Zhuang, Yueting
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 13587 - 13594
  • [9] Overlapping Community Detection by Motif-aware Label Propagation
    Liao, Jian-Chao
    Li, Tong
    Weng, Wei
    Wang, Jin-Bo
    Wen, Juan
    Journal of Network Intelligence, 2022, 7 (01): : 260 - 277
  • [10] MARML: Motif-Aware Deep Representation Learning in Multilayer Networks
    Zhang, Da
    Kabuka, Mansur R.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (09) : 11649 - 11660