Efficient CGM-based parallel algorithms for the longest common subsequence problem with multiple substring-exclusion constraints

Cited by: 7
Authors
Tchendji, Vianney Kengne [1]
Ngomade, Armel Nkonjoh [1]
Zeutouo, Jerry Lacmou [1]
Myoupo, Jean Frederic [2]
Affiliations
[1] Univ Dschang, Dept Math & Comp Sci, Dschang, Cameroon
[2] Univ Picardie Jules Verne, Comp Sci Lab MIS, Amiens, France
Keywords
Parallel algorithms; Coarse grained multicomputer; Dynamic programming; Multiple-constrained LCS; Directed acyclic graph
DOI
10.1016/j.parco.2019.102598
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
A variant of the Longest Common Subsequence (LCS) problem is the LCS problem with multiple substring-exclusion constraints (M-STR-EC-LCS), which is important in many fields, especially bioinformatics. The problem is to compute the LCS of two strings X and Y, of lengths n and m respectively, that contains none of a set of d constraint strings P = {P_1, P_2, ..., P_d} of total length r as a substring. Recently, Wang et al. proposed a sequential solution based on dynamic programming that requires O(nmr) execution time and space. To the best of our knowledge, there is no parallel solution for this problem. This paper describes new efficient parallel algorithms on the Coarse Grained Multicomputer (CGM) model to solve it. First, we propose a multi-level Directed Acyclic Graph (DAG) that determines the correct evaluation order of sub-problems, in order to avoid redundancy due to overlap. Second, we propose two CGM parallel algorithms based on this DAG. The first algorithm is based on a regular partitioning of the DAG and requires O(nmr/p) execution time with O(p) communication rounds, where p is the number of processors used. Its main drawback is high processor idle time: because of the dependencies between the nodes of the DAG, many processors become idle over time. The second algorithm uses an irregular partitioning of the DAG that minimizes this idle time by allowing the processors to stay active as long as possible. It requires O(nmr/p) execution time with O(kp) communication rounds, where k is a constant integer used to set up the irregular partitioning. Both algorithms require O(r|Σ|/p) preprocessing time, where |Σ| is the size of the alphabet. The experimental results show good agreement with the theoretical predictions. © 2019 Elsevier B.V. All rights reserved.
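To make the problem statement concrete, here is a minimal sequential sketch of the O(nmr) dynamic program over (prefix of X, prefix of Y, automaton state). It is an illustration under assumptions, not the paper's parallel algorithm and not necessarily Wang et al.'s exact formulation: it assumes the constraints are tracked with an Aho-Corasick-style transition table (whose construction costs O(r|Σ|), in line with the preprocessing bound above), that every constraint is non-empty, and it returns only the length of the answer. The names `build_automaton` and `mstr_ec_lcs_length` are hypothetical.

```python
from collections import deque


def build_automaton(patterns, alphabet):
    """Build a full transition table over the trie of the patterns.

    Returns (delta, bad): delta[s][ch] is the next state, and bad[s] is
    True when state s means some pattern has just been completed.
    """
    goto = [{}]
    bad = [False]
    for pat in patterns:
        s = 0
        for ch in pat:
            if ch not in goto[s]:
                goto.append({})
                bad.append(False)
                goto[s][ch] = len(goto) - 1
            s = goto[s][ch]
        bad[s] = True

    fail = [0] * len(goto)
    delta = [dict() for _ in goto]
    queue = deque()
    for ch in alphabet:
        t = goto[0].get(ch, 0)
        delta[0][ch] = t
        if t:
            queue.append(t)
    # Breadth-first order guarantees fail[s] is finished before s.
    while queue:
        s = queue.popleft()
        bad[s] = bad[s] or bad[fail[s]]
        for ch in alphabet:
            t = goto[s].get(ch)
            if t is None:
                delta[s][ch] = delta[fail[s]][ch]
            else:
                fail[t] = delta[fail[s]][ch]
                delta[s][ch] = t
                queue.append(t)
    return delta, bad


def mstr_ec_lcs_length(x, y, patterns):
    """Length of the longest common subsequence of x and y that contains
    none of the patterns as a substring (assumes non-empty patterns)."""
    if any(len(p) == 0 for p in patterns):
        raise ValueError("constraints must be non-empty")
    alphabet = set(x) | set(y) | set("".join(patterns))
    delta, bad = build_automaton(patterns, alphabet)
    k = len(delta)
    NEG = -1  # marks "no valid subsequence ends in this state"

    # prev[j][s]: best length for the previous row of x, prefix y[:j], state s.
    prev = [[NEG] * k for _ in range(len(y) + 1)]
    for row in prev:
        row[0] = 0  # the empty subsequence sits in the start state
    for i in range(1, len(x) + 1):
        cur = [[NEG] * k for _ in range(len(y) + 1)]
        cur[0][0] = 0
        for j in range(1, len(y) + 1):
            up, left, diag, cell = prev[j], cur[j - 1], prev[j - 1], cur[j]
            for s in range(k):
                cell[s] = max(up[s], left[s])
            if x[i - 1] == y[j - 1]:
                c = x[i - 1]
                for s in range(k):
                    if diag[s] != NEG:
                        t = delta[s][c]
                        if not bad[t]:  # appending c must not complete a pattern
                            cell[t] = max(cell[t], diag[s] + 1)
        prev = cur
    return max(prev[len(y)])
```

Each cell (i, j) depends only on its upper, left, and diagonal neighbours, which is the dependency structure that a DAG-based partitioning across processors would have to respect.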
Pages: 15
Related papers
26 in total
  • [21] High-performance CGM-based parallel algorithms for minimum cost parenthesizing problem
    Jerry Lacmou Zeutouo
    Vianney Kengne Tchendji
    Jean Frédéric Myoupo
    The Journal of Supercomputing, 2022, 78 : 5306 - 5332
  • [22] A Branch Elimination-based Efficient Algorithm for Large-scale Multiple Longest Common Subsequence Problem
    Wei, Shiwei
    Wang, Yuping
    Cheung, Yiu-ming
    2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022 : 1485 - 1486
  • [23] A Branch Elimination-Based Efficient Algorithm for Large-Scale Multiple Longest Common Subsequence Problem
    Wei, Shiwei
    Wang, Yuping
    Cheung, Yiu-ming
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (03) : 2179 - 2192
  • [24] Dominant Point-Based Sequential and Parallel Algorithms for the Multiple Sequential Substring Constrained-LCS Problem
    Tepiele, Hermann Bogning
    Tchendji, Vianney Kengne
    Onabid, Mathias Akong
    Myoupo, Jean Frédéric
    Ngomade, Armel Nkonjoh
    ACM TRANSACTIONS ON PARALLEL COMPUTING, 2024, 11 (04)
  • [25] An Efficient CGM-Based Parallel Algorithm for Solving the Optimal Binary Search Tree Problem Through One-to-All Shortest Paths in a Dynamic Graph
    Vianney Kengne Tchendji
    Jerry Lacmou Zeutouo
    Data Science and Engineering, 2019, 4 : 141 - 156
  • [26] An Efficient CGM-Based Parallel Algorithm for Solving the Optimal Binary Search Tree Problem Through One-to-All Shortest Paths in a Dynamic Graph
    Tchendji, Vianney Kengne
    Zeutouo, Jerry Lacmou
    DATA SCIENCE AND ENGINEERING, 2019, 4 (02) : 141 - 156