Fast Parallel Computation of Longest Common Prefixes

被引:9
|
作者
Shun, Julian [1 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
基金
美国国家科学基金会;
关键词
SUFFIX; ARRAYS;
D O I
10.1109/SC.2014.37
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Suffix arrays and the corresponding longest common prefix (LCP) array have wide applications in bioinformatics, information retrieval and data compression. In this work, we propose and theoretically analyze new parallel algorithms for computing the LCP array given the suffix array as input. Most of our algorithms have a work and depth (parallel time) complexity related to the LCP values of the input. We also present a slight variation of Karkkainen and Sanders' skew algorithm that requires linear work and poly-logarithmic depth in the worst case. We present a comprehensive experimental study of our parallel algorithms along with existing parallel and sequential LCP algorithms. On a variety of real-world and artificial strings, we show that on a 40-core shared-memory machine our fastest algorithm is up to 2.3 times faster than the fastest existing parallel algorithm, and up to 21.8 times faster than the fastest sequential LCP algorithm.
引用
收藏
页码:387 / 398
页数:12
相关论文
共 50 条
  • [31] Efficient Computation of Longest Common Subsequences with Multiple Substring Inclusive Constraints
    Wang, Xiaodong
    Wang, Lei
    Zhu, Daxin
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2019, 26 (09) : 938 - 947
  • [32] FAST LINEAR-SPACE COMPUTATIONS OF LONGEST COMMON SUBSEQUENCES
    APOSTOLICO, A
    BROWNE, S
    GUERRA, C
    THEORETICAL COMPUTER SCIENCE, 1992, 92 (01) : 3 - 17
  • [33] Fast DNA Sequence Clustering Based on Longest Common Subsequence
    Namiki, Youhei
    Ishida, Takashi
    Akiyama, Yutaka
    EMERGING INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, 2012, 304 : 453 - 460
  • [34] Parallel Time and Quantifier Prefixes
    Cucker, Felipe
    de Naurois, Paulin Jacobe
    COMPUTATIONAL COMPLEXITY, 2009, 18 (04) : 527 - 550
  • [35] Parallel Time and Quantifier Prefixes
    Felipe Cucker
    Paulin Jacobé de Naurois
    computational complexity, 2009, 18
  • [36] A Real Linear and Parallel Multiple Longest Common Subsequences (MLCS) Algorithm
    Li, Yanni
    Li, Hui
    Duan, Tihua
    Wang, Sheng
    Wang, Zhi
    Cheng, Yang
    KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 1725 - 1734
  • [37] Parallel computing for the longest common subsequences in network intrusion detection system
    Zhong, C
    Chen, GL
    He, JH
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 2553 - 2557
  • [38] Systolic-based parallel architecture for the longest common subsequences problem
    Luce, G
    Myoupo, JF
    INTEGRATION-THE VLSI JOURNAL, 1998, 25 (01) : 53 - 70
  • [39] THE TONES OF PREFIXES IN COMMON BANTU
    Meeussen, A. E.
    AFRICA, 1954, 24 (01): : 48 - 53