Scalable Suffix Sorting on a Multicore Machine

被引:6
|
作者
Xie, Jing Yi [1 ]
Nong, Ge [1 ]
Lao, Bin [2 ]
Xu, Wentao [1 ]
机构
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510275, Guangdong, Peoples R China
[2] Guangdong Univ Foreign Studies, Sch Informat Sci & Technol, Guangzhou 510420, Peoples R China
基金
中国国家自然科学基金;
关键词
Sorting; Random access memory; Indexes; Multicore processing; Arrays; Task analysis; Big Data; Suffix sorting; algorithm design; multicore computer; ARRAY CONSTRUCTION;
D O I
10.1109/TC.2020.2972546
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
A number of methods have been proposed for suffix sorting on internal memory of RAM and external memory of hard disks. The current best results for suffix sorting on internal or external memory are achieved by several algorithms using the induced sorting (IS) method in various ways. While these algorithms are efficient, the internal ones are much different from those external in terms of the algorithm designs. A scalable IS method that can be applied for suffix sorting on both internal and external memory is highly desired. This article proposes a blockwise IS method to facilitate pipelined access on internal memory and sequential I/Os on external memory. The detailed algorithm of using this method for a 4-stage pipeline with multiple threads is described, where multiple threads are applied to parallelize not only the pipelined stages of consecutive blocks but also the tasks within each stage wherever possible. This algorithm is evaluated by our experiments on a set of realistic and artificial datasets to achieve better overall time and space performance than the existing best results from pSACAK, pDSS and pKS. Beside sorting suffixes on internal memory in linear time, the proposed method can be ported to external memory for sorting massive suffixes in linear I/O complexity.
引用
收藏
页码:1364 / 1375
页数:12
相关论文
共 50 条
  • [21] Scalable parallel suffix array construction
    Kulla, Fabian
    Sanders, Peter
    RECENT ADVANCES IN PARALLEL VIRTUAL MACHINE AND MESSAGE PASSING INTERFACE, 2006, 4192 : 22 - 29
  • [22] Deterministic Sparse Suffix Sorting in the Restore Model
    Fischer, Johannes
    Tomohiro, I
    Koppl, Dominik
    ACM TRANSACTIONS ON ALGORITHMS, 2020, 16 (04)
  • [23] A scalable lock on NUMA multicore
    Yi, ZhengMing
    Yao, YiPing
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (24):
  • [24] p-Suffix Sorting as Arithmetic Coding
    Beal, Richard
    Adjeroh, Donald
    COMBINATORIAL ALGORITHMS, 2011, 7056 : 44 - 56
  • [25] On the sorting-complexity of suffix tree construction
    Farach-Colton, M
    Ferragina, P
    Muthukrishnan, S
    JOURNAL OF THE ACM, 2000, 47 (06) : 987 - 1011
  • [26] Faster semi-external suffix sorting
    Dhaliwal, Jasbir
    INFORMATION PROCESSING LETTERS, 2014, 114 (04) : 174 - 178
  • [27] p-Suffix sorting as arithmetic coding
    Beal, Richard
    Adjeroh, Donald
    JOURNAL OF DISCRETE ALGORITHMS, 2012, 16 : 151 - 169
  • [28] The performance of linear time suffix sorting algorithms
    Puglisi, SJ
    Smyth, WF
    Turpin, A
    DCC 2005: DATA COMPRESSION CONFERENCE, PROCEEDINGS, 2005, : 358 - 367
  • [29] High Performance and Scalable Virtual Machine Storage I/O Stack for Multicore Systems
    Zhang, Diming
    Wu, Hao
    Xue, Fei
    Chen, Liangqiang
    Huang, Hao
    2017 IEEE 23RD INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2017, : 292 - 301
  • [30] Sorting Permutations by Prefix and Suffix Versions of Reversals and Transpositions
    Lintzmayer, Carla Negri
    Dias, Zanoni
    LATIN 2014: THEORETICAL INFORMATICS, 2014, 8392 : 671 - 682