An Efficient Algorithm for Suffix Sorting

被引:0
|
作者
Peng, Zhan [1 ]
Wang, Yuping [1 ]
Xue, Xingsi [2 ]
Wei, Jingxuan [1 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian 710071, Shaanxi, Peoples R China
[2] Fujian Univ Technol, Sch Informat Sci & Engn, Fuzhou 350118, Fujian, Peoples R China
基金
中国国家自然科学基金;
关键词
Suffix sorting; suffix array; text index; computation biology; CONSTRUCTION; ARRAYS;
D O I
10.1142/S0218001416590187
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Sufix Array (SA) is a fundamental data structure which is widely used in the applications such as string matching, text index and computation biology, etc. How to sort the suffixes of a string in lexicographical order is a primary problem in constructing SAs, and one of the widely used suffix sorting algorithms is qsufsort. However, qsufsort suffers one critical limitation that the order of suffixes starting with the same 2(k) characters cannot be determined in the kth round. To this point, in our paper, an efficient suffix sorting algorithm called dsufsort is proposed by overcoming the drawback of the qsufsort algorithm. In particular, our proposal maintains the depth of each unsorted portion of SA, and sorts the suffixes based on the depth in each round. By this means, some suffixes that cannot be sorted by qsufsort in each round can be sorted now, as a result, more sorting results in current round can be utilized by the latter rounds and the total number of sorting rounds will be reduced, which means dsufsort is more efficient than qsufsort. The experimental results show the effectiveness of the proposed algorithm, especially for the text with high repetitions.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] A fast suffix-sorting algorithm
    Ahlswede, R.
    Balkenhol, B.
    Deppe, C.
    Frohlich, M.
    GENERAL THEORY OF INFORMATION TRANSFER AND COMBINATORICS, 2006, 4123 : 719 - 734
  • [2] A Grammar Compression Algorithm based on Induced Suffix Sorting
    Nogueira Nunes, Daniel Saad
    Louza, Felipe A.
    Gog, Simon
    Ayala-Rincon, Mauricio
    Navarro, Gonzalo
    2018 DATA COMPRESSION CONFERENCE (DCC 2018), 2018, : 42 - 51
  • [3] Algorithm design and performance evaluation of sparse induced suffix sorting
    Wu, Wenbo
    Nong, Ge
    INFORMATION PROCESSING & MANAGEMENT, 2024, 61 (05)
  • [4] Faster suffix sorting
    Larsson, N. Jesper
    Sadakane, Kunihiko
    THEORETICAL COMPUTER SCIENCE, 2007, 387 (03) : 258 - 272
  • [5] A fast algorithm for Burrows-Wheeler Transform using suffix sorting
    Long, Bing-Jie (longbj1107@sinacom), 2015, Science Press (37):
  • [6] In-place suffix sorting
    Franceschini, G.
    Muthukrishnan, S.
    AUTOMATA, LANGUAGES AND PROGRAMMING, PROCEEDINGS, 2007, 4596 : 533 - +
  • [7] An efficient external sorting algorithm
    Leu, FC
    Tsai, YT
    Tang, CY
    INFORMATION PROCESSING LETTERS, 2000, 75 (04) : 159 - 163
  • [8] AN EFFICIENT PARALLEL SORTING ALGORITHM
    LIU, XQ
    KIM, JL
    INFORMATION PROCESSING LETTERS, 1992, 43 (03) : 129 - 133
  • [9] AN EFFICIENT SORTING ALGORITHM WITH CUDA
    Chen, Shifu
    Qin, Jing
    Xie, Yongming
    Zhao, Junping
    Heng, Pheng-Ann
    JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS, 2009, 32 (07) : 915 - 921
  • [10] Faster Sparse Suffix Sorting
    Tomohiro, I
    Karkkainen, Juha
    Kempa, Dominik
    31ST INTERNATIONAL SYMPOSIUM ON THEORETICAL ASPECTS OF COMPUTER SCIENCE (STACS 2014), 2014, 25 : 386 - 396