An Efficient Algorithm for Suffix Sorting

被引:0
|
作者
Peng, Zhan [1 ]
Wang, Yuping [1 ]
Xue, Xingsi [2 ]
Wei, Jingxuan [1 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian 710071, Shaanxi, Peoples R China
[2] Fujian Univ Technol, Sch Informat Sci & Engn, Fuzhou 350118, Fujian, Peoples R China
基金
中国国家自然科学基金;
关键词
Suffix sorting; suffix array; text index; computation biology; CONSTRUCTION; ARRAYS;
D O I
10.1142/S0218001416590187
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Sufix Array (SA) is a fundamental data structure which is widely used in the applications such as string matching, text index and computation biology, etc. How to sort the suffixes of a string in lexicographical order is a primary problem in constructing SAs, and one of the widely used suffix sorting algorithms is qsufsort. However, qsufsort suffers one critical limitation that the order of suffixes starting with the same 2(k) characters cannot be determined in the kth round. To this point, in our paper, an efficient suffix sorting algorithm called dsufsort is proposed by overcoming the drawback of the qsufsort algorithm. In particular, our proposal maintains the depth of each unsorted portion of SA, and sorts the suffixes based on the depth in each round. By this means, some suffixes that cannot be sorted by qsufsort in each round can be sorted now, as a result, more sorting results in current round can be utilized by the latter rounds and the total number of sorting rounds will be reduced, which means dsufsort is more efficient than qsufsort. The experimental results show the effectiveness of the proposed algorithm, especially for the text with high repetitions.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] A space and time efficient algorithm for constructing compressed suffix arrays
    Hon, Wing-Kai
    Lam, Tak-Wah
    Sadakane, Kunihiko
    Sung, Wing-Kin
    Yiu, Siu-Ming
    ALGORITHMICA, 2007, 48 (01) : 23 - 36
  • [32] An OTA Upgrade Differential Compression Algorithm Based on Suffix Array Induced Sorting and BsDiff Methods
    Zhang, Xinrong
    Wang, Ying
    Su, Yanzhao
    Zuo, Hengfeng
    Huang, Jin
    Kong, Linghe
    APPLIED SCIENCES-BASEL, 2024, 14 (02):
  • [33] Deterministic Sparse Suffix Sorting in the Restore Model
    Fischer, Johannes
    Tomohiro, I
    Koppl, Dominik
    ACM TRANSACTIONS ON ALGORITHMS, 2020, 16 (04)
  • [34] REMARK ON AN EFFICIENT ALGORITHM FOR SORTING WITH MINIMAL STORAGE
    GRIFFIN, R
    REDISH, KA
    COMMUNICATIONS OF THE ACM, 1970, 13 (01) : 54 - &
  • [35] A fast and efficient parallel sorting algorithm on LARPBS
    Chen, HJ
    Chen, YX
    Chen, L
    Li, T
    DCABES 2004, Proceedings, Vols, 1 and 2, 2004, : 393 - 397
  • [36] An efficient web traversal pattern mining algorithm based on suffix array
    Jing, T
    Zuo, WL
    Zhang, BZ
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1535 - 1539
  • [37] Efficient parallel algorithm for sorting on the biswapped network
    Department of Computer Science, South China University of Technology, Guangzhou 510641, China
    J. Comput. Inf. Syst., 2008, 4 (1365-1370): : 1365 - 1370
  • [38] An Efficient Parallel Sorting Algorithm on Metacube Multiprocessors
    Li, Yamin
    Peng, Shietung
    Chu, Wanming
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, PROCEEDINGS, 2009, 5574 : 372 - +
  • [39] An efficient sorting algorithm for a sequence of kings in a tournament
    Wu, J
    Sheng, L
    INFORMATION PROCESSING LETTERS, 2001, 79 (06) : 297 - 299
  • [40] A more efficient algorithm for perfect sorting by reversals
    Berard, Severine
    Chauve, Cedric
    Paul, Christophe
    INFORMATION PROCESSING LETTERS, 2008, 106 (03) : 90 - 95