An Efficient Algorithm for Suffix Sorting

被引:0
|
作者
Peng, Zhan [1 ]
Wang, Yuping [1 ]
Xue, Xingsi [2 ]
Wei, Jingxuan [1 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian 710071, Shaanxi, Peoples R China
[2] Fujian Univ Technol, Sch Informat Sci & Engn, Fuzhou 350118, Fujian, Peoples R China
基金
中国国家自然科学基金;
关键词
Suffix sorting; suffix array; text index; computation biology; CONSTRUCTION; ARRAYS;
D O I
10.1142/S0218001416590187
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Sufix Array (SA) is a fundamental data structure which is widely used in the applications such as string matching, text index and computation biology, etc. How to sort the suffixes of a string in lexicographical order is a primary problem in constructing SAs, and one of the widely used suffix sorting algorithms is qsufsort. However, qsufsort suffers one critical limitation that the order of suffixes starting with the same 2(k) characters cannot be determined in the kth round. To this point, in our paper, an efficient suffix sorting algorithm called dsufsort is proposed by overcoming the drawback of the qsufsort algorithm. In particular, our proposal maintains the depth of each unsorted portion of SA, and sorts the suffixes based on the depth in each round. By this means, some suffixes that cannot be sorted by qsufsort in each round can be sorted now, as a result, more sorting results in current round can be utilized by the latter rounds and the total number of sorting rounds will be reduced, which means dsufsort is more efficient than qsufsort. The experimental results show the effectiveness of the proposed algorithm, especially for the text with high repetitions.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Optimal In-Place Suffix Sorting
    Li, Zhize
    Li, Jian
    Huo, Hongwei
    2018 DATA COMPRESSION CONFERENCE (DCC 2018), 2018, : 422 - 422
  • [22] AN EFFICIENT ALGORITHM FOR SORTING WITH MINIMAL STORAGE
    SINGLETON, RC
    COMMUNICATIONS OF THE ACM, 1969, 12 (03) : 185 - +
  • [23] A Space and Time Efficient Algorithm for Constructing Compressed Suffix Arrays
    Wing-Kai Hon
    Tak-Wah Lam
    Kunihiko Sadakane
    Wing-Kin Sung
    Siu-Ming Yiu
    Algorithmica, 2007, 48 : 23 - 36
  • [24] Suffix sorting via matching statistics
    Zsuzsanna Lipták
    Francesco Masillo
    Simon J. Puglisi
    Algorithms for Molecular Biology, 19
  • [25] Optimal in-place suffix sorting
    Li, Zhize
    Li, Jian
    Huo, Hongwei
    INFORMATION AND COMPUTATION, 2022, 285
  • [26] In-Place Sparse Suffix Sorting
    Prezza, Nicola
    SODA'18: PROCEEDINGS OF THE TWENTY-NINTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2018, : 1496 - 1508
  • [28] The Adaptive Suffix Tree: A Space Efficient Sequence Learning Algorithm
    Gunasinghe, Upuli
    Alahakoon, Damminda
    2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [29] Space-Efficient Construction Algorithm for the Circular Suffix Tree
    Hon, Wing-Kai
    Ku, Tsung-Han
    Shah, Rahul
    Thankachan, Sharma V.
    COMBINATORIAL PATTERN MATCHING, 2013, 7922 : 142 - 152
  • [30] Space-Efficient Construction Algorithm for the Circular Suffix Tree
    Hon, Wing-Kai
    Ku, Tsung-Han
    Shah, Rahul
    Thankachan, Sharma V.
    2013 DATA COMPRESSION CONFERENCE (DCC), 2013, : 496 - 496