Dynamic rank/select structures with applications to run-length encoded texts

被引:8
|
作者
Lee, Sunho [1 ]
Park, Kunsoo [1 ]
机构
[1] Seoul Natl Univ, Sch Engn & Comp Sci, Seoul 151742, South Korea
关键词
Succinct data structures; Dynamic rank/select structures; Full-text index; Run-length encoding;
D O I
10.1016/j.tcs.2009.07.021
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Given an n-length text over a sigma-size alphabet, we propose a framework for dynamic rank/select structures on the text and some of its applications. For a small alphabet with a <= log n, we propose a two-level structure consisting of a counting scheme and a storing scheme that supports O(log n) worst-case time rank/select operations and O(log n) amortized time insert/delete operations. For a large alphabet with log n < sigma <= n, we extend it to obtain O((1+ log sigma/log log n) worst-case time rank/select and O((1 + log sigma/log log n) log n) amortized time insert/delete. Our structure provides a simple representation of an index for a collection of texts. In addition, we present rank/select structures on run-length encoding (RLE) of a text. For the n'-length RLE of an n-length text, our static version provides O(1) time select and O(log log sigma) time rank using n' log sigma + O(n) bits and our dynamic version gives O((1 + log sigma/log log n) log n) time operations in n' log sigma + o(n' log sigma ) + O(n) bits. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:4402 / 4413
页数:12
相关论文
共 50 条
  • [41] Algorithms for Jumbled Indexing, Jumbled Border and Jumbled Square on Run-Length Encoded Strings
    Amir, Amihood
    Apostolico, Alberto
    Hirst, Tirza
    Landau, Gad M.
    Lewenstein, Noa
    Rozenberg, Liat
    STRING PROCESSING AND INFORMATION RETRIEVAL, SPIRE 2014, 2014, 8799 : 45 - 51
  • [42] A fast and simple algorithm for computing the longest common subsequence of run-length encoded strings
    Ann, Hsing-Yen
    Yang, Chang-Biau
    Tseng, Chiou-Ting
    Hor, Chiou-Yi
    INFORMATION PROCESSING LETTERS, 2008, 108 (06) : 360 - 364
  • [43] Algorithms for Jumbled Indexing, Jumbled Border and Jumbled Square on run-length encoded strings
    Amir, Amihood
    Apostolico, Alberto
    Hirst, Tirza
    Landau, Gad M.
    Lewenstein, Noa
    Rozenberg, Liat
    THEORETICAL COMPUTER SCIENCE, 2016, 656 : 146 - 159
  • [44] Regular Abelian Periods and Longest Common Abelian Factors on Run-Length Encoded Strings
    Grabowski, Szymon
    STRING PROCESSING AND INFORMATION RETRIEVAL (SPIRE 2017), 2017, 10508 : 208 - 213
  • [45] Heuristics for the run-length encoded Burrows-Wheeler transform alphabet ordering problem
    Major, Lily
    Clare, Amanda
    Daykin, Jacqueline W.
    Mora, Benjamin
    Zarges, Christine
    JOURNAL OF HEURISTICS, 2025, 31 (01)
  • [46] Rank/select on dynamic compressed sequences and applications
    Gonzalez, Rodrigo
    Navarro, Gonzalo
    THEORETICAL COMPUTER SCIENCE, 2009, 410 (43) : 4414 - 4422
  • [47] Line Detection in Run-Length Encoded Document Images using Monotonically Increasing Graph Model
    Amarnath, R.
    Nagabhushan, P.
    Javed, Mohammed
    2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2018, : 644 - 649
  • [48] SECOND LEVEL ERROR-DETECTION SYSTEM FOR RUN-LENGTH ENCODED NONCODED INFORMATION DATA.
    Garcia, J.E.
    1600, (16):
  • [49] Dynamic model of mastering for multilevel run-length limited read-only disc
    Yuan, Haibo
    Xu, Duanyi
    Zhang, Qicheng
    Song, Jie
    OPTICS EXPRESS, 2007, 15 (07) : 4176 - 4181
  • [50] Fast and space-efficient location of heavy or dense segments in run-length encoded sequences - (Extended abstract)
    Greenberg, RI
    COMPUTING AND COMBINATORICS, PROCEEDINGS, 2003, 2697 : 528 - 536