Optimal Substring Equality Queries with Applications to Sparse Text Indexing

Cited by: 3
Authors
Prezza, Nicola [1, 2]
Affiliations
[1] LUISS Guido Carli, Viale Romania 32, IT-00197 Rome, Italy
[2] Ca' Foscari University of Venice, Venice, Italy
Keywords
Substring equality queries; in-place; suffix sorting; lower bounds; suffix; construction
DOI
10.1145/3426870
Chinese Library Classification (CLC) number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
We consider the problem of encoding a string of length n from an integer alphabet of size sigma so that access, substring equality, and Longest Common Extension (LCE) queries can be answered efficiently. We describe a new space-optimal data structure supporting logarithmic-time queries. Access and substring equality query times can furthermore be improved to the optimal O(1) if O(log n) additional precomputed words are allowed in the total space. Additionally, we provide in-place algorithms for converting between the string and our data structure. Using this new string representation, we obtain the first in-place subquadratic algorithms for several string-processing problems in the restore model: the input string is rewritable and must be restored before the computation terminates. In particular, we describe the first in-place subquadratic Monte Carlo solutions to the sparse suffix sorting, sparse LCP array construction, and suffix selection problems. With the sole exception of suffix selection, our algorithms are also the first running in sublinear time for small enough sets of input suffixes. Combining these solutions, we obtain the first sublinear-time Monte Carlo algorithm for building the sparse suffix tree in compact space. We also show how to build a correct version of our data structure using small working space. This leads to the first Las Vegas in-place algorithm computing the full LCP array in O(n log n) time w.h.p. and to the first Las Vegas in-place algorithms solving the sparse suffix sorting and sparse LCP array construction problems in O(n^1.5 sqrt(log sigma)) time w.h.p.
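To make the three query types concrete, here is a minimal Python sketch of substring equality and LCE queries built on polynomial (Karp-Rabin style) prefix fingerprints. This is an assumption made for illustration: the abstract does not say which technique the data structure uses, and this table takes O(n) extra words, so it is neither space-optimal nor in-place. The class name `PrefixFingerprints`, its methods, and the modulus are hypothetical choices. Answers are Monte Carlo: two different substrings can collide with small probability.

```python
import random

MOD = (1 << 61) - 1  # Mersenne prime used as the fingerprint modulus (illustrative choice)


class PrefixFingerprints:
    """Plain prefix-hash table using O(n) extra words.

    Hypothetical helper for illustration; NOT the space-optimal encoding
    described in the abstract.
    """

    def __init__(self, text, base=None):
        self.n = len(text)
        # A random base keeps the collision probability small for any fixed input.
        self.base = base if base is not None else random.randrange(256, MOD - 1)
        self.prefix = [0] * (self.n + 1)  # prefix[i] = fingerprint of text[:i]
        self.power = [1] * (self.n + 1)   # power[i] = base**i mod MOD
        for i, ch in enumerate(text):
            self.prefix[i + 1] = (self.prefix[i] * self.base + ord(ch) + 1) % MOD
            self.power[i + 1] = (self.power[i] * self.base) % MOD

    def fingerprint(self, i, length):
        """Fingerprint of text[i : i + length], computed in O(1)."""
        return (self.prefix[i + length] - self.prefix[i] * self.power[length]) % MOD

    def equal(self, i, j, length):
        """Substring equality query: O(1) time, may err only on a collision."""
        return self.fingerprint(i, length) == self.fingerprint(j, length)

    def lce(self, i, j):
        """Longest common extension of suffixes i and j via binary search: O(log n)."""
        lo, hi = 0, self.n - max(i, j)
        while lo < hi:
            mid = (lo + hi + 1) // 2
            if self.equal(i, j, mid):
                lo = mid       # a common prefix of length mid exists (w.h.p.)
            else:
                hi = mid - 1   # the fingerprints differ, so the strings differ
        return lo
```

For example, with `fp = PrefixFingerprints("abracadabra")`, the call `fp.equal(0, 7, 4)` compares the two occurrences of "abra", and `fp.lce(0, 7)` binary-searches for the length of their longest common prefix. The binary search is valid because equality of two prefixes of length m implies equality of all shorter prefixes.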
Pages: 23