Space/time-efficient RDF stores based on circular suffix sorting

Authors
Nieves R. Brisaboa
Ana Cerdeira-Pena
Guillermo de Bernardo
Antonio Fariña
Gonzalo Navarro
Affiliations
[1] University of A Coruña, Department of Computer Science and Information Technology
[2] CITIC Research Center, IMFD, Department of Computer Science
[3] University of Chile
Source
Journal of Supercomputing, 2023, 79 (05)
Keywords
Compact data structures; RDF; CSA; Web of Data
Abstract
The Resource Description Framework (RDF) has gained popularity as a format for the standardized publication and exchange of information in the Web of Data. In this paper, we introduce RDFCSA, a compressed representation of RDF datasets that also supports efficient querying. RDFCSA regards the triples of the RDF store as short circular strings and applies suffix sorting to those strings, so that triple-pattern queries reduce to prefix searches on the string set. The RDF store is then represented compactly using a compressed suffix array (CSA), a proven technology in text indexing that efficiently supports prefix searches. Our experiments show that RDFCSA is competitive with state-of-the-art alternatives. It compresses the raw data to 60% of its size, close to the most compact alternatives. While most alternatives perform better on some kinds of triple patterns than on others, RDFCSA features fast and consistent query times, a few microseconds per result in all cases. This enables efficient support for join queries using either merge-join or chaining-join strategies over the triple patterns, coupled with specific optimizations such as variable filling. Our experiments on binary joins show that RDFCSA is faster than the alternatives in most cases.
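The reduction from triple patterns to prefix searches described in the abstract can be illustrated with a small uncompressed sketch. This is not the RDFCSA structure itself: it stores three plain sorted lists of rotations instead of a compressed suffix array, and it assumes that subjects, predicates, and objects are already mapped to integer IDs. The function names `build_index` and `match` are hypothetical and chosen only for this illustration.

```python
from bisect import bisect_left, bisect_right
from math import inf

def build_index(triples):
    """Sort the three circular rotations of every (s, p, o) triple.

    index[start] holds each triple read circularly from position `start`:
    0 -> (s, p, o), 1 -> (p, o, s), 2 -> (o, s, p).
    Assumption: all IDs are integers.
    """
    return {
        start: sorted(tuple(t[(start + k) % 3] for k in range(3)) for t in triples)
        for start in range(3)
    }

def match(index, pattern):
    """Return the triples matching `pattern`; None marks a variable."""
    bound = [x is not None for x in pattern]
    n_bound = sum(bound)
    # In a circular string of length 3, the bound positions are always
    # contiguous, so some rotation places all of them first.
    start = next(
        s for s in range(3) if all(bound[(s + k) % 3] for k in range(n_bound))
    )
    prefix = tuple(pattern[(start + k) % 3] for k in range(n_bound))
    rows = index[start]
    # Prefix search: binary search for the range of rotations starting with `prefix`.
    lo = bisect_left(rows, prefix)
    hi = bisect_right(rows, prefix + (inf,) * (3 - n_bound))
    # Rotate each hit back into (s, p, o) order.
    return [tuple(r[(j - start) % 3] for j in range(3)) for r in rows[lo:hi]]

triples = [(1, 10, 2), (1, 11, 3), (2, 10, 3), (3, 10, 1)]
index = build_index(triples)
print(match(index, (1, None, None)))   # [(1, 10, 2), (1, 11, 3)]
print(match(index, (None, 10, 3)))     # [(2, 10, 3)]
print(match(index, (1, None, 3)))      # [(1, 11, 3)]
```

Every pattern, whichever positions are bound, is answered by one binary search over one sorted list, which is the property that makes query times uniform across pattern types in this sketch. The space savings reported in the abstract come from the compressed suffix array, which this example does not attempt to model.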
Pages: 5643 - 5683
Page count: 40
Related papers
50 in total
  • [1] Space/time-efficient RDF stores based on circular suffix sorting
    Brisaboa, Nieves R.
    Cerdeira-Pena, Ana
    de Bernardo, Guillermo
    Fariña, Antonio
    Navarro, Gonzalo
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (05) : 5643 - 5683
  • [2] TIME-EFFICIENT AND SPACE-EFFICIENT RANDOMIZED CONSENSUS
    ASPNES, J
    JOURNAL OF ALGORITHMS, 1993, 14 (03) : 414 - 431
  • [3] Space-Efficient Construction Algorithm for the Circular Suffix Tree
    Hon, Wing-Kai
    Ku, Tsung-Han
    Shah, Rahul
    Thankachan, Sharma V.
    COMBINATORIAL PATTERN MATCHING, 2013, 7922 : 142 - 152
  • [4] Space-Efficient Construction Algorithm for the Circular Suffix Tree
    Hon, Wing-Kai
    Ku, Tsung-Han
    Shah, Rahul
    Thankachan, Sharma V.
    2013 DATA COMPRESSION CONFERENCE (DCC), 2013 : 496 - 496
  • [5] TIME-EFFICIENT STATE-SPACE SEARCH
    REINEFELD, A
    RIDINGER, P
    ARTIFICIAL INTELLIGENCE, 1994, 71 (02) : 397 - 408
  • [6] Space- and Time-Efficient Polynomial Multiplication
    Roche, Daniel S.
    ISSAC2009: PROCEEDINGS OF THE 2009 INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND ALGEBRAIC COMPUTATION, 2009 : 295 - 301
  • [7] TIME-EFFICIENT AND SPACE-EFFICIENT GARBAGE COMPACTION ALGORITHM
    MORRIS, FL
    COMMUNICATIONS OF THE ACM, 1978, 21 (08) : 662 - 665
  • [8] Space efficient linear time construction of suffix arrays
    Ko, P
    Aluru, S
    COMBINATORIAL PATTERN MATCHING, PROCEEDINGS, 2003, 2676 : 200 - 210
  • [9] Space efficient linear time construction of suffix arrays
    Ko, Pang
    Aluru, Srinivas
    JOURNAL OF DISCRETE ALGORITHMS, 2005, 3 (2-4) : 143 - 156
  • [10] A Space and Time Efficient Algorithm for Constructing Compressed Suffix Arrays
    Wing-Kai Hon
    Tak-Wah Lam
    Kunihiko Sadakane
    Wing-Kin Sung
    Siu-Ming Yiu
    Algorithmica, 2007, 48 : 23 - 36