Space/time-efficient RDF stores based on circular suffix sorting

Authors
Nieves R. Brisaboa
Ana Cerdeira-Pena
Guillermo de Bernardo
Antonio Fariña
Gonzalo Navarro
Affiliations
[1] University of A Coruña, Department of Computer Science and Information Technology
[2] CITIC Research Center, IMFD, Department of Computer Science
[3] University of Chile
Keywords
Compact data structures; RDF; CSA; Web of Data
Abstract
The Resource Description Framework (RDF) has gained popularity as a format for the standardized publication and exchange of information in the Web of Data. In this paper, we introduce RDFCSA, a compressed representation of RDF datasets that also supports efficient querying. RDFCSA regards the triples of the RDF store as short circular strings and applies suffix sorting to those strings, so that triple-pattern queries reduce to prefix searches on the string set. The RDF store is then represented compactly using a compressed suffix array (CSA), a proven technology in text indexing that efficiently supports prefix searches. Our experiments show that RDFCSA is competitive with state-of-the-art alternatives. It compresses the raw data to 60% of its size, close to the most compact alternatives. While most alternatives perform better on some kinds of triple patterns than on others, RDFCSA features fast and consistent query times, a few microseconds per result in all cases. This makes it possible to support join queries efficiently, using either merge-join or chaining-join strategies over the triple patterns, coupled with specific optimizations such as variable filling. Our experiments on binary joins show that RDFCSA is faster than the alternatives in most cases.
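The reduction from triple patterns to prefix searches can be illustrated with a small sketch. It is not the RDFCSA implementation: it assumes triples are already dictionary-encoded as integer IDs, and it substitutes a plain sorted Python list with binary search for the compressed suffix array. The names `build_index`, `match`, and `TRIPLES` are hypothetical and exist only for this illustration.

```python
from bisect import bisect_left

def build_index(triples):
    """Sort every circular rotation of every (s, p, o) triple of integer IDs.

    Each entry is (start, rotated), where start is the position (0=s, 1=p, 2=o)
    of the component that the rotation begins with. Keeping start in the key
    separates the three groups of rotations from one another.
    """
    rotations = []
    for t in triples:
        for start in range(3):
            rotated = tuple(t[(start + k) % 3] for k in range(3))
            rotations.append((start, rotated))
    rotations.sort()
    return rotations

def match(index, s=None, p=None, o=None):
    """Answer a triple pattern (None = variable) with one prefix search."""
    pattern = (s, p, o)
    bound = [i for i in range(3) if pattern[i] is not None]
    n = len(bound)
    if n == 0:
        # No bound component: return every triple via its s-rotation.
        lo_key, hi_key = (0,), (1,)
    else:
        # On a circular string the bound components are always contiguous,
        # so some rotation has all of them as a prefix.
        start = next(i for i in bound
                     if all(pattern[(i + k) % 3] is not None for k in range(n)))
        prefix = tuple(pattern[(start + k) % 3] for k in range(n))
        lo_key = (start, prefix)
        # Assumes integer IDs: the smallest key that is larger than any
        # rotation carrying this prefix.
        hi_key = (start, prefix[:-1] + (prefix[-1] + 1,))
    lo = bisect_left(index, lo_key)
    hi = bisect_left(index, hi_key)
    # Undo the rotation to report triples in (s, p, o) order.
    return [tuple(rot[(j - st) % 3] for j in range(3)) for st, rot in index[lo:hi]]

# Toy dataset, invented for this example.
TRIPLES = [(1, 10, 2), (1, 11, 3), (2, 10, 3), (3, 10, 1)]
INDEX = build_index(TRIPLES)

print(match(INDEX, s=1))          # pattern (s, ?, ?)
print(match(INDEX, p=10, o=3))    # pattern (?, p, o)
print(match(INDEX, s=1, o=3))     # pattern (s, ?, o), searched as prefix (o, s)
```

Because the strings are circular, even the pattern (s, ?, o) becomes a prefix search, here on the rotation that starts at the object. In this sketch every pattern type costs two binary searches plus the size of the result, which matches the intuition behind the consistent query times described in the abstract, although the space and time behavior of a real CSA is different.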
Pages: 5643-5683
Page count: 40