qEndpoint: A novel triple store architecture for large RDF graphs

被引:0
|
作者
Willerval, Antoine [1 ,2 ]
Diefenbach, Dennis [1 ]
Bonifati, Angela [2 ]
机构
[1] QA Co, St Etienne, France
[2] Lyon 1 Univ, CNRS, Liris, IUF, Villeurbanne, France
关键词
RDF; qEndpoint; HDT; RDF4J; Wikidata; ENGINE;
D O I
10.3233/SW-243616
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the relational database realm, there has been a shift towards novel hybrid database architectures combining the properties of transaction processing (OLTP) and analytical processing (OLAP). OLTP workloads are made up by read and write operations on a small number of rows and are typically addressed by indexes such as B+trees. On the other side, OLAP workloads consists of big read operations that scan larger parts of the dataset. To address both workloads some databases introduced an architecture using a buffer or delta partition. Precisely, changes are accumulated in a write-optimized delta partition while the rest of the data is compressed in the read- optimized main partition. Periodically, the delta storage is merged in the main partition. In this paper we investigate for the first time how this architecture can be implemented and behaves for RDF graphs. We describe in detail the indexing-structures one can use for each partition, the merge process as well as the transactional management. We study the performances of our triple store, which we call qEndpoint, over two popular benchmarks, the Berlin SPARQL Benchmark (BSBM) and the recent Wikidata Benchmark (WDBench). We are also studying how it compares against other public Wikidata endpoints. This allows us to study the behavior of the triple store for different workloads, as well as the scalability over large RDF graphs. The results show that, compared to the baselines, our triple store allows for improved indexing times, better response time for some queries, higher insert and delete rates, and low disk and memory footprints, making it ideal to store and serve large Knowledge Graphs.
引用
收藏
页码:2069 / 2087
页数:19
相关论文
共 50 条
  • [31] Novel Node Importance Measures to Improve Keyword Search over RDF Graphs
    Menendez, Elisa S.
    Casanova, Marco A.
    Leme, Luiz A. P. Paes
    Boughanem, Mohand
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PT II, 2019, 11707 : 143 - 158
  • [32] Heuristics-Based Query Processing for Large RDF Graphs Using Cloud Computing
    Husain, Mohammad Farhan
    McGlothlin, James
    Masud, Mohammad Mehedy
    Khan, Latifur R.
    Thuraisingham, Bhavani
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (09) : 1312 - 1327
  • [33] Distributed Efficient Provenance-Aware Regular Path Queries on Large RDF Graphs
    Xin, Yueqi
    Wang, Xin
    Jin, Di
    Wang, Simiao
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2018, PT I, 2018, 10827 : 766 - 782
  • [34] LiteMat: a scalable, cost-efficient inference encoding scheme for large RDF graphs
    Cure, Olivier
    Naacke, Hubert
    Randriamalala, Tendry
    Amann, Bernd
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 1823 - 1830
  • [35] TraPath: Fast Regular Path Query Evaluation on Large-Scale RDF Graphs
    Wang, Xin
    Rao, Guozheng
    Jiang, Longxiang
    Lyu, Xuedong
    Yang, Yajun
    Feng, Zhiyong
    WEB-AGE INFORMATION MANAGEMENT, WAIM 2014, 2014, 8485 : 372 - 383
  • [36] PDSM: Pregel-Based Distributed Subgraph Matching on Large Scale RDF Graphs
    Xu, Qiang
    Wang, Xin
    Xin, Yueqi
    Feng, Zhiyong
    Chen, Renhai
    COMPANION PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2018 (WWW 2018), 2018, : 17 - 18
  • [37] Navigating large clustered graphs with triple-layer display
    Huang, Mao Lin
    Nguyen, Quang Vinh
    11TH INTERNATIONAL CONFERENCE INFORMATION VISUALIZATION, 2007, : 684 - +
  • [38] FFD-Index: An Efficient Indexing Scheme for Star Subgraph Matching on Large RDF Graphs
    Lyu, Xuedong
    Wang, Xin
    Li, Yuan-Fang
    Feng, Zhiyong
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2015, 2015, 9052 : 240 - 245
  • [39] A Memory Centric Architecture of the Link Assessment Algorithm in Large Graphs
    Brugger, Christian
    Grigorovici, Valentin
    Jung, Matthias
    De Schryver, Christian
    Weis, Christian
    Wehn, Norbert
    Zweig, Katharina Anna
    IEEE DESIGN & TEST, 2018, 35 (01) : 7 - 15
  • [40] Vadalog: A modern architecture for automated reasoning with large knowledge graphs
    Bellomarini, Luigi
    Benedetto, Davide
    Gottlob, Georg
    Sallinger, Emanuel
    INFORMATION SYSTEMS, 2022, 105