Distributed processing of regular path queries in RDF graphs

被引:0
|
作者
Xintong Guo
Hong Gao
Zhaonian Zou
机构
[1] Harbin Institute of Technology,
来源
关键词
Knowledge graph; RDF/SPARQL; Regular path queries; Graph summarization; Graph partitioning;
D O I
暂无
中图分类号
学科分类号
摘要
SPARQL 1.1 offers a type of navigational query for RDF systems, called regular path query (RPQ). A regular path query allows for retrieving node pairs with the paths between them satisfying regular expressions. Regular path queries are always difficult to be evaluated efficiently because of the possible large search space. Thus there has been no scalable and practical solution so far. In this paper, we present Leon+, an in-memory distributed framework, to address the RPQ problem in the context of the knowledge graph. To reduce search space and mitigate mounting communication costs, Leon+ takes advantage of join-ahead pruning via a novel RDF summarization technique together with a path partitioning strategy. We also develop a subtle cost model to devise query plans to achieve high efficiency for complex RPQs. As there has been no available RPQ benchmark, we create micro-benchmarks on both synthetic and real-world datasets. A thorough experimental evaluation is presented between our approach and the state-of-the-art RDF stores. The results show that our approach outperforms 5x faster than the competitors on single RPQ. For query workload, it saves up to 1/2 time and 2/3 communication overheads over the baseline method.
引用
收藏
页码:993 / 1027
页数:34
相关论文
共 50 条
  • [21] Distributed multi-source regular path queries
    Shoaran, Maryam
    Thomo, Alex
    FRONTIERS OF HIGH PERFORMANCE COMPUTING AND NETWORKING - ISPA 2007 WORKSHOPS, 2007, 4743 : 365 - +
  • [22] The Complexity of Regular Trail and Simple Path Queries on Undirected Graphs
    Martens, Wim
    Popp, Tina
    PROCEEDINGS OF THE 41ST ACM SIGMOD-SIGACT-SIGAI SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS (PODS '22), 2022, : 165 - 174
  • [23] Fault-tolerant computation of distributed regular path queries
    Shoaran, Maryam
    Thomo, Alex
    THEORETICAL COMPUTER SCIENCE, 2009, 410 (01) : 62 - 77
  • [24] Top-k queries on RDF graphs
    Wang, Dong
    Zou, Lei
    Zhao, Dongyan
    INFORMATION SCIENCES, 2015, 316 : 201 - 217
  • [25] RDFPath: Path Query Processing on Large RDF Graphs with Map Reduce
    Przyjaciel-Zablocki, Martin
    Schaetzle, Alexander
    Hornung, Thomas
    Lausen, Georg
    SEMANTIC WEB: ESWC 2011 WORKSHOPS, 2012, 7117 : 50 - 64
  • [26] Relational Processing of RDF Queries: A Survey
    Sakr, Sherif
    Al-Naymat, Ghazi
    SIGMOD RECORD, 2009, 38 (04) : 23 - 28
  • [27] Parametric regular path queries
    Liu, YHA
    Rothamel, T
    Yu, FX
    Stoller, SD
    Hu, NJ
    ACM SIGPLAN NOTICES, 2004, 39 (06) : 219 - 230
  • [28] Efficient distributed path computation on RDF knowledge graphs using partial evaluation
    Qaiser Mehmood
    Muhammad Saleem
    Alokkumar Jha
    Mathieu d’Aquin
    World Wide Web, 2022, 25 : 1005 - 1036
  • [29] Temporal Regular Path Queries
    Arenas, Marcelo
    Bahamondes, Pedro
    Aghasadeghi, Amir
    Stoyanovich, Julia
    2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 2412 - 2425
  • [30] Regular path queries with constraints
    Abiteboul, S
    Vianu, V
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1999, 58 (03) : 428 - 452