Distributed Efficient Provenance-Aware Regular Path Queries on Large RDF Graphs

被引:4
|
作者
Xin, Yueqi [1 ,2 ]
Wang, Xin [1 ,2 ]
Jin, Di [1 ,2 ]
Wang, Simiao [1 ,2 ]
机构
[1] Tianjin Univ, Sch Comp Sci & Technol, Tianjin, Peoples R China
[2] Tianjin Key Lab Cognit Comp & Applicat, Tianjin, Peoples R China
基金
中国国家自然科学基金;
关键词
Regular path query; Provenance-aware; RDF graph Pregel;
D O I
10.1007/978-3-319-91452-7_49
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the proliferation of knowledge graphs, massive RDF graphs have been published on the Web. As an essential type of queries for RDF graphs, Regular Path Queries (RPQs) have been attracting increasing research efforts. However, the existing query processing approaches mainly focus on the standard semantics of RPQs, which cannot provide provenance of the answer sets. We propose dProvRPQ that is a distributed approach to evaluating provenance-aware RPQs over big RDF graphs. Our Pregel-based method employs Glushkov automata to keep track of matching processes of RPQs in parallel. Meanwhile, four optimization strategies are devised, including edge filtering, candidate states, message compression, and message selection, which can reduce the intermediate results of the basic dProvRPQ algorithm dramatically and overcome the counting-paths problem to some extent. The proposed algorithms are verified by extensive experiments on both synthetic and real-world datasets, which show that our approach can efficiently answer the provenance-aware RPQs over large RDF graphs.
引用
收藏
页码:766 / 782
页数:17
相关论文
共 50 条
  • [1] ProvRPQ: An Interactive Tool for Provenance-Aware Regular Path Queries on RDF Graphs
    Wang, Xin
    Wang, Junhu
    DATABASES THEORY AND APPLICATIONS, (ADC 2016), 2016, 9877 : 480 - 484
  • [2] Answering Provenance-Aware Regular Path Queries on RDF Graphs Using an Automata-Based Algorithm
    Wang, Xin
    Ling, Jun
    Wang, Junhu
    Wang, Kewen
    Feng, Zhiyong
    WWW'14 COMPANION: PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2014, : 395 - 396
  • [3] Distributed Pregel-based provenance-aware regular path query processing on RDF knowledge graphs
    Wang, Xin
    Wang, Simiao
    Xin, Yueqi
    Yang, Yajun
    Li, Jianxin
    Wang, Xiaofei
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2020, 23 (03): : 1465 - 1496
  • [4] Distributed Pregel-based provenance-aware regular path query processing on RDF knowledge graphs
    Xin Wang
    Simiao Wang
    Yueqi Xin
    Yajun Yang
    Jianxin Li
    Xiaofei Wang
    World Wide Web, 2020, 23 : 1465 - 1496
  • [5] Distributed processing of regular path queries in RDF graphs
    Guo, Xintong
    Gao, Hong
    Zou, Zhaonian
    KNOWLEDGE AND INFORMATION SYSTEMS, 2021, 63 (04) : 993 - 1027
  • [6] Distributed processing of regular path queries in RDF graphs
    Xintong Guo
    Hong Gao
    Zhaonian Zou
    Knowledge and Information Systems, 2021, 63 : 993 - 1027
  • [7] Efficient Distributed Regular Path Queries on RDF Graphs Using Partial Evaluation
    Wang, Xin
    Wang, Junhu
    Zhang, Xiaowang
    CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 1933 - 1936
  • [8] P3RPQ: Pregel-Based Parallel Provenance-Aware Regular Path Query Processing on Large RDF Graphs
    Xin, Yueqi
    Zhang, Bingyi
    Wang, Xin
    Xu, Qiang
    Feng, Zhiyong
    COMPANION PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2018 (WWW 2018), 2018, : 19 - 20
  • [9] Answering Provenance-Aware Queries on RDF Data Cubes Under Memory Budgets
    Galarraga, Luis
    Ahlstrom, Kim
    Hose, Katja
    Pedersen, Torben Bach
    SEMANTIC WEB - ISWC 2018, PT I, 2018, 11136 : 547 - 565
  • [10] Regular Path Queries on Large Graphs
    Koschmieder, Andre
    Leser, Ulf
    SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, SSDBM 2012, 2012, 7338 : 177 - 194