Ganite: A distributed engine for scalable path queries over temporal property graphs

被引:3
|
作者
Ramesh, Shriram [1 ]
Baranawal, Animesh [1 ]
Simmhan, Yogesh [1 ]
机构
[1] Indian Inst Sci, Dept Computat & Data Sci, Bangalore 560012, Karnataka, India
关键词
Graph processing; Temporal graphs; Distributed scheduling; Big data platforms; Query planning;
D O I
10.1016/j.jpdc.2021.02.004
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Property graphs are a common form of linked data, with path queries used to traverse and explore them for enterprise transactions and mining. Temporal property graphs are a recent variant where time is a first-class entity to be queried over, and their properties and structure vary over time. These are seen in social, telecom, transit and epidemic networks. However, current graph databases and query engines have limited support for temporal relations among graph entities, no support for time varying entities and/or do not scale on distributed resources. We address this gap by extending a linear path query model over property graphs to include intuitive temporal predicates and aggregation operators over temporal graphs. We design a distributed execution model for these temporal path queries using the interval-centric computing model, and develop a novel cost model to select an efficient execution plan from several. We perform detailed experiments of our granite distributed query engine using both static and dynamic temporal property graphs as large as 52M vertices, 218M edges and 325M properties, and a 1600-query workload, derived from the LDBC benchmark. We frequently offer sub-second query latencies on a commodity cluster, which is 149x-1140x faster compared to industry-leading Neo4J shared-memory graph database and the JanusGraph/Spark distributed graph query engine. granite also completes 100% of the queries for all graphs, compared to only 32-92% workload completion by the baseline systems. Further, our cost model selects a query plan that is within 10% of the optimal execution time in 90% of the cases. Despite the irregular nature of graph processing, we exhibit a weak-scaling efficiency of >= 60% on 8 nodes and >= 40% on 16 nodes, for most query workloads. (C) 2021 Elsevier Inc. All rights reserved.
引用
收藏
页码:94 / 111
页数:18
相关论文
共 35 条
  • [1] A Distributed Path Query Engine for Temporal Property Graphs
    Ramesh, Shriram
    Baranawal, Animesh
    Simmhan, Yogesh
    2020 20TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2020), 2020, : 499 - 508
  • [2] Distributed processing of regular path queries in RDF graphs
    Xintong Guo
    Hong Gao
    Zhaonian Zou
    Knowledge and Information Systems, 2021, 63 : 993 - 1027
  • [3] Distributed processing of regular path queries in RDF graphs
    Guo, Xintong
    Gao, Hong
    Zou, Zhaonian
    KNOWLEDGE AND INFORMATION SYSTEMS, 2021, 63 (04) : 993 - 1027
  • [4] The μ-RA System for Recursive Path Queries over Graphs
    Fejza, Amela
    Geneves, Pierre
    Layaida, Nabil
    Chlyah, Sarah
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 5041 - 5045
  • [5] Processing SPARQL queries over distributed RDF graphs
    Peng Peng
    Lei Zou
    M. Tamer Özsu
    Lei Chen
    Dongyan Zhao
    The VLDB Journal, 2016, 25 : 243 - 268
  • [6] Processing SPARQL queries over distributed RDF graphs
    Peng, Peng
    Zou, Lei
    Ozsu, M. Tamer
    Chen, Lei
    Zhao, Dongyan
    VLDB JOURNAL, 2016, 25 (02): : 243 - 268
  • [7] Reachability and Time-Based Path Queries in Temporal Graphs
    Wu, Huanhuan
    Huang, Yuzhen
    Cheng, James
    Li, Jinfeng
    Ke, Yiping
    2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 145 - 156
  • [8] Efficient algorithms for reachability and path queries on temporal bipartite graphs
    Wang, Kai
    Cai, Minghao
    Chen, Xiaoshuang
    Lin, Xuemin
    Zhang, Wenjie
    Qin, Lu
    Zhang, Ying
    VLDB JOURNAL, 2024, 33 (05): : 1399 - 1426
  • [9] Efficiently Answering Reachability and Path Queries on Temporal Bipartite Graphs
    Chen, Xiaoshuang
    Wang, Kai
    Lin, Xuemin
    Zhang, Wenjie
    Qin, Lu
    Zhang, Ying
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2021, 14 (10): : 1845 - 1858
  • [10] Answering Property Path Queries over Federated RDF Systems
    Ge, Ningchao
    Peng, Peng
    Wu, Jibing
    Liu, Lihua
    Chen, Haiwen
    Wang, Tengyun
    WEB AND BIG DATA, PT III, APWEB-WAIM 2023, 2024, 14333 : 16 - 31