PathQuery Pregel: high-performance graph query with bulk synchronous processing

被引:0
|
作者
Bogdan Arsintescu
Shardul Deo
Warren Harris
机构
[1] Google Inc,
来源
关键词
Distributed graph compute ; Pregel; Graph query; Bulk synchronous parallel computing; Graph database;
D O I
暂无
中图分类号
学科分类号
摘要
High-performance graph query systems are a scalable way to mine information in Knowledge Graphs, especially when the queries benefit from a high-level expressive query language. This paper presents techniques to algorithmically compile queries expressed in a high-level language (e.g., Datalog) into a directed acyclic graph query plan and details how these queries can be run on a Pregel graph vertex-centric compute system. Our solution, called PathQuery Pregel, creates plans for any conjunctive or disjunctive queries with aggregation and negation; we describe how the query execution extracts graph results optimally while avoiding many join operations where parallel map execution is permitted. We provide details of how we scaled this system out to execute large set of queries in parallel over the Google Knowledge Graph, a graph of 70B edges, or facts; we describe our production experience with PathQuery Pregel.
引用
收藏
页码:1493 / 1504
页数:11
相关论文
共 50 条
  • [1] PathQuery Pregel: high-performance graph query with bulk synchronous processing
    Arsintescu, Bogdan
    Deo, Shardul
    Harris, Warren
    PATTERN ANALYSIS AND APPLICATIONS, 2020, 23 (03) : 1493 - 1504
  • [2] PARALLELISM FOR HIGH-PERFORMANCE QUERY-PROCESSING
    WINTERS, VG
    LECTURE NOTES IN COMPUTER SCIENCE, 1992, 580 : 344 - 356
  • [3] On the Impact of Memory Allocation on High-Performance Query Processing
    Durner, Dominik
    Leis, Viktor
    Neumann, Thomas
    15TH INTERNATIONAL WORKSHOP ON DATA MANAGEMENT ON NEW HARDWARE (DAMON 2019), 2019,
  • [4] GraQL: A Query Language for High-Performance Attributed Graph Databases
    Chavarria-Miranda, Daniel
    Castellana, Vito Giovanni
    Morari, Alessandro
    Haglin, David
    Feo, John
    2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2016, : 1453 - 1462
  • [5] Bulk processing of high-performance nanocrystalline intermetallics
    Shang, CH
    Cammarata, RC
    Van Heerden, C
    Weihs, TP
    Chien, CL
    JOURNAL OF MATERIALS RESEARCH, 2003, 18 (09) : 2017 - 2020
  • [6] Bulk processing of high-performance nanocrystalline intermetallics
    C. H. Shang
    R. C. Cammarata
    C. Van Heerden
    T. P. Weihs
    C. L. Chien
    Journal of Materials Research, 2003, 18 : 2017 - 2020
  • [7] High-Performance Graph Storage and Mutation for Graph Processing and Streaming: A Review
    Firmli, Soukaina
    Chiadmi, Dalila
    INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2025, 16 (01) : 19 - 30
  • [8] Gunrock: A High-Performance Graph Processing Library on the GPU
    Wang, Yangzihao
    Davidson, Andrew
    Pan, Yuechao
    Wu, Yuduo
    Riffel, Andy
    Owens, John D.
    ACM SIGPLAN NOTICES, 2016, 51 (08) : 123 - 134
  • [9] Gunrock: A High-Performance Graph Processing Library on the GPU
    Wang, Yangzihao
    Davidson, Andrew
    Pan, Yuechao
    Wu, Yuduo
    Riffel, Andy
    Owens, John D.
    ACM SIGPLAN NOTICES, 2015, 50 (08) : 265 - 266
  • [10] Two-Tier Storage DBMS for High-Performance Query Processing
    Eo, Sang-Hun
    Li, Yan
    Kim, Ho-Seok
    Bae, Hae-Young
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2008, 4 (01): : 9 - 16