PathQuery Pregel: high-performance graph query with bulk synchronous processing

被引:0
|
作者
Bogdan Arsintescu
Shardul Deo
Warren Harris
机构
[1] Google Inc,
来源
关键词
Distributed graph compute ; Pregel; Graph query; Bulk synchronous parallel computing; Graph database;
D O I
暂无
中图分类号
学科分类号
摘要
High-performance graph query systems are a scalable way to mine information in Knowledge Graphs, especially when the queries benefit from a high-level expressive query language. This paper presents techniques to algorithmically compile queries expressed in a high-level language (e.g., Datalog) into a directed acyclic graph query plan and details how these queries can be run on a Pregel graph vertex-centric compute system. Our solution, called PathQuery Pregel, creates plans for any conjunctive or disjunctive queries with aggregation and negation; we describe how the query execution extracts graph results optimally while avoiding many join operations where parallel map execution is permitted. We provide details of how we scaled this system out to execute large set of queries in parallel over the Google Knowledge Graph, a graph of 70B edges, or facts; we describe our production experience with PathQuery Pregel.
引用
收藏
页码:1493 / 1504
页数:11
相关论文
共 50 条
  • [21] Processing bulk insulating CaTiO3 into a high-performance thermoelectric material
    Li, Jianbo
    Wang, Yanxia
    Yang, Xiong
    Kang, Huijun
    Cao, Zhiqiang
    Jiang, Xue
    Chen, Zongning
    Guo, Enyu
    Wang, Tongmin
    CHEMICAL ENGINEERING JOURNAL, 2022, 428
  • [22] Processing bulk insulating CaTiO3 into a high-performance thermoelectric material
    Li, Jianbo
    Wang, Yanxia
    Yang, Xiong
    Kang, Huijun
    Cao, Zhiqiang
    Jiang, Xue
    Chen, Zongning
    Guo, Enyu
    Wang, Tongmin
    Chemical Engineering Journal, 2022, 428
  • [23] Effective runtime scheduling for high-performance graph processing on heterogeneous dataflow architecture
    Qingxiang Chen
    Long Zheng
    Xiaofei Liao
    Hai Jin
    Qinggang Wang
    CCF Transactions on High Performance Computing, 2020, 2 : 362 - 375
  • [24] TDGraph: A Topology-Driven Accelerator for High-Performance Streaming Graph Processing
    Zhao, Jin
    Yang, Yun
    Zhang, Yu
    Liao, Xiaofei
    Gu, Lin
    He, Ligang
    He, Bingsheng
    Jin, Hai
    Liu, Haikun
    Jiang, Xinyu
    Yu, Hui
    PROCEEDINGS OF THE 2022 THE 49TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA '22), 2022, : 116 - 129
  • [25] Effective runtime scheduling for high-performance graph processing on heterogeneous dataflow architecture
    Chen, Qingxiang
    Zheng, Long
    Liao, Xiaofei
    Jin, Hai
    Wang, Qinggang
    CCF TRANSACTIONS ON HIGH PERFORMANCE COMPUTING, 2020, 2 (04) : 362 - 375
  • [26] High performance distributed parallel query processing
    Jiang, Y
    Taniar, D
    Leung, CHC
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2001, 16 (05): : 277 - 289
  • [27] TuNao: A High-Performance and Energy-Efficient Reconfigurable Accelerator for Graph Processing
    Zhou, Jinhong
    Liu, Shaoli
    Guo, Qi
    Zhou, Xuda
    Zhi, Tian
    Liu, Daofu
    Wang, Chao
    Zhou, Xuehai
    Chen, Yunji
    Chen, Tianshi
    2017 17TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2017, : 731 - 734
  • [28] Graph analysis with high-performance computing
    Hendrickson, Bruce
    Berry, JonatHan W.
    COMPUTING IN SCIENCE & ENGINEERING, 2008, 10 (02) : 14 - 19
  • [29] Graphlt: A High-Performance Graph DSL
    Zhang, Yunming
    Yang, Mengjiao
    Baghdadi, Riyadh
    Kamil, Shoaib
    Shun, Julian
    Amarasinghe, Saman
    PROCEEDINGS OF THE ACM ON PROGRAMMING LANGUAGES-PACMPL, 2018, 2
  • [30] GraphIt: A High-Performance Graph DSL
    Zhang, Yunming
    Yang, Mengjiao
    Baghdadi, Riyadh
    Kamil, Shoaib
    Shun, Julian
    Amarasinghe, Saman
    PROCEEDINGS OF THE ACM ON PROGRAMMING LANGUAGES-PACMPL, 2018, 2