Kaskade: Graph Views for Efficient Graph Analytics

被引:7
|
作者
da Trindade, Joana M. F. [1 ]
Karanasos, Konstantinos [2 ]
Curino, Carlo [2 ]
Madden, Samuel [1 ]
Shun, Julian [1 ]
机构
[1] MIT, CSAIL, Cambridge, MA 02139 USA
[2] Microsoft, Albuquerque, NM USA
关键词
D O I
10.1109/ICDE48307.2020.00024
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Graphs are a natural way to model real-world entities and relationships between them, ranging from social networks to data lineage graphs and biological datasets. Queries over these large graphs often involve expensive sub-graph traversals and complex analytical computations. These real-world graphs are often substantially more structured than a generic vertex-and-edge model would suggest, but this insight has remained mostly unexplored by existing graph engines for graph query optimization purposes. In this work, we leverage structural properties of graphs and queries to automatically derive materialized graph views that can dramatically speed up query evaluation. We present KASKADE, the first graph query optimization framework to exploit materialized graph views for query optimization purposes. KASKADE employs a novel constraint-based view enumeration technique that mines constraints from query workloads and graph schemas, and injects them during view enumeration to significantly reduce the search space of views to be considered. Moreover, it introduces a graph view size estimator to pick the most beneficial views to materialize given a query set and to select the best query evaluation plan given a set of materialized views. We evaluate its performance over real-world graphs, including the provenance graph that we maintain at Microsoft to enable auditing, service analytics, and advanced system optimizations. Our results show that KASKADE substantially reduces the effective graph size and yields significant performance speedups (up to 50X), in some cases making otherwise intractable queries possible.
引用
收藏
页码:193 / 204
页数:12
相关论文
共 50 条
  • [31] Accelerating Graph Analytics by Utilising the Memory Locality of Graph Partitioning
    Sun, Jiawen
    Vandierendonck, Hans
    Nikolopoulos, Dimitrios S.
    2017 46TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2017, : 181 - 190
  • [32] Improving Graph Compression for EfficientResource-Constrained Graph Analytics
    Xu, Qian
    Yang, Juan
    Zhang, Feng
    Chen, Zheng
    Guan, Jiawei
    Chen, Kang
    Fan, Ju
    Shen, Youren
    Yang, Ke
    Zhang, Yu
    Du, Xiaoyong
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2024, 17 (09): : 2212 - 2226
  • [33] Balancing Memory Accesses for Energy-Efficient Graph Analytics Accelerators
    Yan, Mingyu
    Hu, Xing
    Li, Shuangchen
    Akgun, Itir
    Li, Han
    Ma, Xin
    Deng, Lei
    Ye, Xiaochun
    Zhang, Zhimin
    Fan, Dongrui
    Xie, Yuan
    2019 IEEE/ACM INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN (ISLPED), 2019,
  • [34] A Scalable Data Structure for Efficient Graph Analytics and In-Place Mutations
    Firmli, Soukaina
    Chiadmi, Dalila
    DATA, 2023, 8 (11)
  • [35] Explore Efficient Data Organization for Large Scale Graph Analytics and Storage
    Xia, Yinglong
    Tanase, Ilie Gabriel
    Nai, Lifeng
    Tan, Wei
    Liu, Yanbin
    Crawford, Jason
    Lin, Ching-Yung
    2014 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2014, : 942 - 951
  • [36] Efficient Distributed Graph Analytics using Triply Compressed Sparse Format
    Mofrad, Mohammad Hasanzadeh
    Melhem, Rami
    Ahmad, Yousuf
    Hammoud, Mohammad
    2019 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2019, : 351 - 361
  • [37] Maintaining hierarchical graph views
    Buchsbaum, AL
    Wesbrook, JR
    PROCEEDINGS OF THE ELEVENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2000, : 566 - 575
  • [38] Structure-aware Fisheye Views for Efficient Large Graph Exploration
    Wang, Yunhai
    Wang, Yanyan
    Zhang, Haifeng
    Sun, Yinqi
    Fu, Chi-Wing
    Sedlmair, Michael
    Chen, Baoquan
    Deussen, Oliver
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2019, 25 (01) : 566 - 575
  • [39] Persistent graph stream summarization for real-time graph analytics
    Yan Jia
    Zhaoquan Gu
    Zhihao Jiang
    Cuiyun Gao
    Jianye Yang
    World Wide Web, 2023, 26 : 2647 - 2667
  • [40] Navigating the Maze of Graph Analytics Frameworks using Massive Graph Datasets
    Satish, Nadathur
    Sundaram, Narayanan
    Patwary, Md Mostofa Ali
    Seo, Jiwon
    Park, Jongsoo
    Hassaan, M. Amber
    Sengupta, Shubho
    Yin, Zhaoming
    Dubey, Pradeep
    SIGMOD'14: PROCEEDINGS OF THE 2014 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2014, : 979 - 990