iTurboGraph: Scaling and Automating Incremental Graph Analytics

被引:3
|
作者
Ko, Seongyun [1 ]
Lee, Taesung [1 ]
Hong, Kijae [1 ]
Lee, Wonseok [1 ]
Seo, In [1 ]
Seo, Jiwon [2 ]
Han, Wook-Shin [1 ]
机构
[1] POSTECH, Pohang, South Korea
[2] Hanyang Univ, Seoul, South Korea
关键词
OPTIMIZATION; LANGUAGE; MODEL;
D O I
10.1145/3448016.3457243
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the rise of streaming data for dynamic graphs, large-scale graph analytics meets a new requirement of Incremental Computation because the larger the graph, the higher the cost for updating the analytics results by re-execution. A dynamic graph consists of an initial graph G and graph mutation updates Delta G of edge insertions or deletions. Given a query Q, its results Q(G), and updates for Delta G to G, incremental graph analytics computes updates Delta Q such that Q(G boolean OR Delta G) = Q(G) boolean OR Delta Q where boolean OR is a union operator. In this paper, we consider the problem of large-scale incremental neighbor-centric graph analytics (NGA). We solve the limitations of previous systems: lack of usability due to the difficulties in programming incremental algorithms for NGA and limited scalability and efficiency due to the overheads in maintaining intermediate results for graph traversals in NGA. First, we propose a domain-specific language, L-NGA, and develop its compiler for intuitive programming of NGA, automatic query incrementalization, and query optimizations. Second, we define Graph Streaming Algebra as a theoretical foundation for scalable processing of incremental NGA. We introduce a concept of Nested Graph Windows and model graph traversals as the generation of walk streams. Lastly, we present a system iTURBOGRAPH, which efficiently processes incremental NGA for large graphs. Comprehensive experiments show that it effectively avoids costly re-executions and efficiently updates the analytics results with reduced IO and computations.
引用
收藏
页码:977 / 990
页数:14
相关论文
共 50 条
  • [41] Semantic Property Graph for Scalable Knowledge Graph Analytics
    Purohit, Sumit
    Nhuy Van
    Chin, George
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 2672 - 2677
  • [42] FAM-Graph: Graph Analytics on Disaggregated Memory
    Zahka, Daniel
    Gavrilovska, Ada
    2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2022), 2022, : 81 - 92
  • [43] Incremental Graph Pattern Matching
    Fan, Wenfei
    Wang, Xin
    Wu, Yinghui
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2013, 38 (03):
  • [44] Incremental Streaming Graph Partitioning
    Durbeck, Lisa
    Athanas, Peter
    2020 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2020,
  • [45] DDA SCALING GRAPH
    LEAKE, RJ
    ALTHAUS, HL
    IEEE TRANSACTIONS ON COMPUTERS, 1968, C 17 (01) : 81 - &
  • [46] Incremental Update for Graph Rewriting
    Boutillier, Pierre
    Ehrhard, Thomas
    Krivine, Jean
    PROGRAMMING LANGUAGES AND SYSTEMS (ESOP 2017): 26TH EUROPEAN SYMPOSIUM ON PROGRAMMING, 2017, 10201 : 201 - 228
  • [47] Parallel incremental graph partitioning
    Ou, CW
    Ranka, S
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1997, 8 (08) : 884 - 896
  • [48] Incremental Lossless Graph Summarization
    Ko, Jihoon
    Kook, Yunbum
    Shin, Kijung
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 317 - 327
  • [49] Distributed Incremental Graph Analysis
    Gupta, Upa
    Fegaras, Leonidas
    2016 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2016, 2016, : 75 - 82
  • [50] Automating Data Narratives in Learning Analytics Dashboards using GenAI
    Pinargote, Adriano (adriano.pinargote@cti.espol.edu.ec), 1600, CEUR-WS (3667):