iTurboGraph: Scaling and Automating Incremental Graph Analytics

被引:3
|
作者
Ko, Seongyun [1 ]
Lee, Taesung [1 ]
Hong, Kijae [1 ]
Lee, Wonseok [1 ]
Seo, In [1 ]
Seo, Jiwon [2 ]
Han, Wook-Shin [1 ]
机构
[1] POSTECH, Pohang, South Korea
[2] Hanyang Univ, Seoul, South Korea
关键词
OPTIMIZATION; LANGUAGE; MODEL;
D O I
10.1145/3448016.3457243
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the rise of streaming data for dynamic graphs, large-scale graph analytics meets a new requirement of Incremental Computation because the larger the graph, the higher the cost for updating the analytics results by re-execution. A dynamic graph consists of an initial graph G and graph mutation updates Delta G of edge insertions or deletions. Given a query Q, its results Q(G), and updates for Delta G to G, incremental graph analytics computes updates Delta Q such that Q(G boolean OR Delta G) = Q(G) boolean OR Delta Q where boolean OR is a union operator. In this paper, we consider the problem of large-scale incremental neighbor-centric graph analytics (NGA). We solve the limitations of previous systems: lack of usability due to the difficulties in programming incremental algorithms for NGA and limited scalability and efficiency due to the overheads in maintaining intermediate results for graph traversals in NGA. First, we propose a domain-specific language, L-NGA, and develop its compiler for intuitive programming of NGA, automatic query incrementalization, and query optimizations. Second, we define Graph Streaming Algebra as a theoretical foundation for scalable processing of incremental NGA. We introduce a concept of Nested Graph Windows and model graph traversals as the generation of walk streams. Lastly, we present a system iTURBOGRAPH, which efficiently processes incremental NGA for large graphs. Comprehensive experiments show that it effectively avoids costly re-executions and efficiently updates the analytics results with reduced IO and computations.
引用
收藏
页码:977 / 990
页数:14
相关论文
共 50 条
  • [31] Big graph visual analytics
    Haglin, David
    Trimm, David
    Wong, Pak Chung
    INFORMATION VISUALIZATION, 2017, 16 (03) : 155 - 156
  • [32] Automating tolerance charting using graph theory
    Britton, GA
    Cheong, FS
    Whybrew, K
    SMC '97 CONFERENCE PROCEEDINGS - 1997 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: CONFERENCE THEME: COMPUTATIONAL CYBERNETICS AND SIMULATION, 1997, : 885 - 890
  • [33] Automating weight function generation in graph pebbling
    Flocco, Dominic
    Pulaj, Jonad
    Yerger, Carl
    Discrete Applied Mathematics, 2024, 347 : 155 - 174
  • [34] Big Graph Analytics Platforms
    不详
    FOUNDATIONS AND TRENDS IN DATABASES, 2015, 7 (1-2): : 2 - +
  • [35] A Lightweight Infrastructure for Graph Analytics
    Nguyen, Donald
    Lenharth, Andrew
    Pingali, Keshav
    SOSP'13: PROCEEDINGS OF THE TWENTY-FOURTH ACM SYMPOSIUM ON OPERATING SYSTEMS PRINCIPLES, 2013, : 456 - 471
  • [36] The Taxonomy of Distributed Graph Analytics
    Rao, T. Ramalingeswara
    Mitra, Pabitra
    Goswami, A.
    2018 FIFTH INTERNATIONAL CONFERENCE ON SOCIAL NETWORKS ANALYSIS, MANAGEMENT AND SECURITY (SNAMS), 2018, : 315 - 322
  • [37] Essentials of Parallel Graph Analytics
    Osama, Muhammad
    Porumbescu, Serban D.
    Owens, John D.
    2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2022), 2022, : 314 - 317
  • [38] Automating Weight Function Generation in Graph Pebbling
    Flocco, Dominic
    Pulaj, Jonad
    Yerger, Carl
    SSRN, 2022,
  • [39] Automating Weight Function Generation in Graph Pebbling
    Flocco, Dominic
    Pulaj, Jonad
    Yerger, Carl
    arXiv, 2023,
  • [40] Automating weight function generation in graph pebbling
    Flocco, Dominic
    Pulaj, Jonad
    Yerger, Carl
    DISCRETE APPLIED MATHEMATICS, 2024, 347 : 155 - 174