A Scalable Data Structure for Efficient Graph Analytics and In-Place Mutations

被引:2
|
作者
Firmli, Soukaina [1 ]
Chiadmi, Dalila [1 ]
机构
[1] Mohammed V Univ, Rabat IT Ctr, EMI, SIP Res Team, Rabat, Morocco
关键词
data structures; concurrency; graph processing; graph mutations;
D O I
10.3390/data8110166
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The graph model enables a broad range of analyses; thus, graph processing (GP) is an invaluable tool in data analytics. At the heart of every GP system lies a concurrent graph data structure that stores the graph. Such a data structure needs to be highly efficient for both graph algorithms and queries. Due to the continuous evolution, the sparsity, and the scale-free nature of real-world graphs, GP systems face the challenge of providing an appropriate graph data structure that enables both fast analytical workloads and fast, low-memory graph mutations. Existing graph structures offer a hard tradeoff among read-only performance, update friendliness, and memory consumption upon updates. In this paper, we introduce CSR++, a new graph data structure that removes these tradeoffs and enables both fast read-only analytics, and quick and memory-friendly mutations. CSR++ combines ideas from CSR, the fastest read-only data structure, and adjacency lists (ALs) to achieve the best of both worlds. We compare CSR++ to CSR, ALs from the Boost Graph Library (BGL), and the following state-of-the-art update-friendly graph structures: LLAMA, STINGER, GraphOne, and Teseo. In our evaluation, which is based on popular GP algorithms executed over real-world graphs, we show that CSR++ remains close to CSR in read-only concurrent performance (within 10% on average) while significantly outperforming CSR (by an order of magnitude) and LLAMA (by almost 2x) with frequent updates. We also show that both CSR++'s update throughput and analytics performance exceed those of several state-of-the-art graph structures while maintaining low memory consumption when the workload includes updates.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] Area-Delay-Energy Efficient VLSI Architecture for Scalable In-Place Computation of FFT on Real Data
    Mohanty, Basant K.
    Meher, Pramod Kumar
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2019, 66 (03) : 1042 - 1050
  • [2] Scalable and Efficient Data Analytics and Mining with Lemonade
    dos Santos, Walter
    Avelar, Gustavo P.
    Ribeiro, Manoel Horta
    Guedes, Dorgival
    Meira Jr, Wagner
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2018, 11 (12): : 2070 - 2073
  • [3] BigObject Store: In-Place Computing for Interactive Analytics
    Huang, Yi-Cheng
    Hseush, Wenwey
    Lai, Yu-Chun
    Fong, Michael
    2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 537 - 544
  • [4] In-place Graph Rewriting with Interaction Nets
    Mackie, Ian
    Sato, Shinya
    ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2016, (225): : 15 - 24
  • [5] Frameworks for designing in-place graph algorithms
    Chakraborty, Sankardeep
    Mukherjee, Anish
    Raman, Venkatesh
    Satti, Srinivasa Rao
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2022, 123 : 1 - 19
  • [6] Laika: Efficient In-Place Scheduling for 3D Mesh Graph Computations
    Gruevski, Predrag
    Hasenplaugh, William
    Lugato, David
    Thomas, James J.
    SPAA'18: PROCEEDINGS OF THE 30TH ACM SYMPOSIUM ON PARALLELISM IN ALGORITHMS AND ARCHITECTURES, 2018, : 415 - 426
  • [7] Asymptotically efficient in-place merging
    Geffert, V
    Katajainen, J
    Pasanen, T
    THEORETICAL COMPUTER SCIENCE, 2000, 237 (1-2) : 159 - 181
  • [8] Optimal In-place Algorithms for Basic Graph Problems
    Chakraborty, Sankardeep
    Sadakane, Kunihiko
    Satti, Srinivasa Rao
    COMBINATORIAL ALGORITHMS, IWOCA 2020, 2020, 12126 : 126 - 139
  • [9] A Scalable Graph Analytics Framework for Programming with Big Data in R (pbdR)
    Hasan, S. M. Shamimul
    Schmidt, Drew
    Kannan, Ramakrishnan
    Imam, Neena
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 4783 - 4792
  • [10] Socrates A System For Scalable Graph Analytics
    Savkli, C.
    Carr, R.
    Chapman, M.
    Chee, B.
    Minch, D.
    2014 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2014,