A Scalable Data Structure for Efficient Graph Analytics and In-Place Mutations

Cited by: 2
Authors
Firmli, Soukaina [1]
Chiadmi, Dalila [1]
Affiliations
[1] Mohammed V Univ, Rabat IT Ctr, EMI, SIP Res Team, Rabat, Morocco
Keywords
data structures; concurrency; graph processing; graph mutations
DOI
10.3390/data8110166
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
The graph model enables a broad range of analyses; thus, graph processing (GP) is an invaluable tool in data analytics. At the heart of every GP system lies a concurrent graph data structure that stores the graph. Such a data structure needs to be highly efficient for both graph algorithms and queries. Because real-world graphs evolve continuously, are sparse, and are scale-free, GP systems face the challenge of providing a graph data structure that enables both fast analytical workloads and fast, low-memory graph mutations. Existing graph structures impose a hard tradeoff among read-only performance, update friendliness, and memory consumption upon updates. In this paper, we introduce CSR++, a new graph data structure that removes these tradeoffs and enables both fast read-only analytics and quick, memory-friendly mutations. CSR++ combines ideas from CSR, the fastest read-only data structure, and adjacency lists (ALs) to achieve the best of both worlds. We compare CSR++ to CSR, ALs from the Boost Graph Library (BGL), and the following state-of-the-art update-friendly graph structures: LLAMA, STINGER, GraphOne, and Teseo. In our evaluation, which is based on popular GP algorithms executed over real-world graphs, we show that CSR++ remains close to CSR in read-only concurrent performance (within 10% on average) while significantly outperforming CSR (by an order of magnitude) and LLAMA (by almost 2x) under frequent updates. We also show that both CSR++'s update throughput and analytics performance exceed those of several state-of-the-art graph structures while maintaining low memory consumption when the workload includes updates.
Pages: 24
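
The tradeoff the abstract describes between CSR and adjacency lists can be illustrated with a minimal sketch. This is not CSR++ and not code from the paper; the class names `CSRGraph` and `AdjacencyListGraph` and their methods are hypothetical, chosen only to show why plain CSR reads quickly from one flat array but pays for every in-place edge insertion, while an adjacency list appends cheaply.

```python
from typing import List, Tuple


class CSRGraph:
    """Plain CSR (illustrative only): neighbors of v live in
    targets[offsets[v]:offsets[v + 1]], one contiguous array for the graph."""

    def __init__(self, num_vertices: int, edges: List[Tuple[int, int]]) -> None:
        # Count the out-degree of each vertex, then prefix-sum into offsets.
        self.offsets = [0] * (num_vertices + 1)
        for src, _ in edges:
            self.offsets[src + 1] += 1
        for v in range(num_vertices):
            self.offsets[v + 1] += self.offsets[v]
        # Scatter edge targets into their vertex's slice.
        self.targets = [0] * len(edges)
        cursor = self.offsets[:-1].copy()
        for src, dst in edges:
            self.targets[cursor[src]] = dst
            cursor[src] += 1

    def neighbors(self, v: int) -> List[int]:
        return self.targets[self.offsets[v]:self.offsets[v + 1]]

    def add_edge(self, src: int, dst: int) -> None:
        # Inserting shifts all later targets and bumps all later offsets,
        # so a single update costs O(V + E) in this naive layout.
        self.targets.insert(self.offsets[src + 1], dst)
        for v in range(src + 1, len(self.offsets)):
            self.offsets[v] += 1


class AdjacencyListGraph:
    """Per-vertex lists (illustrative only): cheap appends, scattered memory."""

    def __init__(self, num_vertices: int) -> None:
        self.adj: List[List[int]] = [[] for _ in range(num_vertices)]

    def neighbors(self, v: int) -> List[int]:
        return self.adj[v]

    def add_edge(self, src: int, dst: int) -> None:
        # Amortized O(1): only the source vertex's list is touched.
        self.adj[src].append(dst)


edges = [(0, 1), (0, 2), (1, 2), (2, 0)]
csr = CSRGraph(3, edges)
al = AdjacencyListGraph(3)
for s, d in edges:
    al.add_edge(s, d)
csr.add_edge(1, 0)
al.add_edge(1, 0)
```

Both structures return the same neighbor lists after the update; the difference is how much data each `add_edge` call has to move, which is the cost the abstract says update-friendly designs try to avoid without giving up CSR-like scan speed.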