A Scalable Data Structure for Efficient Graph Analytics and In-Place Mutations

被引:2
|
作者
Firmli, Soukaina [1 ]
Chiadmi, Dalila [1 ]
机构
[1] Mohammed V Univ, Rabat IT Ctr, EMI, SIP Res Team, Rabat, Morocco
关键词
data structures; concurrency; graph processing; graph mutations;
D O I
10.3390/data8110166
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The graph model enables a broad range of analyses; thus, graph processing (GP) is an invaluable tool in data analytics. At the heart of every GP system lies a concurrent graph data structure that stores the graph. Such a data structure needs to be highly efficient for both graph algorithms and queries. Due to the continuous evolution, the sparsity, and the scale-free nature of real-world graphs, GP systems face the challenge of providing an appropriate graph data structure that enables both fast analytical workloads and fast, low-memory graph mutations. Existing graph structures offer a hard tradeoff among read-only performance, update friendliness, and memory consumption upon updates. In this paper, we introduce CSR++, a new graph data structure that removes these tradeoffs and enables both fast read-only analytics, and quick and memory-friendly mutations. CSR++ combines ideas from CSR, the fastest read-only data structure, and adjacency lists (ALs) to achieve the best of both worlds. We compare CSR++ to CSR, ALs from the Boost Graph Library (BGL), and the following state-of-the-art update-friendly graph structures: LLAMA, STINGER, GraphOne, and Teseo. In our evaluation, which is based on popular GP algorithms executed over real-world graphs, we show that CSR++ remains close to CSR in read-only concurrent performance (within 10% on average) while significantly outperforming CSR (by an order of magnitude) and LLAMA (by almost 2x) with frequent updates. We also show that both CSR++'s update throughput and analytics performance exceed those of several state-of-the-art graph structures while maintaining low memory consumption when the workload includes updates.
引用
收藏
页数:24
相关论文
共 50 条
  • [21] GRAPHVINE: Exploiting Multicast for Scalable Graph Analytics
    Belayneh, Leul
    Bertacco, Valeria
    PROCEEDINGS OF THE 2020 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2020), 2020, : 762 - 767
  • [22] GALGO: Scalable Graph Analytics with a Parallel DBMS
    Cabrera, Wellington
    Zhou, Xiantian
    Bellatreche, Ladjel
    Ordonez, Carlos
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 4823 - 4827
  • [23] Towards Intelligent Distributed Data Systems for Scalable Efficient and Accurate Analytics
    Triantafillou, Peter
    2018 IEEE 38TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS), 2018, : 1192 - 1202
  • [24] Lemonade: A scalable and efficient Spark-based platform for data analytics
    dos Santos, Walter
    Carvalho, Luiz F. M.
    Avelar, Gustavo de P.
    Silva, Atila, Jr.
    2017 17TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2017, : 745 - 748
  • [25] Topological Graph Sketching for Incremental and Scalable Analytics
    Bandyopadhyay, Bortik
    Fuhry, David
    Chakrabarti, Aniket
    Parthasarathy, Srinivasan
    CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 1231 - 1240
  • [26] An efficient architecture for the in-place fast cosine transform
    Sánchez, M
    López, J
    Plata, O
    Trenas, MA
    Zapata, EL
    JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 1999, 21 (02): : 91 - 102
  • [27] An Efficient General In-Place Parallel Sorting Scheme
    S. Q. Zheng
    Balaji Calidas
    Yanjun Zhang
    The Journal of Supercomputing, 1999, 14 : 5 - 17
  • [28] An Efficient Architecture for the In-Place Fast Cosine Transform
    Manuel Sánchez
    Juan López
    Oscar Plata
    Maria A. Trenas
    Emilio L. Zapata
    Journal of VLSI signal processing systems for signal, image and video technology, 1999, 21 : 91 - 102
  • [29] An efficient general in-place parallel sorting scheme
    Zheng, SQ
    Calidas, B
    Zhang, YJ
    JOURNAL OF SUPERCOMPUTING, 1999, 14 (01): : 5 - 17
  • [30] An analysis and implementation of an efficient in-place bucket sort
    Apostolos Burnetas
    Daniel Solow
    Rishi Agarwal
    Acta Informatica, 1997, 34 : 687 - 700