A Scalable Data Structure for Efficient Graph Analytics and In-Place Mutations

被引：2

作者：

Firmli, Soukaina ^{[1
]}

Chiadmi, Dalila ^{[1
]}

机构：

[1] Mohammed V Univ, Rabat IT Ctr, EMI, SIP Res Team, Rabat, Morocco

来源：

DATA | 2023年 / 8卷 / 11期

关键词：

data structures; concurrency; graph processing; graph mutations;

D O I：

10.3390/data8110166

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The graph model enables a broad range of analyses; thus, graph processing (GP) is an invaluable tool in data analytics. At the heart of every GP system lies a concurrent graph data structure that stores the graph. Such a data structure needs to be highly efficient for both graph algorithms and queries. Due to the continuous evolution, the sparsity, and the scale-free nature of real-world graphs, GP systems face the challenge of providing an appropriate graph data structure that enables both fast analytical workloads and fast, low-memory graph mutations. Existing graph structures offer a hard tradeoff among read-only performance, update friendliness, and memory consumption upon updates. In this paper, we introduce CSR++, a new graph data structure that removes these tradeoffs and enables both fast read-only analytics, and quick and memory-friendly mutations. CSR++ combines ideas from CSR, the fastest read-only data structure, and adjacency lists (ALs) to achieve the best of both worlds. We compare CSR++ to CSR, ALs from the Boost Graph Library (BGL), and the following state-of-the-art update-friendly graph structures: LLAMA, STINGER, GraphOne, and Teseo. In our evaluation, which is based on popular GP algorithms executed over real-world graphs, we show that CSR++ remains close to CSR in read-only concurrent performance (within 10% on average) while significantly outperforming CSR (by an order of magnitude) and LLAMA (by almost 2x) with frequent updates. We also show that both CSR++'s update throughput and analytics performance exceed those of several state-of-the-art graph structures while maintaining low memory consumption when the workload includes updates.

引用

页数：24

共 50 条

[21] GRAPHVINE: Exploiting Multicast for Scalable Graph Analytics
Belayneh, Leul
Bertacco, Valeria
PROCEEDINGS OF THE 2020 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2020), 2020, : 762 - 767
[22] GALGO: Scalable Graph Analytics with a Parallel DBMS
Cabrera, Wellington
Zhou, Xiantian
Bellatreche, Ladjel
Ordonez, Carlos
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 4823 - 4827
[23] Towards Intelligent Distributed Data Systems for Scalable Efficient and Accurate Analytics
Triantafillou, Peter
2018 IEEE 38TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS), 2018, : 1192 - 1202
[24] Lemonade: A scalable and efficient Spark-based platform for data analytics
dos Santos, Walter
Carvalho, Luiz F. M.
Avelar, Gustavo de P.
Silva, Atila, Jr.
2017 17TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2017, : 745 - 748
[25] Topological Graph Sketching for Incremental and Scalable Analytics
Bandyopadhyay, Bortik
Fuhry, David
Chakrabarti, Aniket
Parthasarathy, Srinivasan
CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 1231 - 1240
[26] An efficient architecture for the in-place fast cosine transform
Sánchez, M
López, J
Plata, O
Trenas, MA
Zapata, EL
JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 1999, 21 (02): : 91 - 102
[27] An Efficient General In-Place Parallel Sorting Scheme
S. Q. Zheng
Balaji Calidas
Yanjun Zhang
The Journal of Supercomputing, 1999, 14 : 5 - 17
[28] An Efficient Architecture for the In-Place Fast Cosine Transform
Manuel Sánchez
Juan López
Oscar Plata
Maria A. Trenas
Emilio L. Zapata
Journal of VLSI signal processing systems for signal, image and video technology, 1999, 21 : 91 - 102
[29] An efficient general in-place parallel sorting scheme
Zheng, SQ
Calidas, B
Zhang, YJ
JOURNAL OF SUPERCOMPUTING, 1999, 14 (01): : 5 - 17
[30] An analysis and implementation of an efficient in-place bucket sort
Apostolos Burnetas
Daniel Solow
Rishi Agarwal
Acta Informatica, 1997, 34 : 687 - 700

← 1 2 3 4 5 →