A Scalable Data Structure for Efficient Graph Analytics and In-Place Mutations

被引:2
|
作者
Firmli, Soukaina [1 ]
Chiadmi, Dalila [1 ]
机构
[1] Mohammed V Univ, Rabat IT Ctr, EMI, SIP Res Team, Rabat, Morocco
关键词
data structures; concurrency; graph processing; graph mutations;
D O I
10.3390/data8110166
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The graph model enables a broad range of analyses; thus, graph processing (GP) is an invaluable tool in data analytics. At the heart of every GP system lies a concurrent graph data structure that stores the graph. Such a data structure needs to be highly efficient for both graph algorithms and queries. Due to the continuous evolution, the sparsity, and the scale-free nature of real-world graphs, GP systems face the challenge of providing an appropriate graph data structure that enables both fast analytical workloads and fast, low-memory graph mutations. Existing graph structures offer a hard tradeoff among read-only performance, update friendliness, and memory consumption upon updates. In this paper, we introduce CSR++, a new graph data structure that removes these tradeoffs and enables both fast read-only analytics, and quick and memory-friendly mutations. CSR++ combines ideas from CSR, the fastest read-only data structure, and adjacency lists (ALs) to achieve the best of both worlds. We compare CSR++ to CSR, ALs from the Boost Graph Library (BGL), and the following state-of-the-art update-friendly graph structures: LLAMA, STINGER, GraphOne, and Teseo. In our evaluation, which is based on popular GP algorithms executed over real-world graphs, we show that CSR++ remains close to CSR in read-only concurrent performance (within 10% on average) while significantly outperforming CSR (by an order of magnitude) and LLAMA (by almost 2x) with frequent updates. We also show that both CSR++'s update throughput and analytics performance exceed those of several state-of-the-art graph structures while maintaining low memory consumption when the workload includes updates.
引用
收藏
页数:24
相关论文
共 50 条
  • [41] Scalable Storage Structure for Pattern Matching on Big Graph Data
    Balaji, Janani
    Sunderraman, Rajshekhar
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 1848 - 1855
  • [42] MSL: An efficient adaptive in-place radix sort algorithm
    El-Aker, Fouad
    Al-Badarneh, Amer
    Lect. Notes Comput. Sci., (606-609):
  • [43] Efficient and Scalable Integrity Verification of Data and Query Results for Graph Databases
    Arshad, Muhammad U.
    Kundu, Ashish
    Bertino, Elisa
    Ghafoor, Arif
    Kundu, Chinmay
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (05) : 866 - 879
  • [44] Scalable Data-Intensive Analytics
    Hsu, Meichun
    Chen, Qiming
    BUSINESS INTELLIGENCE FOR THE REAL-TIME ENTERPRISE, 2009, 27 : 97 - +
  • [45] Clouds for Scalable Big Data Analytics
    Talia, Domenico
    COMPUTER, 2013, 46 (05) : 98 - 101
  • [46] Calculation of the Damping Factors of the Flexible Pavement Structure Courses According to the In-Place Testing Data
    Uglova, Evgeniya
    Tiraturyan, Artem
    TRANSBALTICA 2017: TRANSPORTATION SCIENCE AND TECHNOLOGY, 2017, 187 : 742 - 748
  • [47] An Efficient Data Structure for Dynamic Graph on GPUs
    Zou, Lei
    Zhang, Fan
    Lin, Yinnian
    Yu, Yanpeng
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (11) : 11051 - 11066
  • [48] Efficient in-place update with grouped and pipelined data transmission in erasure-coded storage systems
    Pei, Xiaoqiang
    Wang, Yijie
    Ma, Xingkong
    Xu, Fangliang
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2017, 69 : 24 - 40
  • [49] Efficient Graph Analytics in Python']Python for Large-Scale Data Science
    Zhou, Xiantian
    Ordonez, Carlos
    BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY (DAWAK 2021), 2021, 12925 : 158 - 164
  • [50] Scalable Energy-Efficient Distributed Data Analytics for Crowdsensing Applications in Mobile Environments
    Jayaraman, Prem Prakash
    Gomes, Joao Bartolo
    Nguyen, Hai-Long
    Abdallah, Zahraa Said
    Krishnaswamy, Shonali
    Zaslaysky, Arkady
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2015, 2 (03): : 109 - 123