Large scale graph processing systems: survey and an experimental evaluation

被引:50
|
作者
Batarfi, Omar [1 ,5 ]
El Shawi, Radwa [2 ]
Fayoumi, Ayman G. [1 ,5 ]
Nouri, Reza [3 ]
Beheshti, Seyed-Mehdi-Reza [3 ,6 ]
Barnawi, Ahmed [1 ,5 ]
Sakr, Sherif [3 ,4 ,7 ]
机构
[1] King Abdulaziz Univ, Jeddah 21413, Saudi Arabia
[2] Princess Nourah Bint Abdulrahman Univ, Coll Comp Sci & Informat Technol, Riyadh, Saudi Arabia
[3] Univ New S Wales, Sydney, NSW, Australia
[4] King Saud Bin Abdulaziz Univ Hlth Sci, Dept Hlth Informat, Riyadh, Saudi Arabia
[5] King Abdulaziz Univ, Fac Comp & Informat Technol, Jeddah 21413, Saudi Arabia
[6] Univ New S Wales, Sch Comp Sci & Engn CSE, Serv Oriented Comp Grp, Sydney, NSW, Australia
[7] Univ New S Wales, Comp Sci, Sydney, NSW, Australia
关键词
Big graph; Graph processing; Experimental evaluation; FRAMEWORK; ANALYTICS; MAPREDUCE;
D O I
10.1007/s10586-015-0472-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Graph is a fundamental data structure that captures relationships between different data entities. In practice, graphs are widely used for modeling complicated data in different application domains such as social networks, protein networks, transportation networks, bibliographical networks, knowledge bases and many more. Currently, graphs with millions and billions of nodes and edges have become very common. In principle, graph analytics is an important big data discovery technique. Therefore, with the increasing abundance of large graphs, designing scalable systems for processing and analyzing large scale graphs has become one of the most timely problems facing the big data research community. In general, scalable processing of big graphs is a challenging task due to their size and the inherent irregular structure of graph computations. Thus, in recent years, we have witnessed an unprecedented interest in building big graph processing systems that attempted to tackle these challenges. In this article, we provide a comprehensive survey over the state-of-the-art of large scale graph processing platforms. In addition, we present an extensive experimental study of five popular systems in this domain, namely, GraphChi, Apache Giraph, GPS, GraphLab and GraphX. In particular, we report and analyze the performance characteristics of these systems using five common graph processing algorithms and seven large graph datasets. Finally, we identify a set of the current open research challenges and discuss some promising directions for future research in the domain of large scale graph processing.
引用
收藏
页码:1189 / 1213
页数:25
相关论文
共 50 条
  • [1] Large scale graph processing systems: survey and an experimental evaluation
    Omar Batarfi
    Radwa El Shawi
    Ayman G. Fayoumi
    Reza Nouri
    Seyed-Mehdi-Reza Beheshti
    Ahmed Barnawi
    Sherif Sakr
    Cluster Computing, 2015, 18 : 1189 - 1213
  • [2] Correction To: Large scale graph processing systems: survey and an experimental evaluation
    Omar Batarfi
    Radwa El Shawi
    Ayman G. Fayoumi
    Reza Nouri
    Seyed-Mehdi-Reza Beheshti
    Ahmed Barnawi
    Sherif Sakr
    Cluster Computing, 2018, 21 : 1455 - 1455
  • [3] Large scale graph processing systems: survey and an experimental evaluation (vol 18, pg 1189, 2015)
    Batarfi, Omar
    El Shawi, Radwa
    Fayoumi, Ayman G.
    Nouri, Reza
    Beheshti, Seyed-Mehdi-Reza
    Barnawi, Ahmed
    Sakr, Sherif
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2018, 21 (02): : 1455 - 1455
  • [4] Large-scale graph processing systems: a survey
    Ning Liu
    Dong-sheng Li
    Yi-ming Zhang
    Xiong-lve Li
    Frontiers of Information Technology & Electronic Engineering, 2020, 21 : 384 - 404
  • [5] Large-scale graph processing systems: a survey
    Liu, Ning
    Li, Dong-sheng
    Zhang, Yi-ming
    Li, Xiong-lve
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2020, 21 (03) : 384 - 404
  • [6] Large-Scale Distributed Graph Computing Systems: An Experimental Evaluation
    Lu, Yi
    Cheng, James
    Yan, Da
    Wu, Huanhuan
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2014, 8 (03): : 281 - 292
  • [7] Survey on Large-scale Graph Neural Network Systems
    Zhao G.
    Wang Q.-G.
    Yao F.
    Zhang Y.-F.
    Yu G.
    Ruan Jian Xue Bao/Journal of Software, 2022, 33 (01): : 150 - 170
  • [8] EXPERIMENTAL METHODOLOGIES FOR LARGE-SCALE SYSTEMS: A SURVEY
    Gustedt, Jens
    Jeannot, Emmanuel
    Quinson, Martin
    PARALLEL PROCESSING LETTERS, 2009, 19 (03) : 399 - 418
  • [9] An Experimental Evaluation of Large Scale GBDT Systems
    Fu, Fangcheng
    Jiang, Jiawei
    Shao, Yingxia
    Cui, Bin
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2019, 12 (11): : 1357 - 1370
  • [10] Design and Experimental Evaluation of Distributed Heterogeneous Graph-Processing Systems
    Guo, Yong
    Varbanescu, Ana Lucia
    Epema, Dick
    Iosup, Alexandru
    2016 16TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2016, : 203 - 212