LazyGraph: Lazy Data Coherency for Replicas in Distributed Graph-Parallel Computation

被引:24
|
作者
Wang, Lei [1 ]
Zhuang, Liangji [1 ,2 ]
Chen, Junhang [1 ,2 ]
Cui, Huimin [1 ,2 ]
Lv, Fang [1 ]
Liu, Ying [1 ]
Fen, Xiaobing [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, State Key Lab Comp Architecture, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Lazy Data Coherency; Distributed Graphparallel Computation; Execution Model; FRAMEWORK;
D O I
10.1145/3200691.3178508
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Replicas of a vertex play an important role in existing distributed graph processing systems which make a single vertex to be parallel processed by multiple machines and access remote neighbors locally without any remote access. However, replicas of vertices introduce data coherency problem. Existing distributed graph systems treat replicas of a vertex v as an atomic and indivisible vertex, and use an eager data coherency approach to guarantee replicas atomicity. In eager data coherency approach, any changes to vertex data must be immediately communicated to all replicas of v, thus leading to frequent global synchronizations and communications. In this paper, we propose a lazy data coherency approach, called LazyAsync, which treats replicas of a vertex as independent vertices and maintains the data coherency by computations, rather than communications in existing eager approach. Our approach automatically selects some data coherency points from the graph algorithm, and maintains all replicas to share the same global view only at such points, which means the replicas are enabled to maintain different local views between any two adjacent data coherency points. Based on PowerGraph, we develop a distributed graph processing system LazyGraph to implement the LazyAsync approach and exploit graph-aware optimizations. On a 48-node EC2-like cluster, LazyGraph outperforms PowerGraph on four widely used graph algorithms across a variety of real-world graphs, with a speedup ranging from 1.25x to 10.69x.
引用
收藏
页码:276 / 289
页数:14
相关论文
共 18 条
  • [1] LazyGraph: Lazy Data Coherency for Replicas in Distributed Graph-Parallel Computation
    Wang L.
    Zhuang L.
    Chen J.
    Cui H.
    Lv F.
    Liu Y.
    Feng X.
    ACM SIGPLAN Notices, 2018, 53 (01): : 276 - 289
  • [2] SYNC or ASYNC: Time to Fuse for Distributed Graph-Parallel Computation
    Xie, Chenning
    Chen, Rong
    Guan, Haibing
    Zang, Binyu
    Chen, Haibo
    ACM SIGPLAN NOTICES, 2015, 50 (08) : 194 - 204
  • [3] LightGraph: Lighten Communication in Distributed Graph-Parallel Processing
    Zhao, Yue
    Yoshigoe, Kenji
    Xie, Mengjun
    Zhou, Suijian
    Seker, Remzi
    Bian, Jiang
    2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 717 - 724
  • [4] Reducing the synchronizing communication overhead for distributed graph-parallel computing
    Zhao, Yue
    Yoshigoe, Kenji
    Li, Hongliang
    Xiong, Ke
    INTELLIGENT DATA ANALYSIS, 2019, 23 (02) : 313 - 332
  • [5] L-PowerGraph: a lightweight distributed graph-parallel communication mechanism
    Yue Zhao
    Kenji Yoshigoe
    Mengjun Xie
    Jiang Bian
    Ke Xiong
    The Journal of Supercomputing, 2020, 76 : 1850 - 1879
  • [6] L-PowerGraph: a lightweight distributed graph-parallel communication mechanism
    Zhao, Yue
    Yoshigoe, Kenji
    Xie, Mengjun
    Bian, Jiang
    Xiong, Ke
    JOURNAL OF SUPERCOMPUTING, 2020, 76 (03): : 1850 - 1879
  • [7] Graphine: Programming Graph-Parallel Computation of Large Natural Graphs for Multicore Clusters
    Yan, Jie
    Tan, Guangming
    Mo, Zeyao
    Sun, Ninghui
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2016, 27 (06) : 1647 - 1659
  • [8] Timescale Graph-Parallel Computation and Mechanism Analysis of Economical Predictive Driving for Commercial Trucks
    Hong, Jinlong
    Guo, Lulu
    Na, Xiaoxiang
    Li, Xianning
    Chu, Hongqing
    Gao, Bingzhao
    Chen, Hong
    2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024, 2024, : 454 - 459
  • [9] Lifting sequential graph algorithms for distributed-memory parallel computation
    Gregor, D
    Lumsdaine, A
    ACM SIGPLAN NOTICES, 2005, 40 (10) : 423 - 437
  • [10] Automatic data and computation decomposition on distributed memory parallel computers
    Lee, P
    Kedem, ZM
    ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS, 2002, 24 (01): : 1 - 50