Optimizing the Critical Path of Distributed Dataflow Graph Algorithms

被引:0
|
作者
Durrman, Dante [1 ]
Saule, Erik [2 ]
机构
[1] UNC Charlotte, Dept Math, Charlotte, NC 28223 USA
[2] UNC Charlotte, Dept Comp Sci, Charlotte, NC 28223 USA
基金
美国国家科学基金会;
关键词
graph analysis; distributed computing; partial order; interval coloring; randomized algorithms; COLOR;
D O I
10.1109/IPDPSW59300.2023.00147
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Executing graph algorithms in a parallel or distributed context is a challenging problem. Solving race conditions with locks is usually prohibitively expensive and some algorithms opt for a strategy that ignores the race condition altogether and corrects later the derived solution if it is invalid. Alternatively, dataflow algorithms solve the synchronization problem by executing the algorithm by following a partial order on the graph. While removing the cost of locks or avoiding a checking phase improves performance, it is possible that the algorithm picks a partial order with long chains, which limit parallelism. In this paper, we investigate how distributed dataflow graph algorithm obtain a partial order and how one could favor orders with shorter long chains. Most dataflow algorithms obtain their order by having each vertex of the graph pick a uniformly random number in [0; 1) and order the vertices based on that number. We believe that this type of order could lead to long chains in graphs with dense regions such as small world graph. We design two alternative ways of generating the order to make it similar to a largest degree first order. We study the behavior of these different algorithms on a wide range of randomly generated RMAT graphs and on a set of real world graphs. And we show that our ordering methods can significantly reduce the length of the longest chain.
引用
收藏
页码:898 / 904
页数:7
相关论文
共 50 条
  • [21] Graph Collapsing in Shortest Path Auction Algorithms
    R. Cerulli
    P. Festa
    G. Raiconi
    Computational Optimization and Applications, 2001, 18 : 199 - 220
  • [22] Efficient Fault-tolerance for Iterative Graph Processing on Distributed Dataflow Systems
    Xu, Chen
    Holzemer, Markus
    Kaul, Manohar
    Markl, Volker
    2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 613 - 624
  • [23] Shortening the Path to Designing Efficient Graph Algorithms
    Peng, Richard
    COMMUNICATIONS OF THE ACM, 2025, 68 (02) : 86 - 86
  • [24] Graph Topology Abstraction for Distributed Path Queries
    Balaji, Janani
    Sunderraman, Rajshekhar
    PROCEEDINGS OF THE ACM WORKSHOP ON HIGH PERFORMANCE GRAPH PROCESSING (HPGP'16), 2016, : 27 - 34
  • [25] Distributed Graph Path Queries using Spark
    Balaji, Janani
    Sunderraman, Rajshekhar
    PROCEEDINGS 2016 IEEE 40TH ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE WORKSHOPS (COMPSAC), VOL 2, 2016, : 326 - 331
  • [26] Optimizing path query performance: graph clustering strategies
    Huang, YW
    Jing, N
    Rundensteiner, EA
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2000, 8 (1-6) : 381 - 408
  • [27] Optimizing regular path expressions using graph schemas
    Fernandez, M
    Suciu, D
    14TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 1998, : 14 - 23
  • [28] Fast distributed algorithms for testing graph properties
    Censor-Hillel, Keren
    Fischer, Eldar
    Schwartzman, Gregory
    Vasudev, Yadu
    DISTRIBUTED COMPUTING, 2019, 32 (01) : 41 - 57
  • [29] Fast distributed algorithms for testing graph properties
    Keren Censor-Hillel
    Eldar Fischer
    Gregory Schwartzman
    Yadu Vasudev
    Distributed Computing, 2019, 32 : 41 - 57
  • [30] A Graph Theoretic Approach for Resilient Distributed Algorithms
    Parter, Merav
    PROCEEDINGS OF THE 2022 ACM SYMPOSIUM ON PRINCIPLES OF DISTRIBUTED COMPUTING, PODC 2022, 2022, : 324 - 324