Practical Parallel Algorithms for Near-Optimal Densest Subgraphs on Massive Graphs

被引:0
|
作者
Sukprasert, Pattara [1 ,2 ]
Liu, Quanquan C. [3 ]
Dhulipala, Laxman [4 ]
Shun, Julian [5 ]
机构
[1] Databricks, San Francisco, CA 94105 USA
[2] Northwestern Univ, Evanston, IL 60208 USA
[3] Univ Calif Berkeley, Simons Inst, Berkeley, CA USA
[4] Univ Maryland, College Pk, MD 20742 USA
[5] MIT, CSAIL, 77 Massachusetts Ave, Cambridge, MA 02139 USA
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The densest subgraph problem has received significant attention, both in theory and in practice, due to its applications in problems such as community detection, social network analysis, and spam detection. Due to the high cost of obtaining exact solutions, much attention has focused on designing approximate densest subgraph algorithms. However, existing approaches are not able to scale to massive graphs with billions of edges. In this paper, we introduce a new framework that combines approximate densest subgraph algorithms with a pruning optimization. We design new parallel variants of the state-of-the-art sequential Greedy++ algorithm, and plug it into our framework in conjunction with a parallel pruning technique based on k-core decomposition to obtain parallel (1+epsilon)-approximate densest subgraph algorithms. On a single thread, our algorithms achieve 2.6-34x speedup over Greedy++, and obtain up to 22.37x self-relative parallel speedup on a 30core machine with two-way hyper-threading. Compared with the state-of-the-art parallel algorithm by Harb et al. [NeurIPS'22], we achieve up to a 114x speedup on the same machine. Finally, against the recent sequential algorithm of Xu et al. [PACMMOD'23], we achieve up to a 25.9x speedup. The scalability of our algorithms enables us to obtain near-optimal density statistics on the hyperlink2012 (with roughly 113 billion edges) and clueweb (with roughly 37 billion edges) graphs for the first time in the literature.
引用
收藏
页码:59 / 73
页数:15
相关论文
共 50 条
  • [31] OPTIMAL PARALLEL ALGORITHMS ON PLANAR GRAPHS
    HAGERUP, T
    LECTURE NOTES IN COMPUTER SCIENCE, 1988, 319 : 24 - 32
  • [32] Optimal and Near-Optimal Partner Selection Algorithms in Cooperative OFDMA
    Baksi, Saygin
    Kaya, Onur
    Biyikoglu, Turker
    2012 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2012,
  • [33] OPTIMAL PARALLEL ALGORITHMS ON PLANAR GRAPHS
    HAGERUP, T
    INFORMATION AND COMPUTATION, 1990, 84 (01) : 71 - 96
  • [34] OPTIMAL PARALLEL ALGORITHMS FOR SPARSE GRAPHS
    PANTZIOU, GE
    SPIRAKIS, PG
    ZAROLIAGIS, CD
    LECTURE NOTES IN COMPUTER SCIENCE, 1991, 484 : 1 - 17
  • [35] Optimal and Near-Optimal Resource Allocation Algorithms for OFDMA Networks
    Lin, Yuan-Bin
    Chiu, Tai-Hsiang
    Su, Yu T.
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2009, 8 (08) : 4066 - 4077
  • [36] Optimal and near-optimal algorithms for k-item broadcast
    Santos, EE
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1999, 57 (02) : 121 - 139
  • [37] Near-Optimal Leader Election in Population Protocols on Graphs
    Alistarh, Dan
    Rybicki, Joel
    Voitovych, Sasha
    PROCEEDINGS OF THE 2022 ACM SYMPOSIUM ON PRINCIPLES OF DISTRIBUTED COMPUTING, PODC 2022, 2022, : 246 - 256
  • [38] Near-optimal induced universal graphs for cycles and paths
    Abrahamsen, Mikkel
    Alstrup, Stephen
    Holm, Jacob
    Knudsen, Mathias Baek Tejs
    Stockel, Morten
    DISCRETE APPLIED MATHEMATICS, 2020, 282 : 1 - 13
  • [39] An operational approach for generating near-optimal station keeping strategies via parallel genetic algorithms
    Smith, JE
    Proulx, RJ
    Cefola, PJ
    Draim, JE
    ASTRODYNAMICS 1999, PTS 1-3, 2000, 103 : 1349 - 1369
  • [40] Near-Optimal Sublinear Time Algorithms for Ulam Distance
    Andoni, Alexandr
    Nguyen, Huy L.
    PROCEEDINGS OF THE TWENTY-FIRST ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2010, 135 : 76 - +