A robust, efficient, and balanced parallel algorithm for finding connected components

被引:0
|
作者
Asokan, M. [1 ]
机构
[1] Syncsort Inc, Pearl River, NY 10965 USA
来源
2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) | 2019年
关键词
Graph Mining; Connected Components; Hadoop; MapReduce; MAPREDUCE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Finding connected components in an undirected graph has many practical applications. For example in a graph representing a social network, a connected component represents a group of related individuals with common interest. Also, finding connected components forms the basis for other clustering algorithms. In this paper, we will present a parallel algorithm which uses the well known sequential algorithm as the basis for finding connected components in an undirected graph. The algorithm can be adopted to run on a single computer with multiple cores or MapReduce. It is robust in the sense that it honors memory limits. This is important in today's containerized environments. It balances the workload even in the presence of data skew. For the best known algorithm running in MapReduce, the number of iterations is the square of the logarithmic function of the number of vertices in the graph. For our algorithm, we will prove that the upper bound on the number of iterations is a logarithmic function of the maximum size of a connected component. In each iteration, the amount of data read from or written to a file system is bounded by four times the number of edges in the graph.
引用
收藏
页码:2113 / 2118
页数:6
相关论文
共 50 条
  • [1] Efficient parallel algorithm for finding strongly connected components based on granulation strategy
    Xu, Taihua
    He, Huixing
    Yang, Xibei
    Yang, Jie
    Song, Jingjing
    Cui, Yun
    KNOWLEDGE AND INFORMATION SYSTEMS, 2025, 67 (03) : 2855 - 2879
  • [2] AN OPTIMAL RANDOMIZED PARALLEL ALGORITHM FOR FINDING CONNECTED COMPONENTS IN A GRAPH
    GAZIT, H
    SIAM JOURNAL ON COMPUTING, 1991, 20 (06) : 1046 - 1067
  • [3] A space-efficient algorithm for finding strongly connected components
    Pearce, David J.
    INFORMATION PROCESSING LETTERS, 2016, 116 (01) : 47 - 52
  • [4] ALZI: An Improved Parallel Algorithm for Finding Connected Components in Large Graphs
    Boddu, Sharon
    Khan, Maleq
    EURO-PAR 2024: PARALLEL PROCESSING, PT III, EURO-PAR 2024, 2024, 14803 : 134 - 147
  • [5] Finding strongly connected components in parallel
    Pogorilyy, S. D.
    Lozytskyi, S., I
    APPLIED AND COMPUTATIONAL MATHEMATICS, 2007, 6 (02) : 236 - 245
  • [6] FINDING CONNECTED COMPONENTS AND CONNECTED ONES ON A MESH-CONNECTED PARALLEL COMPUTER
    NASSIMI, D
    SAHNI, S
    SIAM JOURNAL ON COMPUTING, 1980, 9 (04) : 744 - 757
  • [8] An Adaptive Parallel Algorithm for Computing Connected Components
    Jain, Chirag
    Flick, Patrick
    Pan, Tony
    Green, Oded
    Aluru, Srinivas
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (09) : 2428 - 2439
  • [9] Parallel algorithms for finding connected components using linear algebra
    Zhang, Yongzhe
    Azad, Ariful
    Buluc, Aydin
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2020, 144 (14-27) : 14 - 27
  • [10] An Efficient Parallel Algorithm for Multi-Scale Analysis of Connected Components in Gigapixel Images
    Wilkinson, Michael H. F.
    Pesaresi, Martino
    Ouzounis, Georgios K.
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2016, 5 (03)