BFS-based distributed algorithm for parallel local-directed subgraph enumeration

被引:1
|
作者
Levinas, Itay [1 ]
Scherz, Roy [1 ]
Louzoun, Yoram [1 ]
机构
[1] Bar Ilan Univ, Dept Math, IL-5290000 Ramat Gan, Israel
关键词
subgraphs; BFS; GPU; full enumeration; NETWORK; GRAPHLETS;
D O I
10.1093/comnet/cnac051
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Estimating the frequency of subgraphs is of importance for many tasks, including subgraph isomorphism, kernel-based anomaly detection and network structure analysis. While multiple algorithms were proposed for full enumeration or sampling-based estimates, these methods fail in very large graphs. Recent advances in parallelization allow for estimates of total subgraph counts in very large graphs. The task of counting the frequency of each subgraph associated with each vertex also received excellent solutions for undirected graphs. However, there is currently no good solution for very large directed graphs.We here propose VDMC (Vertex specific Distributed Motif Counting)-a fully distributed algorithm to optimally count all the three and four vertices connected directed graphs (network motifs) associated with each vertex of a graph. VDMC counts each motif only once and its efficiency is linear in the number of counted motifs. It is fully parallelized to be efficient in GPU-based computation. VDMC is based on three main elements: (1) Ordering the vertices and only counting motifs containing increasing order vertices; (2) sub-ordering motifs based on the average depth of the tree spanning them via a BFS traversal; and (3) removing isomorphisms only once for the entire graph. We here compare VDMC to analytical estimates of the expected number of motifs in Erdos-Renyi graphs and show its accuracy. VDMC is available as a highly efficient CPU and GPU code with a novel data structure for efficient graph manipulation. We show the efficacy of VDMC on real-world graphs. VDMC allows for the precise analysis of subgraph frequency around each vertex in large graphs and opens the way for the extension of methods until now limited to graphs of thousands of edges to graphs with millions of edges and above.GIT:PyPI:
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Research of Distributed Algorithm based on Parallel Computer Cluster System
    Xu He-li
    Liu Yan
    THIRD INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND COMPUTATIONAL TECHNOLOGY (ISCSCT 2010), 2010, : 369 - 372
  • [22] A GVT Based Algorithm for Butterfly Barrier in Parallel and Distributed Systems
    Rizvi, Syed S.
    Potham, Shalini
    Elleithy, Khaled M.
    ADVANCES TECHNIQUES IN COMPUTING SCIENCES AND SOFTWARE ENGINEERING, 2010, : 589 - 593
  • [23] Research on parallel algorithm based on hadoop distributed computing platform
    Heilongjiang University of Technology, Jixi, China
    Int. J. Grid Distrib. Comput., 4 (163-170):
  • [24] MapReduce based distributed parallel algorithm for extracting the hot path
    Gui, Zhiming
    Xiang, Yu
    Li, Yujian
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2012, 52 (SUPPL.1): : 29 - 32
  • [25] PARALLEL IMPLEMENTATION FOR SAM ALGORITHM BASED ON GPU AND DISTRIBUTED COMPUTING
    Qu, Haicheng
    Zhang, Junping
    Chen, Yushi
    Chen, Hao
    Lin, Zhouhan
    2012 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2012, : 4074 - 4077
  • [26] Parallel Distributed Genetic Algorithm Development Based on Microcontrollers Framework
    Krishnan, Prajindra Sankar
    Kiong, Tiong Sieh
    Koh, Johnny
    DFMA 2008: FIRST INTERNATIONAL CONFERENCE ON DISTRIBUTED FRAMEWORKS & APPLICATIONS, PROCEEDINGS, 2008, : 35 - 40
  • [27] Distributed Parallel algorithm based on direct transformation method of substructure
    Wang Xiaohong
    Zhuang Yi
    ICMS2009: PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON MODELLING AND SIMULATION, VOL 5, 2009, : 452 - 457
  • [28] Apriori Parallel Improved Algorithm Based on MapReduce Distributed Architecture
    She Xiangyang
    Zhang Ling
    PROCEEDINGS OF 2016 SIXTH INTERNATIONAL CONFERENCE ON INSTRUMENTATION & MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC 2016), 2016, : 517 - 521
  • [30] Compact Local IRBF and Domain Decomposition Method for solving PDEs using a Distributed termination detection based parallel algorithm
    Pham-Sy, N.
    Tran, C. -D.
    Hoang-Trieu, T. -T.
    Mai-Duy, N.
    Tran-Cong, T.
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2013, 92 (01): : 1 - 31