BFS-based distributed algorithm for parallel local-directed subgraph enumeration

Cited by: 1
Authors
Levinas, Itay [1]
Scherz, Roy [1]
Louzoun, Yoram [1]
Affiliations
[1] Bar Ilan Univ, Dept Math, IL-5290000 Ramat Gan, Israel
Keywords
subgraphs; BFS; GPU; full enumeration; NETWORK; GRAPHLETS;
DOI
10.1093/comnet/cnac051
Chinese Library Classification
O1 [Mathematics];
Subject classification code
0701 ; 070101 ;
Abstract
Estimating the frequency of subgraphs is important for many tasks, including subgraph isomorphism, kernel-based anomaly detection and network structure analysis. While multiple algorithms have been proposed for full enumeration or sampling-based estimates, these methods fail in very large graphs. Recent advances in parallelization allow for estimates of total subgraph counts in very large graphs. The task of counting the frequency of each subgraph associated with each vertex has also received excellent solutions for undirected graphs. However, there is currently no good solution for very large directed graphs. We here propose VDMC (Vertex specific Distributed Motif Counting), a fully distributed algorithm to optimally count all connected directed subgraphs of three and four vertices (network motifs) associated with each vertex of a graph. VDMC counts each motif only once and its efficiency is linear in the number of counted motifs. It is fully parallelized to be efficient in GPU-based computation. VDMC is based on three main elements: (1) ordering the vertices and only counting motifs whose vertices appear in increasing order; (2) sub-ordering motifs based on the average depth of the tree spanning them via a BFS traversal; and (3) removing isomorphisms only once for the entire graph. We here compare VDMC to analytical estimates of the expected number of motifs in Erdős-Rényi graphs and show its accuracy. VDMC is available as highly efficient CPU and GPU code with a novel data structure for efficient graph manipulation. We show the efficacy of VDMC on real-world graphs. VDMC allows for the precise analysis of subgraph frequency around each vertex in large graphs and opens the way for extending methods that were until now limited to graphs of thousands of edges to graphs with millions of edges and above. GIT: PyPI:
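The ordering idea in element (1) of the abstract can be illustrated with a small brute-force sketch for three-vertex motifs. This is not the VDMC implementation and makes no claim about its data structures or GPU code: the function names `canonical_code` and `count_triads_per_vertex` are hypothetical, the isomorphism handling is a naive minimum over all relabelings rather than the once-per-graph removal the abstract describes, and the graph is assumed to be given as an edge list over vertices `0..n-1`.

```python
from collections import Counter
from itertools import permutations


def canonical_code(nodes, edges):
    """Return the smallest adjacency bitmask over all relabelings of `nodes`.

    Isomorphic directed subgraphs get the same code (brute force, fine for k <= 4).
    """
    k = len(nodes)
    best = None
    for perm in permutations(range(k)):
        code = 0
        for i in range(k):
            for j in range(k):
                if i != j and (nodes[i], nodes[j]) in edges:
                    code |= 1 << (perm[i] * k + perm[j])
        if best is None or code < best:
            best = code
    return best


def count_triads_per_vertex(n, edge_list):
    """Count connected directed 3-vertex subgraphs touching each vertex."""
    edges = {(u, v) for u, v in edge_list if u != v}
    nbrs = [set() for _ in range(n)]  # neighbours ignoring direction
    for u, v in edges:
        nbrs[u].add(v)
        nbrs[v].add(u)

    counts = [Counter() for _ in range(n)]
    for root in range(n):
        # Only look at vertices with a larger index than the root, so every
        # vertex triple is generated from its smallest member and never twice.
        depth1 = sorted(w for w in nbrs[root] if w > root)
        triples = []
        for a in depth1:
            # Both other vertices at BFS depth 1 from the root.
            triples.extend((root, a, b) for b in depth1 if b > a)
            # Second vertex at BFS depth 2, reached through a.
            triples.extend(
                (root, a, b)
                for b in sorted(nbrs[a])
                if b > root and b not in nbrs[root]
            )
        for triple in triples:
            code = canonical_code(triple, edges)
            for v in triple:
                counts[v][code] += 1
    return counts


if __name__ == "__main__":
    # A directed 3-cycle 0 -> 1 -> 2 -> 0 with one extra edge 2 -> 3.
    for v, c in enumerate(count_triads_per_vertex(4, [(0, 1), (1, 2), (2, 0), (2, 3)])):
        print(v, dict(c))
```

Each root vertex only explores higher-indexed vertices, so the loop over roots is independent per vertex and could in principle be distributed; the per-vertex `Counter` objects hold the motif frequencies around each vertex.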
Pages: 15