Towards Efficient Distributed Subgraph Enumeration Via Backtracking-Based Framework

被引:1
|
作者
Wang, Zhaokang [1 ]
Hu, Weiwei [1 ]
Chen, Guowang [1 ]
Yuan, Chunfeng [1 ]
Gu, Rong [1 ]
Huang, Yihua [1 ]
机构
[1] Nanjing Univ, Dept Comp Sci & Technol, State Key Lab Novel Software Technol, Nanjing 210023, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Backtracking-based framework; distributed graph querying; incremental subgraph matching; subgraph isomorphism; sub-graph matching; ISOMORPHISM;
D O I
10.1109/TPDS.2021.3076246
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Finding or monitoring subgraph instances that are isomorphic to a given pattern graph in a data graph is a fundamental query operation in many graph analytic applications, such as network motif mining and fraud detection. Existing distributed methods are inefficient in communication. They have to shuffle partial matching results during the distributed multiway join. The partial matching results may be much larger than the data graph itself. To overcome the drawback, we develop the Batch-BENU framework for distributed subgraph enumeration on static data graphs. Batch-BENU executes a group of local search tasks in parallel. Each task enumerates subgraphs around a vertex in the data graph, guided by a backtracking-based execution plan. To handle large-scale data graphs that may exceed the memory capacity of a single machine, Batch-BENU stores the data graph in a distributed database. Each task queries adjacency sets of the data graph on demand, shuffling the data graph instead of partial matching results. To support incremental subgraph enumeration on dynamic data graphs, we propose the Streaming-BENU framework. Streaming-BENU turns the problem of enumerating incremental matching results into enumerating all matching results of incremental pattern graphs at each time step. We implement Batch-BENU and Streaming-BEND with the local database cache and the load balance optimization to improve their efficiency. Extensive experiments show that Batch-BENU and Streaming-BENU can scale to big graphs and complex pattern graphs. They outperform the state-of-the-art distributed methods by up to one and two orders of magnitude, respectively.
引用
收藏
页码:2953 / 2969
页数:17
相关论文
共 50 条
  • [1] BENU: Distributed Subgraph Enumeration with Backtracking-based Framework
    Wang, Zhaokang
    Gu, Rong
    Hu, Weiwei
    Yuan, Chunfeng
    Huang, Yihua
    2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, : 136 - 147
  • [2] Backtracking-based matching pursuit method for distributed compressed sensing
    Zhang, Yujie
    Qi, Rui
    Zeng, Yanni
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (13) : 14691 - 14710
  • [3] Backtracking-based matching pursuit method for distributed compressed sensing
    Yujie Zhang
    Rui Qi
    Yanni Zeng
    Multimedia Tools and Applications, 2017, 76 : 14691 - 14710
  • [4] A new backtracking-based sparsity adaptive algorithm for distributed compressed sensing
    徐勇
    张玉洁
    邢婧
    李宏伟
    JournalofCentralSouthUniversity, 2015, 22 (10) : 3946 - 3956
  • [5] A new backtracking-based sparsity adaptive algorithm for distributed compressed sensing
    Xu Yong
    Zhang Yu-jie
    Xing Jing
    Li Hong-wei
    JOURNAL OF CENTRAL SOUTH UNIVERSITY, 2015, 22 (10) : 3946 - 3956
  • [6] A new backtracking-based sparsity adaptive algorithm for distributed compressed sensing
    Yong Xu
    Yu-jie Zhang
    Jing Xing
    Hong-wei Li
    Journal of Central South University, 2015, 22 : 3946 - 3956
  • [7] An Efficient Backtracking-based Approach to Turn-constrained Path Planning for Aerial Mobile Robots
    Sharma, Hrishikesh
    Sebastian, Tom
    Balamuralidhar, P.
    2017 EUROPEAN CONFERENCE ON MOBILE ROBOTS (ECMR), 2017,
  • [8] BFS-based distributed algorithm for parallel local-directed subgraph enumeration
    Levinas, Itay
    Scherz, Roy
    Louzoun, Yoram
    JOURNAL OF COMPLEX NETWORKS, 2022, 10 (06)
  • [9] Towards Lightweight and Efficient Distributed Intrusion Detection Framework
    Yuan, Shuai
    Li, Hongwei
    Zhang, Rui
    Hao, Meng
    Li, Yiran
    Lu, Rongxing
    2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
  • [10] Husky: Towards a More Efficient and Expressive Distributed Computing Framework
    Yang, Fan
    Li, Jinfeng
    Cheng, James
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2016, 9 (05): : 420 - 431