Towards Efficient Distributed Subgraph Enumeration Via Backtracking-Based Framework

被引:1
|
作者
Wang, Zhaokang [1 ]
Hu, Weiwei [1 ]
Chen, Guowang [1 ]
Yuan, Chunfeng [1 ]
Gu, Rong [1 ]
Huang, Yihua [1 ]
机构
[1] Nanjing Univ, Dept Comp Sci & Technol, State Key Lab Novel Software Technol, Nanjing 210023, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Backtracking-based framework; distributed graph querying; incremental subgraph matching; subgraph isomorphism; sub-graph matching; ISOMORPHISM;
D O I
10.1109/TPDS.2021.3076246
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Finding or monitoring subgraph instances that are isomorphic to a given pattern graph in a data graph is a fundamental query operation in many graph analytic applications, such as network motif mining and fraud detection. Existing distributed methods are inefficient in communication. They have to shuffle partial matching results during the distributed multiway join. The partial matching results may be much larger than the data graph itself. To overcome the drawback, we develop the Batch-BENU framework for distributed subgraph enumeration on static data graphs. Batch-BENU executes a group of local search tasks in parallel. Each task enumerates subgraphs around a vertex in the data graph, guided by a backtracking-based execution plan. To handle large-scale data graphs that may exceed the memory capacity of a single machine, Batch-BENU stores the data graph in a distributed database. Each task queries adjacency sets of the data graph on demand, shuffling the data graph instead of partial matching results. To support incremental subgraph enumeration on dynamic data graphs, we propose the Streaming-BENU framework. Streaming-BENU turns the problem of enumerating incremental matching results into enumerating all matching results of incremental pattern graphs at each time step. We implement Batch-BENU and Streaming-BEND with the local database cache and the load balance optimization to improve their efficiency. Extensive experiments show that Batch-BENU and Streaming-BENU can scale to big graphs and complex pattern graphs. They outperform the state-of-the-art distributed methods by up to one and two orders of magnitude, respectively.
引用
收藏
页码:2953 / 2969
页数:17
相关论文
共 50 条
  • [31] Towards Efficient Traffic Engineering via Distributed Optimization in Large-Scale LEO Constellation
    Wei, Linhui
    Tien-Thanh Le
    Ji, Yusheng
    Wang, Mingqian
    Liu, Yu
    Wang, Yumei
    Lui, John C. S.
    2024 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS, ICC WORKSHOPS 2024, 2024, : 353 - 358
  • [32] Integration Framework of Technological Paradigms Towards a Distributed Semantic Web based on Process Management
    Alberto Pereira-Marin, Carlos
    GECONTEC-REVISTA INTERNACIONAL DE GESTION DEL CONOCIMIENTO Y LA TECNOLOGIA, 2018, 6 (01): : 82 - 100
  • [33] Towards a Runtime Standard-based Testing Framework for Dynamic Distributed Information Systems
    Krichen, Moez
    Alroobaea, Roobaea
    Lahami, Mariam
    PROCEEDINGS OF THE 21ST INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS (ICEIS), VOL 1, 2019, : 121 - 129
  • [34] An Efficient Rule-Based Distributed Reasoning Framework for Resource-bounded Systems
    Rakib, Abdur
    Uddin, Ijaz
    MOBILE NETWORKS & APPLICATIONS, 2019, 24 (01): : 82 - 99
  • [35] An efficient approach based on identity for distributed data possession in multicloud using SelCSP framework
    Ahemad, Shaikh Rahil
    Dole, Lalit
    2017 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2017,
  • [36] An Efficient Rule-Based Distributed Reasoning Framework for Resource-bounded Systems
    Abdur Rakib
    Ijaz Uddin
    Mobile Networks and Applications, 2019, 24 : 82 - 99
  • [37] Towards a Trustworthy Semantics-Based Language Framework via Proof Generation
    Chen, Xiaohong
    Lin, Zhengyao
    Minh-Thai Trinh
    Rosu, Grigore
    COMPUTER AIDED VERIFICATION, PT II, CAV 2021, 2021, 12760 : 477 - 499
  • [38] Towards Edge-enabled Distributed Computing Framework for Heterogeneous Android-based Devices
    Yao, Yongtao
    Liu, Bin
    Zhao, Yiwei
    Shi, Weisong
    2022 IEEE/ACM 7TH SYMPOSIUM ON EDGE COMPUTING (SEC 2022), 2022, : 531 - 536
  • [39] Towards Efficient Tensor Decomposition-Based DNN Model Compression with Optimization Framework
    Yin, Miao
    Sui, Yang
    Liao, Siyu
    Yuan, Bo
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 10669 - 10678
  • [40] A HADOOP-BASED DISTRIBUTED FRAMEWORK FOR EFFICIENT MANAGING AND PROCESSING BIG REMOTE SENSING IMAGES
    Wang, C.
    Hu, F.
    Hu, X.
    Zhao, S.
    Wen, W.
    Yang, C.
    ISPRS International Workshop on Spatiotemporal Computing, 2015, : 63 - 66