Distributed Set Label-Constrained Reachability Queries over Billion-Scale Graphs

被引:3
|
作者
Zeng, Yuanyuan [1 ]
Yang, Wangdong [1 ]
Zhou, Xu [1 ]
Xiao, Guoqing [1 ]
Gao, Yunjun [2 ]
Li, Kenli [1 ]
机构
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha, Peoples R China
[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Peoples R China
关键词
2-hop index; boundary graph; distributed system; graph traversal; EFFECTIVE COMMUNITY SEARCH;
D O I
10.1109/ICDE53745.2022.00193
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Set label-constrained reachability (SLCR) query in edge-labeled graphs is a building block of many graph-based applications. Formally, given two sets S and T of source and target vertices and a label set zeta, it returns all reachable vertex pairs (s; t) under the constraint of zeta, where s2S and t2T. There have been abundant index-based approaches to be applied to process the SLCR query. However, distributed approaches are desirable to process large-scale graphs because of the advantages of good scalability and real-time response. Now, there is no efficient distributed approach to the SLCR query. Most indexbased approaches face limitations in terms of index construction and query performance when being extended to the distributed environment for processing large-scale graphs. To alleviate these problems, we first build a boundary graph-based index (BoundG) to reduce the time overhead of index construction. Consider the query performance of the BoundG-based approach has no noticeable improvement. We further construct a novel two layers 2-hop index (TL2hop), and a TL2hop-based query algorithm (TLQA) is designed by integrating an early termination strategy that reduces the communication overhead and boosts the query performance. Experimental results over eight data graphs demonstrate that the index time of BoundG is comparable to that of the state-of-the-art, and TL2hop significantly outperforms the state-of-the-art technique in terms of query response time (up to 4 orders of magnitude speedup).
引用
收藏
页码:1969 / 1981
页数:13
相关论文
共 50 条
  • [21] Hop-Constrained s-t Simple Path Enumeration in Billion-Scale Labelled Graphs
    Li, Xia
    Hao, Kongzhang
    Yang, Zhengyi
    Cao, Xin
    Zhang, Wenjie
    Yuan, Long
    Lin, Xuemin
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2022, 2022, 13724 : 49 - 64
  • [22] Efficient MapReduce algorithms for triangle listing in billion-scale graphs
    Yuanyuan Zhu
    Hao Zhang
    Lu Qin
    Hong Cheng
    Distributed and Parallel Databases, 2017, 35 : 149 - 176
  • [23] Efficient MapReduce algorithms for triangle listing in billion-scale graphs
    Zhu, Yuanyuan
    Zhang, Hao
    Qin, Lu
    Cheng, Hong
    DISTRIBUTED AND PARALLEL DATABASES, 2017, 35 (02) : 149 - 176
  • [24] Label and Distance-Constraint Reachability Queries in Uncertain Graphs
    Chen, Minghan
    Gu, Yu
    Bao, Yubin
    Yu, Ge
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2014, PT I, 2014, 8421 : 188 - 202
  • [25] Efficient Label-Constrained Shortest Path Queries on Road Networks: A Tree Decomposition Approach
    Zhang, Junhua
    Yuan, Long
    Li, Wentao
    Qin, Lu
    Zhang, Ying
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2021, 15 (03): : 686 - 698
  • [26] Efficient processing of label-constraint reachability queries in large graphs
    Zou, Lei
    Xu, Kun
    Yu, Jeffrey Xu
    Chen, Lei
    Xiao, Yanghua
    Zhao, Dongyan
    INFORMATION SYSTEMS, 2014, 40 : 47 - 66
  • [27] TGL: A General Framework for Temporal GNN Training on Billion-Scale Graphs
    Zhou, Hongkuan
    Zheng, Da
    Nisa, Israt
    Ioannidis, Vasileios
    Song, Xiang
    Karypis, George
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2022, 15 (08): : 1572 - 1580
  • [28] A Practical Distributed ADMM Solver for Billion-Scale Generalized Assignment Problems
    Zhou, Jun
    Qi, Feng
    Hua, Zhigang
    Jian, Daohong
    Liu, Ziqi
    Wu, Hua
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 3715 - 3724
  • [29] The Reachability Query over Distributed Uncertain Graphs
    Cheng, Yurong
    Yuan, Ye
    Chen, Lei
    Wang, Guoren
    2015 IEEE 35TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, 2015, : 786 - 787
  • [30] Answering reachability and K-reach queries on large graphs with label constraints
    You Peng
    Xuemin Lin
    Ying Zhang
    Wenjie Zhang
    Lu Qin
    The VLDB Journal, 2022, 31 : 101 - 127