Distributed Set Label-Constrained Reachability Queries over Billion-Scale Graphs

被引:3
|
作者
Zeng, Yuanyuan [1 ]
Yang, Wangdong [1 ]
Zhou, Xu [1 ]
Xiao, Guoqing [1 ]
Gao, Yunjun [2 ]
Li, Kenli [1 ]
机构
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha, Peoples R China
[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Peoples R China
关键词
2-hop index; boundary graph; distributed system; graph traversal; EFFECTIVE COMMUNITY SEARCH;
D O I
10.1109/ICDE53745.2022.00193
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Set label-constrained reachability (SLCR) query in edge-labeled graphs is a building block of many graph-based applications. Formally, given two sets S and T of source and target vertices and a label set zeta, it returns all reachable vertex pairs (s; t) under the constraint of zeta, where s2S and t2T. There have been abundant index-based approaches to be applied to process the SLCR query. However, distributed approaches are desirable to process large-scale graphs because of the advantages of good scalability and real-time response. Now, there is no efficient distributed approach to the SLCR query. Most indexbased approaches face limitations in terms of index construction and query performance when being extended to the distributed environment for processing large-scale graphs. To alleviate these problems, we first build a boundary graph-based index (BoundG) to reduce the time overhead of index construction. Consider the query performance of the BoundG-based approach has no noticeable improvement. We further construct a novel two layers 2-hop index (TL2hop), and a TL2hop-based query algorithm (TLQA) is designed by integrating an early termination strategy that reduces the communication overhead and boosts the query performance. Experimental results over eight data graphs demonstrate that the index time of BoundG is comparable to that of the state-of-the-art, and TL2hop significantly outperforms the state-of-the-art technique in terms of query response time (up to 4 orders of magnitude speedup).
引用
收藏
页码:1969 / 1981
页数:13
相关论文
共 50 条
  • [1] Answering Billion-Scale Label-Constrained Reachability Queries within Microsecond
    Peng, You
    Zhang, Ying
    Lin, Xuemin
    Qin, Lu
    Zhang, Wenjie
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2020, 13 (06): : 812 - 825
  • [2] DLQ: A System for Label-Constrained Reachability Queries on Dynamic Graphs
    Peng, You
    Zhao, Wenjie
    Zhang, Wenjie
    Lin, Xuemin
    Zhang, Ying
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 4764 - 4768
  • [3] I/O Efficient Label-Constrained Reachability Queries in Large Graphs
    Yuan, Long
    Li, Xia
    Chen, Zi
    Lin, Xuemin
    Zhao, Xiang
    Zhang, Wenjie
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2024, 17 (10): : 2590 - 2602
  • [4] DLCR : Efficient Indexing for Label-Constrained Reachability Queries on Large Dynamic Graphs
    Chen, Xin
    Peng, You
    Wang, Sibo
    Yu, Jeffrey Xu
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2022, 15 (08): : 1645 - 1657
  • [5] Landmark Indexing for Evaluation of Label-Constrained Reachability Queries
    Valstar, Lucien D. J.
    Fletcher, George H. L.
    Yoshida, Yuichi
    SIGMOD'17: PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2017, : 345 - 358
  • [6] Graph Indexing for Efficient Evaluation of Label-constrained Reachability Queries
    Chen, Yangjun
    Singh, Gagandeep
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2021, 46 (02):
  • [7] DistDGL: Distributed Graph Neural Network Training for Billion-Scale Graphs
    Zheng, Da
    Ma, Chao
    Wang, Minjie
    Zhou, Jinjing
    Su, Qidong
    Song, Xiang
    Gan, Quan
    Zhang, Zheng
    George Karypis
    PROCEEDINGS OF IA3 2020: 2020 IEEE/ACM 10TH WORKSHOP ON IRREGULAR APPLICATIONS: ARCHITECTURES AND ALGORITHMS (IA3), 2020, : 36 - 44
  • [8] PEGASUS: MINING BILLION-SCALE GRAPHS IN THE CLOUD
    Kang, U.
    Chau, Duen Horng Polo
    Faloutsos, Christos
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 5341 - 5344
  • [9] Answering reachability queries with ordered label constraints over labeled graphs
    He, Daoliang
    Yuan, Pingpeng
    Jin, Hai
    FRONTIERS OF COMPUTER SCIENCE, 2024, 18 (01)
  • [10] Distributed Reachability Queries on Massive Graphs
    Zhang, Tianming
    Gao, Yunjun
    Li, Congzheng
    Ge, Congcong
    Guo, Wei
    Zhou, Qiang
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2019, 11448 : 406 - 410