Distributed Set Label-Constrained Reachability Queries over Billion-Scale Graphs

Cited by: 3
Authors
Zeng, Yuanyuan [1]
Yang, Wangdong [1]
Zhou, Xu [1]
Xiao, Guoqing [1]
Gao, Yunjun [2]
Li, Kenli [1]
Affiliations
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha, Peoples R China
[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Peoples R China
Keywords
2-hop index; boundary graph; distributed system; graph traversal; EFFECTIVE COMMUNITY SEARCH;
DOI
10.1109/ICDE53745.2022.00193
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The set label-constrained reachability (SLCR) query in edge-labeled graphs is a building block of many graph-based applications. Formally, given two sets S and T of source and target vertices and a label set ζ, it returns all reachable vertex pairs (s, t) under the constraint of ζ, where s ∈ S and t ∈ T. Many index-based approaches can be applied to process the SLCR query. However, distributed approaches are desirable for large-scale graphs because of their good scalability and real-time response, and there is currently no efficient distributed approach to the SLCR query. Most index-based approaches face limitations in index construction and query performance when extended to a distributed environment for processing large-scale graphs. To alleviate these problems, we first build a boundary graph-based index (BoundG) to reduce the time overhead of index construction. Because the BoundG-based approach yields no noticeable improvement in query performance, we further construct a novel two-layer 2-hop index (TL2hop) and design a TL2hop-based query algorithm (TLQA) that integrates an early termination strategy to reduce the communication overhead and boost query performance. Experimental results over eight data graphs demonstrate that the index time of BoundG is comparable to that of the state of the art, and that TL2hop significantly outperforms the state-of-the-art technique in terms of query response time (up to 4 orders of magnitude speedup).
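To make the formal definition of the SLCR query concrete, here is a minimal single-machine sketch in Python. It is not the paper's BoundG or TL2hop method: it is a plain index-free baseline that runs one breadth-first search per source over the edges whose label is in the allowed set. The function name `slcr`, the `(u, v, label)` edge-tuple format, and the convention that a vertex trivially reaches itself are assumptions made for illustration.

```python
from collections import deque


def slcr(edges, sources, targets, labels):
    """Naive SLCR baseline (illustrative only, not the paper's algorithm).

    edges   : iterable of directed (u, v, label) tuples (assumed format)
    sources : set S of source vertices
    targets : set T of target vertices
    labels  : allowed label set (zeta in the abstract)

    Returns the set of pairs (s, t), s in S and t in T, such that t can be
    reached from s along edges whose labels all belong to `labels`.
    Assumption: every vertex reaches itself via the empty path.
    """
    # Keep only edges permitted by the label constraint.
    adj = {}
    for u, v, lab in edges:
        if lab in labels:
            adj.setdefault(u, []).append(v)

    targets = set(targets)
    result = set()
    for s in sources:
        # Standard BFS from s over the label-filtered adjacency lists.
        seen = {s}
        queue = deque([s])
        while queue:
            u = queue.popleft()
            if u in targets:
                result.add((s, u))
            for v in adj.get(u, ()):
                if v not in seen:
                    seen.add(v)
                    queue.append(v)
    return result


# Hypothetical toy graph: a -x-> b -y-> c and a -y-> d -y-> c.
toy_edges = [("a", "b", "x"), ("b", "c", "y"), ("a", "d", "y"), ("d", "c", "y")]
only_x = slcr(toy_edges, {"a"}, {"c"}, {"x"})
only_y = slcr(toy_edges, {"a"}, {"c"}, {"y"})
```

On this toy graph, `c` is not reachable from `a` when only label `x` is allowed, but it is reachable through `d` when only label `y` is allowed. The cost of one traversal per source is the kind of overhead that index-based approaches such as those described in the abstract aim to avoid.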
Pages: 1969-1981
Page count: 13
Related papers
50 in total
  • [41] Processing SPARQL queries over distributed RDF graphs
    Peng, Peng
    Zou, Lei
    Ozsu, M. Tamer
    Chen, Lei
    Zhao, Dongyan
    VLDB JOURNAL, 2016, 25 (02): : 243 - 268
  • [42] Straight-Line Grid Drawings of Label-Constrained Outerplanar Graphs with O(n log n) Area
    Karim, Md. Rezaul
    Alam, Md. Jawaherul
    Rahman, Md. Saidur
    WALCOM: ALGORITHMS AND COMPUTATION, PROCEEDINGS, 2009, 5431 : 310 - 321
  • [43] DistR: A Distributed Method for the Reachability Query over Large Uncertain Graphs
    Cheng, Yurong
    Yuan, Ye
    Chen, Lei
    Wang, Guoren
    Giraud-Carrier, Christophe
    Sun, Yongjiao
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2016, 27 (11) : 3172 - 3185
  • [44] Efficient Parallel Processing of Distance Join Queries Over Distributed Graphs
    Zhang, Xiaofei
    Chen, Lei
    Wang, Min
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (03) : 740 - 754
  • [45] Efficiently Answering Minimum Reachable Label Set Queries in Edge-Labeled Graphs
    Wu, Yanping
    Sun, Renjie
    Chen, Chen
    Wang, Xiaoyang
    Fu, Xianming
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 4585 - 4589
  • [46] Distributed constrained optimization over unbalanced graphs and delayed gradient
    Huang, Qing
    Fan, Yuan
    Cheng, Songsong
    JOURNAL OF THE FRANKLIN INSTITUTE, 2025, 362 (02)
  • [47] An Efficient Index-Based Approach to Distributed Set Reachability on Small-World Graphs
    Zeng, Yuanyuan
    Li, Kenli
    Zhou, Xu
    Luo, Wensheng
    Gao, Yunjun
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (10) : 2358 - 2371
  • [48] Workload prediction and balance for distributed reachability processing for large-scale attribute graphs
    Ho, Li-Yung
    Wu, Jan-Jan
    Liu, Pangfeng
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2018, 30 (06):
  • [49] Ganite: A distributed engine for scalable path queries over temporal property graphs
    Ramesh, Shriram
    Baranawal, Animesh
    Simmhan, Yogesh
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2021, 151 : 94 - 111
  • [50] Distributed Proximal Point Algorithm for Constrained Optimization over Unbalanced Graphs
    Li, Xiuxian
    Feng, Gang
    Xie, Lihua
    2019 IEEE 15TH INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2019, : 824 - 829