An Efficient Index-Based Approach to Distributed Set Reachability on Small-World Graphs

被引:4
|
作者
Zeng, Yuanyuan [1 ,2 ]
Li, Kenli [1 ,2 ]
Zhou, Xu [1 ,2 ]
Luo, Wensheng [1 ,2 ]
Gao, Yunjun [3 ]
机构
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410012, Peoples R China
[2] Natl Supercomp Ctr, Changsha 410082, Peoples R China
[3] Zhejiang Univ, Key Lab Big Data Intelligent Comp Zhejiang Provin, Hangzhou 310027, Peoples R China
关键词
2-hop labeling; distributed processing; indexing; set reachabiity; QUERIES;
D O I
10.1109/TPDS.2021.3139111
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Set reachability query in directed graphs has a plethora of graph-based applications such as dependency analysis and graph centrality calculation. Given two sets S and T of source and target vertices, set reachability query needs to acquire all pairs (s, t) where s is an element of S and t is an element of T and s can reach t. The state-of-the-art approach distributed set reachability (DSR) investigates the set reachability query in a distributed environment and adopts a static graph-based index to enhance the query efficiency. Nevertheless, DSR needs to store the graph-based index in all partitions, which causes a huge space overhead. Furthermore, it cannot efficiently solve the negative query (s, t) where s cannot reach t, since DSR needs to traverse the whole reachable paths and becomes unable to efficiently reduce the computations. To alleviate these issues, we propose a novel multi-level 2-hop (ML2hop) index for the set reachability query in a distributed environment. Based on ML2hop, we further present a bi-directional query algorithm, called MLQA, to achieve efficient support for both positive and negative queries in Pregel-like systems. Generally, MLQA is equipped with the following three significant properties: (1) Low computation costs. It reduces redundant local computations in each partition by controlling the rounds of path traversals. (2) Low communication costs. It restricts the message exchange among different partitions within one single round with guaranteed accuracy of query results. (3) High parallelism. It adopts a bi-directional query technique for message propagation, achieving the better query efficiency than the forward-traversal query strategy utilized in DSR. Experimental results over several real-world graphs demonstrate that MLQA significantly outperforms the state-of-the-art algorithm by up to two orders of magnitude speedup.
引用
收藏
页码:2358 / 2371
页数:14
相关论文
共 50 条
  • [41] iTri: Index-based triangle listing in massive graphs
    Rase, Mostofa Kamal
    Han, Yongkoo
    Kim, Jinseung
    Park, Kisung
    Nguyen Anh Tu
    Lee, Young-Koo
    INFORMATION SCIENCES, 2016, 336 : 1 - 20
  • [42] Deterministic small-world graphs and the eigenvalue power law of Internet
    Comellas, F
    Gago, S
    I-SPAN 2004: 7TH INTERNATIONAL SYMPOSIUM ON PARALLEL ARCHITECTURES, ALGORITHMS AND NETWORKS, PROCEEDINGS, 2004, : 374 - 379
  • [43] Recursive graphs with small-world scale-free properties
    Comellas, F
    Fertin, G
    Raspaud, A
    PHYSICAL REVIEW E, 2004, 69 (03) : 037104 - 1
  • [44] On the Hyperbolicity of Small-World and Tree-Like Random Graphs
    Chen, Wei
    Fang, Wenjie
    Hu, Guangda
    Mahoney, Michael W.
    ALGORITHMS AND COMPUTATION, ISAAC 2012, 2012, 7676 : 278 - 288
  • [45] Algebraic approach to small-world network models
    Rudolph-Lilith, Michelle
    Muller, Lyle E.
    PHYSICAL REVIEW E, 2014, 89 (01)
  • [46] An Efficient Method of Generating Deterministic Small-World and Scale-Free Graphs for Simulating Real-World Networks
    Jiang, Wenchao
    Zhai, Yinhu
    Zhuang, Zhigang
    Martin, Paul
    Zhao, Zhiming
    Liu, Jia-Bao
    IEEE ACCESS, 2018, 6 : 59833 - 59842
  • [47] An index-based checkpointing algorithm for autonomous distributed systems
    Baldoni, R
    Quaglia, F
    Fornara, P
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1999, 10 (02) : 181 - 192
  • [48] Index-based query processing on distributed multidimensional data
    Tsatsanifos, George
    Sacharidis, Dimitris
    Sellis, Timos
    GEOINFORMATICA, 2013, 17 (03) : 489 - 519
  • [49] Efficient associative memory using small-world architecture
    Bohland, JW
    Minai, AA
    NEUROCOMPUTING, 2001, 38 : 489 - 496
  • [50] Index-based query processing on distributed multidimensional data
    George Tsatsanifos
    Dimitris Sacharidis
    Timos Sellis
    GeoInformatica, 2013, 17 : 489 - 519