GUISE: Uniform Sampling of Graphlets for Large Graph Analysis

被引:65
|
作者
Bhuiyan, Mansurul A. [1 ]
Rahman, Mahmudur [1 ]
Rahman, Mahmuda [2 ]
Al Hasan, Mohammad [1 ]
机构
[1] Indiana Univ Purdue Univ, Dept Comp Sci, Indianapolis, IN 46202 USA
[2] Syracuse Univ, Dept Comp Sci, New York, NY USA
关键词
D O I
10.1109/ICDM.2012.87
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graphlet frequency distribution (GFD) has recently become popular for characterizing large networks. However, the computation of GFD for a network requires the exact count of embedded graphlets in that network, which is a computationally expensive task. As a result, it is practically infeasible to compute the GFD for even a moderately large network. In this paper, we propose GUISE, which uses a Markov Chain Monte Carlo (MCMC) sampling method for constructing the approximate GFD of a large network. Our experiments on networks with millions of nodes show that GUISE obtains the GFD within few minutes, whereas the exhaustive counting based approach takes several days.
引用
收藏
页码:91 / 100
页数:10
相关论文
共 50 条
  • [41] The Noise Characteristic Analysis of the Periodically Non-uniform Sampling
    Wu, Shuai
    Jiang, Kaili
    Zhu, Jun
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, 2019, 463 : 1482 - 1489
  • [42] A subgraph sampling method for training large-scale graph convolutional network
    Zhang, Qi
    Sun, Yanfeng
    Hu, Yongli
    Wang, Shaofan
    Yin, Baocai
    INFORMATION SCIENCES, 2023, 649
  • [43] Context-aware Sampling of Large Networks via Graph Representation Learning
    Zhou, Zhiguang
    Shi, Chen
    Shen, Xilong
    Cai, Lihong
    Wang, Haoxuan
    Liu, Yuhua
    Zhao, Ying
    Chen, Wei
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2021, 27 (02) : 1709 - 1719
  • [44] Global triangle estimation based on first edge sampling in large graph streams
    Yu, Changyong
    Liu, Huimin
    Wahab, Fazal
    Ling, Zihan
    Ren, Tianmei
    Ma, Haitao
    Zhao, Yuhai
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (13): : 14079 - 14116
  • [45] A Sampling-Based Graph Clustering Algorithm for Large-Scale Networks
    Zhang J.-P.
    Chen H.-C.
    Wang K.
    Zhu K.-J.
    Wang Y.-W.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2019, 47 (08): : 1731 - 1737
  • [46] Global triangle estimation based on first edge sampling in large graph streams
    Changyong Yu
    Huimin Liu
    Fazal Wahab
    Zihan Ling
    Tianmei Ren
    Haitao Ma
    Yuhai Zhao
    The Journal of Supercomputing, 2023, 79 : 14079 - 14116
  • [47] Randomized graph sampling
    Ducey, Mark J.
    ENVIRONMENTAL AND ECOLOGICAL STATISTICS, 2012, 19 (01) : 1 - 21
  • [48] Randomized graph sampling
    Mark J. Ducey
    Environmental and Ecological Statistics, 2012, 19 : 1 - 21
  • [49] Sampling the eigenvalues of a graph
    Baskaya, E.
    Boumenir, A.
    SAMPLING THEORY SIGNAL PROCESSING AND DATA ANALYSIS, 2023, 21 (01):
  • [50] Graph spatial sampling
    Zhang, Li-Chun
    STAT, 2024, 13 (03):