GUISE: Uniform Sampling of Graphlets for Large Graph Analysis

Cited by: 65
Authors
Bhuiyan, Mansurul A. [1]
Rahman, Mahmudur [1]
Rahman, Mahmuda [2]
Al Hasan, Mohammad [1]
Affiliations
[1] Indiana Univ Purdue Univ, Dept Comp Sci, Indianapolis, IN 46202 USA
[2] Syracuse Univ, Dept Comp Sci, New York, NY USA
DOI
10.1109/ICDM.2012.87
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Graphlet frequency distribution (GFD) has recently become popular for characterizing large networks. However, computing the GFD of a network requires the exact count of the graphlets embedded in that network, which is computationally expensive. As a result, computing the GFD is practically infeasible even for a moderately large network. In this paper, we propose GUISE, which uses a Markov Chain Monte Carlo (MCMC) sampling method to construct an approximate GFD of a large network. Our experiments on networks with millions of nodes show that GUISE obtains the GFD within a few minutes, whereas the exhaustive-counting-based approach takes several days.
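
To make the contrast in the abstract concrete, the following is a minimal sketch, not the GUISE implementation. It assumes only 3-node graphlets (paths and triangles), an undirected graph stored as a dict of neighbor sets with comparable node labels, and a Metropolis-Hastings walk whose moves swap one node of the current graphlet. The function names (`exact_gfd`, `mh_gfd`, `neighbors`) and the move set are illustrative assumptions rather than details taken from the paper.

```python
import random
from itertools import combinations


def edge_count(adj, nodes):
    """Number of edges among the given nodes in the induced subgraph."""
    return sum(1 for u, v in combinations(nodes, 2) if v in adj[u])


def classify(adj, nodes):
    """Label a connected 3-node set as a 'triangle' (3 edges) or a 'path' (2 edges)."""
    return "triangle" if edge_count(adj, nodes) == 3 else "path"


def exact_gfd(adj):
    """Exhaustive baseline: enumerate every 3-node subset (cubic in the node count)."""
    counts = {"path": 0, "triangle": 0}
    for nodes in combinations(sorted(adj), 3):
        if edge_count(adj, nodes) >= 2:  # 3 nodes with >= 2 edges are connected
            counts[classify(adj, nodes)] += 1
    total = sum(counts.values())
    return {k: c / total for k, c in counts.items()}


def neighbors(adj, state):
    """Connected 3-node sets that differ from `state` in exactly one node."""
    result = []
    for out in state:
        keep = state - {out}
        candidates = set().union(*(adj[u] for u in keep)) - state
        for cand in candidates:
            new_state = keep | {cand}
            if edge_count(adj, new_state) >= 2:
                result.append(new_state)
    # Sort so that runs with the same seed are reproducible.
    return sorted(result, key=lambda s: tuple(sorted(s)))


def mh_gfd(adj, start, steps, seed=0):
    """Metropolis-Hastings walk targeting a uniform distribution over graphlets.

    Assumes `start` is a connected 3-node set that has at least one neighbor
    state, and that the walk can reach every graphlet from it.
    """
    rng = random.Random(seed)
    state = frozenset(start)
    nbrs = neighbors(adj, state)
    counts = {"path": 0, "triangle": 0}
    for _ in range(steps):
        proposal = rng.choice(nbrs)
        proposal_nbrs = neighbors(adj, proposal)
        # Accepting with min(1, d(x) / d(y)) removes the bias toward
        # graphlets that have many neighbor states.
        if rng.random() < min(1.0, len(nbrs) / len(proposal_nbrs)):
            state, nbrs = proposal, proposal_nbrs
        counts[classify(adj, state)] += 1
    return {k: c / steps for k, c in counts.items()}


if __name__ == "__main__":
    # Toy graph: a triangle 0-1-2 with a tail 2-3-4.
    toy = {0: {1, 2}, 1: {0, 2}, 2: {0, 1, 3}, 3: {2, 4}, 4: {3}}
    print("exact:  ", exact_gfd(toy))
    print("sampled:", mh_gfd(toy, start={0, 1, 2}, steps=20000))
```

On a toy graph both routines are cheap. The difference shows up on large graphs, where the exhaustive loop grows with the number of node triples while the walk only inspects the neighborhood of the current graphlet at each step.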
Pages: 91-100
Number of pages: 10