GUISE: a uniform sampler for constructing frequency histogram of graphlets

被引:0
|
作者
Mahmudur Rahman
Mansurul Alam Bhuiyan
Mahmuda Rahman
Mohammad Al Hasan
机构
[1] Indiana University–Purdue University,Department of Computer Science
[2] Syracuse University,Department of Computer Science
来源
关键词
Graphlet counting; MCMC sampling; Graph analysis ; Graph mining; Graphlet sampling; Graphlet degree distribution; Uniform sampling; Subgraph concentration;
D O I
暂无
中图分类号
学科分类号
摘要
Graphlet frequency distribution (GFD) has recently become popular for characterizing large networks. However, the computation of GFD for a network requires the exact count of embedded graphlets in that network, which is a computationally expensive task. As a result, it is practically infeasible to compute the GFD for even a moderately large network. In this paper, we propose Guise, which uses a Markov Chain Monte Carlo sampling method for constructing the approximate GFD of a large network. Our experiments on networks with millions of nodes show that Guise obtains the GFD with very low rate of error within few minutes, whereas the exhaustive counting-based approach takes several days.
引用
收藏
页码:511 / 536
页数:25
相关论文
共 50 条