GUISE: Uniform Sampling of Graphlets for Large Graph Analysis

被引:65
|
作者
Bhuiyan, Mansurul A. [1 ]
Rahman, Mahmudur [1 ]
Rahman, Mahmuda [2 ]
Al Hasan, Mohammad [1 ]
机构
[1] Indiana Univ Purdue Univ, Dept Comp Sci, Indianapolis, IN 46202 USA
[2] Syracuse Univ, Dept Comp Sci, New York, NY USA
关键词
D O I
10.1109/ICDM.2012.87
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graphlet frequency distribution (GFD) has recently become popular for characterizing large networks. However, the computation of GFD for a network requires the exact count of embedded graphlets in that network, which is a computationally expensive task. As a result, it is practically infeasible to compute the GFD for even a moderately large network. In this paper, we propose GUISE, which uses a Markov Chain Monte Carlo (MCMC) sampling method for constructing the approximate GFD of a large network. Our experiments on networks with millions of nodes show that GUISE obtains the GFD within few minutes, whereas the exhaustive counting based approach takes several days.
引用
收藏
页码:91 / 100
页数:10
相关论文
共 50 条
  • [1] GUISE: a uniform sampler for constructing frequency histogram of graphlets
    Rahman, Mahmudur
    Bhuiyan, Mansurul Alam
    Rahman, Mahmuda
    Al Hasan, Mohammad
    KNOWLEDGE AND INFORMATION SYSTEMS, 2014, 38 (03) : 511 - 536
  • [2] GUISE: a uniform sampler for constructing frequency histogram of graphlets
    Mahmudur Rahman
    Mansurul Alam Bhuiyan
    Mahmuda Rahman
    Mohammad Al Hasan
    Knowledge and Information Systems, 2014, 38 : 511 - 536
  • [3] Uniform random sampling not recommended for large graph size estimation
    Lu, Jianguo
    Wang, Hao
    INFORMATION SCIENCES, 2017, 421 : 136 - 153
  • [4] SCALABLE UNIFORM GRAPH SAMPLING BY LOCAL COMPUTATION
    Elisa Schaeffer, Satu
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2010, 32 (05): : 2937 - 2963
  • [5] From quasirandom graphs to graph limits and graphlets
    Chung, Fan
    ADVANCES IN APPLIED MATHEMATICS, 2014, 56 : 135 - 174
  • [6] A Survey of Large Graph Sampling Techniques
    Zhang X.
    Ni Y.
    Li S.
    Gao G.
    Fang L.
    Wang Y.
    Zhao Y.
    Zhou Z.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2022, 34 (12): : 1805 - 1814
  • [7] Variance reduction in large graph sampling
    Lu, Jianguo
    Wang, Hao
    INFORMATION PROCESSING & MANAGEMENT, 2014, 50 (03) : 476 - 491
  • [8] A Graph Entropy Measure From Urelement to Higher-Order Graphlets for Network Analysis
    Huang, Ru
    Chen, Zijian
    Zhai, Guangtao
    He, Jianhua
    Chu, Xiaoli
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2023, 10 (02): : 631 - 644
  • [9] Differentiating Non-Isomorphic Graphlets for Graph Analytics
    Alumbaugh, John Calvin
    Li, Qinghua
    Hu, Vincent
    2016 IEEE 2ND INTERNATIONAL CONFERENCE ON COLLABORATION AND INTERNET COMPUTING (IEEE CIC), 2016, : 349 - 356
  • [10] An Efficient Algorithm for Sampling of a Single Large Graph
    Bhatia, Vandana
    Rani, Rinkle
    2017 TENTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2017, : 32 - 37