Efficient structural graph clustering: an index-based approach

被引:14
|
作者
Wen, Dong [1 ]
Qin, Lu [1 ]
Zhang, Ying [2 ]
Chang, Lijun [3 ]
Lin, Xuemin [4 ]
机构
[1] Univ Technol Sydney, Ctr Artificial Intelligence, Sydney, NSW, Australia
[2] Zhejiang Gongshang Univ, Hangzhou, Zhejiang, Peoples R China
[3] Univ Sydney, Sydney, NSW, Australia
[4] Univ New South Wales, Sydney, NSW, Australia
来源
VLDB JOURNAL | 2019年 / 28卷 / 03期
关键词
Graph clustering; SCAN; Dynamic graphs; I; O Efficient algorithms; ALGORITHM; DECOMPOSITION;
D O I
10.1007/s00778-019-00541-4
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Graph clustering is a fundamental problem widely applied in many applications. The structural graph clustering (SCAN) method obtains not only clusters but also hubs and outliers. However, the clustering results heavily depend on two parameters, E and mu, while the optimal parameter setting depends on different graph properties and various user requirements. In addition, all existing SCAN solutions need to scan at least the whole graph, even if only a small number of vertices belong to clusters. In this paper, we propose an index-based method for SCAN. Based on our index, we cluster the graph for any E and mu in O(Sigma C is an element of C|EC|) time, where C is the result set of all clusters and |EC| is the number of edges in a specific cluster C. In other words, the time spent on computing structural clustering depends only on the result size, not on the size of the original graph. Our index's space complexity is O(m), where m is the number of edges in the graph. To handle dynamic graph updates, we propose algorithms and several optimization techniques for maintaining our index. We also design an index for I/O efficient query processing. We conduct extensive experiments to evaluate the performance of all our proposed algorithms on 10 real-world networks, with the largest one containing more than 1 billion edges. The experimental results demonstrate that our approaches significantly outperform existing solutions.
引用
收藏
页码:377 / 399
页数:23
相关论文
共 50 条
  • [21] Damage Index-Based Lower Bound Structural Design
    Mitropoulou, Chara Ch
    Maranoz, Giuseppe C.
    Lagaros, Nikos D.
    FRONTIERS IN BUILT ENVIRONMENT, 2018, 4
  • [22] Efficient processing of similarity search under time warping in sequence databases: an index-based approach
    Kim, SW
    Park, S
    Chu, WW
    INFORMATION SYSTEMS, 2004, 29 (05) : 405 - 420
  • [23] COIN: Correlation Index-Based Similarity Measure for Clustering Categorical Data
    Sowmiya, N.
    Gupta, N.Srinivasa
    Natarajan, Elango
    Valarmathi, B.
    Elamvazuthi, I.
    Parasuraman, S.
    Kit, Chun Ang
    Freitas, Lídio Inácio
    Abraham Gnanamuthu, Ezra Morris
    Mathematical Problems in Engineering, 2022, 2022
  • [24] COIN: Correlation Index-Based Similarity Measure for Clustering Categorical Data
    Sowmiya, N.
    Gupta, N. Srinivasa
    Natarajan, Elango
    Valarmathi, B.
    Elamvazuthi, I.
    Parasuraman, S.
    Kit, Chun Ang
    Freitas, Lidio Inacio
    Abraham Gnanamuthu, Ezra Morris
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [25] An index-based checkpointing/recovery approach for distributed systems
    Gupta, B
    Banerjee, SK
    Wang, Z
    COMPUTERS AND THEIR APPLICATIONS, 2001, : 166 - 170
  • [26] An Efficient Sparse CNN Architecture with Index-based Kernel Transformation
    Chen, Po-Ting
    Yu, Shan-Chi
    Lin, Ing-Chao
    2024 IEEE THE 20TH ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS, APCCAS 2024, 2024, : 790 - 794
  • [27] An efficient index-based protein structure database searching method
    Aung, ZY
    Fu, W
    Tan, KL
    EIGHTH INTERNATIONAL CONFERENCE ON DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 2003, : 311 - 318
  • [28] Pythagorean Fuzzy Clustering Analysis: A Hierarchical Clustering Algorithm with the Ratio Index-Based Ranking Methods
    Zhang, Xiaolu
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2018, 33 (09) : 1798 - 1822
  • [29] Efficient scheduling of page access in index-based join processing
    Chan, CY
    Ooi, BC
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1997, 9 (06) : 1005 - 1011
  • [30] General Purpose Index-Based Method for Efficient MaxRS Query
    Zhou, Xiaoling
    Wang, Wei
    Xu, Jianliang
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2016, PT I, 2016, 9827 : 20 - 36