Efficient structural graph clustering: an index-based approach

Cited by: 14
Authors
Wen, Dong [1]
Qin, Lu [1]
Zhang, Ying [2]
Chang, Lijun [3]
Lin, Xuemin [4]
Affiliations
[1] Univ Technol Sydney, Ctr Artificial Intelligence, Sydney, NSW, Australia
[2] Zhejiang Gongshang Univ, Hangzhou, Zhejiang, Peoples R China
[3] Univ Sydney, Sydney, NSW, Australia
[4] Univ New South Wales, Sydney, NSW, Australia
Source
VLDB JOURNAL | 2019 / Volume 28 / Issue 03
Keywords
Graph clustering; SCAN; Dynamic graphs; I/O efficient algorithms; ALGORITHM; DECOMPOSITION;
DOI
10.1007/s00778-019-00541-4
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Graph clustering is a fundamental problem widely applied in many applications. The structural graph clustering (SCAN) method obtains not only clusters but also hubs and outliers. However, the clustering results heavily depend on two parameters, $\epsilon$ and $\mu$, while the optimal parameter setting depends on different graph properties and various user requirements. In addition, all existing SCAN solutions need to scan at least the whole graph, even if only a small number of vertices belong to clusters. In this paper, we propose an index-based method for SCAN. Based on our index, we cluster the graph for any $\epsilon$ and $\mu$ in $O(\sum_{C \in \mathbb{C}} |E_C|)$ time, where $\mathbb{C}$ is the result set of all clusters and $|E_C|$ is the number of edges in a specific cluster $C$. In other words, the time spent on computing structural clustering depends only on the result size, not on the size of the original graph. Our index's space complexity is $O(m)$, where $m$ is the number of edges in the graph. To handle dynamic graph updates, we propose algorithms and several optimization techniques for maintaining our index. We also design an index for I/O efficient query processing. We conduct extensive experiments to evaluate the performance of all our proposed algorithms on 10 real-world networks, with the largest one containing more than 1 billion edges. The experimental results demonstrate that our approaches significantly outperform existing solutions.
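For orientation, the sketch below shows what a plain, non-indexed SCAN pass looks like. It is not the index-based algorithm proposed in the paper; it is the baseline that, as the abstract notes, has to touch the whole graph. The abstract does not define the similarity measure, so the code assumes the standard SCAN formulation: structural similarity sigma(u, v) = |N[u] ∩ N[v]| / sqrt(|N[u]| · |N[v]|) over closed neighbourhoods, cores as vertices with at least mu epsilon-similar neighbours (counting themselves), hubs as unclustered vertices adjacent to two or more clusters, and outliers as the remaining unclustered vertices. The names `build_adj` and `scan` are hypothetical and chosen only for illustration.

```python
from collections import deque
from math import sqrt


def build_adj(edges):
    """Build an undirected adjacency dict {vertex: set(neighbours)} from edge pairs."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def scan(adj, eps, mu):
    """Baseline SCAN without any index (illustrative sketch).

    Returns (clusters, hubs, outliers); clusters is a list of vertex sets.
    """
    # Closed neighbourhood N[v]: the neighbours of v plus v itself.
    closed = {v: nbrs | {v} for v, nbrs in adj.items()}

    def sigma(u, v):
        # Structural similarity (assumed standard SCAN definition).
        return len(closed[u] & closed[v]) / sqrt(len(closed[u]) * len(closed[v]))

    # eps-neighbourhood: members of N[u] whose similarity to u is at least eps.
    eps_nbrs = {u: {v for v in closed[u] if sigma(u, v) >= eps} for u in adj}
    # A core has at least mu vertices in its eps-neighbourhood.
    cores = {u for u in adj if len(eps_nbrs[u]) >= mu}

    clusters, visited = [], set()
    for start in adj:
        if start not in cores or start in visited:
            continue
        # Grow one cluster by breadth-first search; only cores are expanded.
        cluster, queue = {start}, deque([start])
        visited.add(start)
        while queue:
            u = queue.popleft()
            for v in eps_nbrs[u]:
                cluster.add(v)  # non-core border vertices join but do not expand
                if v in cores and v not in visited:
                    visited.add(v)
                    queue.append(v)
        clusters.append(cluster)

    clustered = set().union(*clusters)
    hubs, outliers = set(), set()
    for v in adj:
        if v in clustered:
            continue
        # Indices of the clusters that contain at least one neighbour of v.
        touched = {i for i, c in enumerate(clusters) if adj[v] & c}
        (hubs if len(touched) >= 2 else outliers).add(v)
    return clusters, hubs, outliers
```

Every call recomputes all similarities, so trying a different epsilon or mu repeats the full pass over the graph. That repeated cost is what the paper's index is designed to avoid.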
Pages: 377-399
Page count: 23
Related papers
50 in total
  • [1] Efficient structural graph clustering: an index-based approach
    Dong Wen
    Lu Qin
    Ying Zhang
    Lijun Chang
    Xuemin Lin
    The VLDB Journal, 2019, 28 : 377 - 399
  • [2] Efficient Structural Graph Clustering: An Index-Based Approach
    Wen, Dong
    Qin, Lu
    Zhang, Ying
    Chang, Lijun
    Lin, Xuemin
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 11 (03): : 243 - 255
  • [3] Structural graph clustering on signed graphs: An index-based approach
    Zhao, Zheng
    Li, Wei
    Wang, Xiao
    Meng, Xiangxu
    Zheng, Xiangping
    Wang, Chenhao
    INFORMATION SCIENCES, 2025, 699
  • [4] Parallel Index-Based Structural Graph Clustering and Its Approximation
    Tseng, Tom
    Dhulipala, Laxman
    Shun, Julian
    SIGMOD '21: PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2021, : 1851 - 1864
  • [5] Index-based Structural Clustering on Directed Graphs
    Meng, Lingkai
    Yuan, Long
    Chen, Zi
    Lin, Xuemin
    Yang, Shiyu
    2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 2831 - 2844
  • [6] Efficient reachability queries in multi-relation graph: An index-based approach
    Liu, Xijuan
    Zhang, Mengqi
    Fu, Xianming
    Chen, Chen
    Wang, Xiaoyang
    Wu, Yanping
    COMPUTERS & ELECTRICAL ENGINEERING, 2021, 96
  • [7] Index-Based Solutions for Efficient Density Peak Clustering
    Rasool, Zafaryab
    Zhou, Rui
    Chen, Lu
    Liu, Chengfei
    Xu, Jiajie
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (05) : 2212 - 2226
  • [8] Efficient (α, β)-core Computation: an Index-based Approach
    Liu, Boge
    Yuan, Long
    Lin, Xuemin
    Qin, Lu
    Zhang, Wenjie
    Zhou, Jingren
    WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 1130 - 1141
  • [9] Index-based Solutions for Efficient Density Peak Clustering (Extended Abstract)
    Rasool, Zafaryab
    Zhou, Rui
    Chen, Lu
    Liu, Chengfei
    Xu, Jiajie
    2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 2342 - 2343
  • [10] Correlation clustering algorithm for dynamic complete signed graphs: an index-based approach
    Shakiba, Ali
    KNOWLEDGE AND INFORMATION SYSTEMS, 2025,