Efficient structural graph clustering: an index-based approach

被引:14
|
作者
Wen, Dong [1 ]
Qin, Lu [1 ]
Zhang, Ying [2 ]
Chang, Lijun [3 ]
Lin, Xuemin [4 ]
机构
[1] Univ Technol Sydney, Ctr Artificial Intelligence, Sydney, NSW, Australia
[2] Zhejiang Gongshang Univ, Hangzhou, Zhejiang, Peoples R China
[3] Univ Sydney, Sydney, NSW, Australia
[4] Univ New South Wales, Sydney, NSW, Australia
来源
VLDB JOURNAL | 2019年 / 28卷 / 03期
关键词
Graph clustering; SCAN; Dynamic graphs; I; O Efficient algorithms; ALGORITHM; DECOMPOSITION;
D O I
10.1007/s00778-019-00541-4
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Graph clustering is a fundamental problem widely applied in many applications. The structural graph clustering (SCAN) method obtains not only clusters but also hubs and outliers. However, the clustering results heavily depend on two parameters, E and mu, while the optimal parameter setting depends on different graph properties and various user requirements. In addition, all existing SCAN solutions need to scan at least the whole graph, even if only a small number of vertices belong to clusters. In this paper, we propose an index-based method for SCAN. Based on our index, we cluster the graph for any E and mu in O(Sigma C is an element of C|EC|) time, where C is the result set of all clusters and |EC| is the number of edges in a specific cluster C. In other words, the time spent on computing structural clustering depends only on the result size, not on the size of the original graph. Our index's space complexity is O(m), where m is the number of edges in the graph. To handle dynamic graph updates, we propose algorithms and several optimization techniques for maintaining our index. We also design an index for I/O efficient query processing. We conduct extensive experiments to evaluate the performance of all our proposed algorithms on 10 real-world networks, with the largest one containing more than 1 billion edges. The experimental results demonstrate that our approaches significantly outperform existing solutions.
引用
收藏
页码:377 / 399
页数:23
相关论文
共 50 条
  • [31] Index-based hyperlinks
    Hartman, JH
    Proebsting, TA
    Sundaram, R
    COMPUTER NETWORKS AND ISDN SYSTEMS, 1997, 29 (8-13): : 1129 - 1135
  • [32] An efficient graph embedding clustering approach for heterogeneous network
    Sajjadi, Zahra Sadat
    Esmaeili, Mahdi
    Ghobaei-Arani, Mostafa
    Minaei-Bidgoli, Behrouz
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (13): : 19562 - 19591
  • [33] An efficient graph theoretic approach to video scene clustering
    Lu, H
    Tan, YP
    ICICS-PCM 2003, VOLS 1-3, PROCEEDINGS, 2003, : 1782 - 1786
  • [34] An efficient structural index for graph-structured data
    Fan, Yingjie
    Zhang, Chenghong
    Wang, Shuyun
    Hao, Xiulan
    Hu, Yunfa
    7TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE IN CONJUNCTION WITH 2ND IEEE/ACIS INTERNATIONAL WORKSHOP ON E-ACTIVITY, PROCEEDINGS, 2008, : 100 - +
  • [35] Graph Clustering Based on Structural/Attribute Similarities
    Zhou, Yang
    Cheng, Hong
    Yu, Jeffrey Xu
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2009, 2 (01): : 718 - 729
  • [36] Graph clustering based on structural similarity of fragments
    Yoshida, T
    Shoda, R
    Motoda, H
    FEDERATION OVER THE WEB, 2006, 3847 : 97 - 114
  • [37] Phase lag index-based graph attention networks for detecting driving fatigue
    Wang, Zhongmin
    Zhao, Yupeng
    He, Yan
    Zhang, Jie
    REVIEW OF SCIENTIFIC INSTRUMENTS, 2021, 92 (09):
  • [38] An Index-Based Approach to Assess Social Vulnerability for Hamburg, Germany
    von Szombathely, Malte
    Hanf, Franziska S.
    Bareis, Janka
    Meier, Linda
    Ossenbruegge, Juergen
    Pohl, Thomas
    INTERNATIONAL JOURNAL OF DISASTER RISK SCIENCE, 2023, 14 (05) : 782 - 794
  • [39] An Index-Based Approach to Assess Social Vulnerability for Hamburg, Germany
    Malte von Szombathely
    Franziska S. Hanf
    Janka Bareis
    Linda Meier
    Jürgen Oßenbrügge
    Thomas Pohl
    International Journal of Disaster Risk Science, 2023, 14 : 782 - 794
  • [40] A health index-based approach for fuel cell lifetime estimation
    Wu, Hangyu
    Zhu, Wenchao
    Guo, Bingxin
    Li, Changzhi
    Wu, Hangyu
    Zhang, Ruiming
    Zhu, Wenchao
    Xie, Changjun
    Li, Yang
    Yang, Yang
    Guo, Bingxin
    Li, Changzhi
    Xiong, Rui
    ISCIENCE, 2024, 27 (11)