A density-based statistical analysis of graph clustering algorithm performance

被引:10
|
作者
Miasnikof, Pierre [1 ]
Shestopaloff, Alexander Y. [2 ]
Bonner, Anthony J. [1 ]
Lawryshyn, Yuri [1 ]
Pardalos, Panos M. [3 ,4 ]
机构
[1] Univ Toronto, Toronto, ON, Canada
[2] Alan Turing Inst, London, England
[3] Univ Florida, Gainesville, FL USA
[4] HSE Univ, Moscow, Russia
关键词
graph clustering; graph community detection; modularity; conductance; graph mining; network science; complex networks; social networks; unsupervised learning; data science; data analysis; COMMUNITY DETECTION; NETWORKS;
D O I
10.1093/comnet/cnaa012
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
We introduce graph clustering quality measures based on comparisons of global, intra- and inter-cluster densities, an accompanying statistical significance test and a step-by-step routine for clustering quality assessment. Our work is centred on the idea that well-clustered graphs will display a mean intra-cluster density that is higher than global density and mean inter-cluster density. We do not rely on any generative model for the null model graph. Our measures are shown to meet the axioms of a good clustering quality function. They have an intuitive graph-theoretic interpretation, a formal statistical interpretation and can be tested for significance. Empirical tests also show they are more responsive to graph structure, less likely to breakdown during numerical implementation and less sensitive to uncertainty in connectivity than the commonly used measures.
引用
收藏
页数:33
相关论文
共 50 条
  • [41] A new density-based scheme for clustering based on genetic algorithm
    Lin, CY
    Chang, CC
    FUNDAMENTA INFORMATICAE, 2005, 68 (04) : 315 - 331
  • [42] A Statistical Performance Analysis of Graph Clustering Algorithms
    Miasnikof, Pierre
    Shestopaloff, Alexander Y.
    Bonner, Anthony J.
    Lawryshyn, Yuri
    ALGORITHMS AND MODELS FOR THE WEB GRAPH (WAW 2018), 2018, 10836 : 170 - 184
  • [43] DCSNE: Density-based Clustering using Graph Shared Neighbors and Entropy
    Maheshwari, Rashmi
    Mohanty, Sraban Kumar
    Mishra, Amaresh Chandra
    PATTERN RECOGNITION, 2023, 137
  • [44] HGADC: Hierarchical Genetic Algorithm with Density-Based Clustering for TSP
    Song, Zhenghan
    Li, Yunyi
    Wang, Wenjun
    BIO-INSPIRED COMPUTING: THEORIES AND APPLICATIONS, PT 1, BIC-TA 2023, 2024, 2061 : 262 - 275
  • [45] Density-based particle swarm optimization algorithm for data clustering
    Alswaitti, Mohammed
    Albughdadi, Mohanad
    Isa, Nor Ashidi Mat
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 91 : 170 - 186
  • [46] Density-based clustering localization algorithm for wireless sensor networks
    Wang, Yong
    Hu, Liang-Liang
    Yuan, Chao-Yan
    Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2013, 42 (03): : 406 - 409
  • [47] Efficient incremental density-based algorithm for clustering large datasets
    Bakr, Ahmad M.
    Ghanem, Nagia M.
    Ismail, Mohamed A.
    ALEXANDRIA ENGINEERING JOURNAL, 2015, 54 (04) : 1147 - 1154
  • [48] A Grid and Density-based Clustering Algorithm for Processing Data Stream
    Jia, Chen
    Tan, ChengYu
    Yong, Ai
    SECOND INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTING: WGEC 2008, PROCEEDINGS, 2008, : 517 - +
  • [49] A New Approach on Density-Based Algorithm for Clustering Dense Areas
    Perchinunno, Paola
    L'Abbate, Samuela
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS, ICCSA 2022 WORKSHOPS, PT I, 2022, 13377 : 530 - 542
  • [50] An efficient automated incremental density-based algorithm for clustering and classification
    Azhir, Elham
    Navimipour, Nima Jafari
    Hosseinzadeh, Mehdi
    Sharifi, Arash
    Darwesh, Aso
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 114 (114): : 665 - 678