A density-based statistical analysis of graph clustering algorithm performance

被引:10
|
作者
Miasnikof, Pierre [1 ]
Shestopaloff, Alexander Y. [2 ]
Bonner, Anthony J. [1 ]
Lawryshyn, Yuri [1 ]
Pardalos, Panos M. [3 ,4 ]
机构
[1] Univ Toronto, Toronto, ON, Canada
[2] Alan Turing Inst, London, England
[3] Univ Florida, Gainesville, FL USA
[4] HSE Univ, Moscow, Russia
关键词
graph clustering; graph community detection; modularity; conductance; graph mining; network science; complex networks; social networks; unsupervised learning; data science; data analysis; COMMUNITY DETECTION; NETWORKS;
D O I
10.1093/comnet/cnaa012
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
We introduce graph clustering quality measures based on comparisons of global, intra- and inter-cluster densities, an accompanying statistical significance test and a step-by-step routine for clustering quality assessment. Our work is centred on the idea that well-clustered graphs will display a mean intra-cluster density that is higher than global density and mean inter-cluster density. We do not rely on any generative model for the null model graph. Our measures are shown to meet the axioms of a good clustering quality function. They have an intuitive graph-theoretic interpretation, a formal statistical interpretation and can be tested for significance. Empirical tests also show they are more responsive to graph structure, less likely to breakdown during numerical implementation and less sensitive to uncertainty in connectivity than the commonly used measures.
引用
收藏
页数:33
相关论文
共 50 条
  • [31] Optimal choice of parameters for a density-based clustering algorithm
    Gan, WY
    Li, DY
    ROUGH SETS, FUZZY SETS, DATA MINING, AND GRANULAR COMPUTING, 2003, 2639 : 603 - 606
  • [32] An Efficient Density-based clustering algorithm for face groping
    Pei, Shenfei
    Nie, Feiping
    Wang, Rong
    Li, Xuelong
    NEUROCOMPUTING, 2021, 462 : 331 - 343
  • [33] A density-based evolutionary clustering algorithm for intelligent development
    Xie, Haibin
    Li, Peng
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 104
  • [34] Density-based clustering algorithm for mixture data sets
    Huang, De-Cai
    Wu, Tian-Hong
    Kongzhi yu Juece/Control and Decision, 2010, 25 (03): : 416 - 421
  • [35] DENDIS: A new density-based sampling for clustering algorithm
    Ros, Frederic
    Guillaume, Serge
    EXPERT SYSTEMS WITH APPLICATIONS, 2016, 56 : 349 - 359
  • [36] A modified density-based clustering algorithm and its implementation
    Ban, Zhihua
    Liu, Jianguo
    Yuan, Lulu
    Yang, Hua
    MIPPR 2015: PATTERN RECOGNITION AND COMPUTER VISION, 2015, 9813
  • [37] A fast density-based clustering algorithm for large databases
    Liu, Bing
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 996 - 1000
  • [38] Density-based clustering
    Campello, Ricardo J. G. B.
    Kroeger, Peer
    Sander, Jorg
    Zimek, Arthur
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2020, 10 (02)
  • [39] Density-based clustering
    Kriegel, Hans-Peter
    Kroeger, Peer
    Sander, Joerg
    Zimek, Arthur
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 1 (03) : 231 - 240
  • [40] Combined Density-based and Constraint-based Algorithm for Clustering
    陈同孝
    陈荣昌
    林志强
    邱永兴
    Journal of DongHua University, 2006, (06) : 36 - 38