A density-based statistical analysis of graph clustering algorithm performance

Cited by: 10
Authors
Miasnikof, Pierre [1]
Shestopaloff, Alexander Y. [2]
Bonner, Anthony J. [1]
Lawryshyn, Yuri [1]
Pardalos, Panos M. [3, 4]
Affiliations
[1] Univ Toronto, Toronto, ON, Canada
[2] Alan Turing Inst, London, England
[3] Univ Florida, Gainesville, FL USA
[4] HSE Univ, Moscow, Russia
Keywords
graph clustering; graph community detection; modularity; conductance; graph mining; network science; complex networks; social networks; unsupervised learning; data science; data analysis; COMMUNITY DETECTION; NETWORKS
DOI
10.1093/comnet/cnaa012
Chinese Library Classification number
O1 [Mathematics]
Discipline classification code
0701; 070101
Abstract
We introduce graph clustering quality measures based on comparisons of global, intra- and inter-cluster densities, an accompanying statistical significance test and a step-by-step routine for clustering quality assessment. Our work is centred on the idea that well-clustered graphs will display a mean intra-cluster density that is higher than global density and mean inter-cluster density. We do not rely on any generative model for the null model graph. Our measures are shown to meet the axioms of a good clustering quality function. They have an intuitive graph-theoretic interpretation, a formal statistical interpretation and can be tested for significance. Empirical tests also show they are more responsive to graph structure, less likely to break down during numerical implementation and less sensitive to uncertainty in connectivity than the commonly used measures.
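The density comparison at the heart of the abstract can be illustrated with a minimal sketch. It is not the authors' quality measures or their significance test, which the abstract does not spell out. It only computes the three quantities being compared, under assumed textbook definitions for an undirected simple graph: global density as edges over all node pairs, intra-cluster density as internal edges over node pairs within a cluster, and inter-cluster density as crossing edges over node pairs spanning two clusters. The function name `density_summary` and the toy graph are hypothetical.

```python
from itertools import combinations


def density_summary(edges, labels):
    """Return (global, mean intra-cluster, mean inter-cluster) edge densities.

    edges  : iterable of (u, v) pairs of an undirected simple graph
    labels : dict mapping every node to its cluster label
    Assumed definitions (illustrative only, not taken from the paper).
    """
    n = len(labels)
    if n < 2:
        raise ValueError("need at least two nodes")

    # Deduplicate undirected edges and ignore self-loops.
    edge_set = {frozenset(e) for e in edges if e[0] != e[1]}

    # Group nodes by cluster label.
    clusters = {}
    for node, label in labels.items():
        clusters.setdefault(label, set()).add(node)

    global_density = len(edge_set) / (n * (n - 1) / 2)

    # Count edges inside each cluster and between each pair of clusters.
    intra_counts = {label: 0 for label in clusters}
    inter_counts = {}
    for edge in edge_set:
        u, v = tuple(edge)
        lu, lv = labels[u], labels[v]
        if lu == lv:
            intra_counts[lu] += 1
        else:
            key = frozenset((lu, lv))
            inter_counts[key] = inter_counts.get(key, 0) + 1

    # Singleton clusters have no internal node pairs, so they are skipped.
    intra = [
        intra_counts[label] / (len(members) * (len(members) - 1) / 2)
        for label, members in clusters.items()
        if len(members) > 1
    ]
    inter = [
        inter_counts.get(frozenset((a, b)), 0)
        / (len(clusters[a]) * len(clusters[b]))
        for a, b in combinations(clusters, 2)
    ]
    mean_intra = sum(intra) / len(intra) if intra else 0.0
    mean_inter = sum(inter) / len(inter) if inter else 0.0
    return global_density, mean_intra, mean_inter


# Toy graph: two triangles joined by a single bridging edge.
toy_edges = [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5), (2, 3)]
toy_labels = {0: "a", 1: "a", 2: "a", 3: "b", 4: "b", 5: "b"}
g, mean_intra, mean_inter = density_summary(toy_edges, toy_labels)
print(g, mean_intra, mean_inter)
```

On this toy graph the clustering into the two triangles gives a mean intra-cluster density above the global density, which in turn exceeds the mean inter-cluster density, the ordering the abstract associates with a well-clustered graph. Whether such a gap is statistically significant is what the paper's test addresses, and that step is not sketched here.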
Pages: 33