A density-based statistical analysis of graph clustering algorithm performance

被引:10
|
作者
Miasnikof, Pierre [1 ]
Shestopaloff, Alexander Y. [2 ]
Bonner, Anthony J. [1 ]
Lawryshyn, Yuri [1 ]
Pardalos, Panos M. [3 ,4 ]
机构
[1] Univ Toronto, Toronto, ON, Canada
[2] Alan Turing Inst, London, England
[3] Univ Florida, Gainesville, FL USA
[4] HSE Univ, Moscow, Russia
关键词
graph clustering; graph community detection; modularity; conductance; graph mining; network science; complex networks; social networks; unsupervised learning; data science; data analysis; COMMUNITY DETECTION; NETWORKS;
D O I
10.1093/comnet/cnaa012
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
We introduce graph clustering quality measures based on comparisons of global, intra- and inter-cluster densities, an accompanying statistical significance test and a step-by-step routine for clustering quality assessment. Our work is centred on the idea that well-clustered graphs will display a mean intra-cluster density that is higher than global density and mean inter-cluster density. We do not rely on any generative model for the null model graph. Our measures are shown to meet the axioms of a good clustering quality function. They have an intuitive graph-theoretic interpretation, a formal statistical interpretation and can be tested for significance. Empirical tests also show they are more responsive to graph structure, less likely to breakdown during numerical implementation and less sensitive to uncertainty in connectivity than the commonly used measures.
引用
收藏
页数:33
相关论文
共 50 条
  • [21] MIDBSCAN: An Efficient Density-Based Clustering Algorithm
    Tsai, Cheng-Fa
    Sung, Chun-Yi
    SIXTH INTERNATIONAL SYMPOSIUM ON NEURAL NETWORKS (ISNN 2009), 2009, 56 : 469 - 479
  • [22] TOBAE: A Density-based Agglomerative Clustering Algorithm
    Khalid, Shehzad
    Razzaq, Shahid
    JOURNAL OF CLASSIFICATION, 2015, 32 (02) : 241 - 267
  • [23] Research on Application of Density-Based Clustering Algorithm in Aircraft Formation Analysis
    Zhang, Xianwei
    Zhang, Lu
    2020 5TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE, COMPUTER TECHNOLOGY AND TRANSPORTATION (ISCTT 2020), 2020, : 417 - 421
  • [24] An Algorithm to Adaptive Determination of Density Threshold for Density-based Clustering
    Ke, Zhang
    Lei, Huang
    Yi, Chai
    PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 3929 - 3935
  • [25] Performance evaluation of density-based clustering methods
    Aliguliyev, Ramiz M.
    INFORMATION SCIENCES, 2009, 179 (20) : 3583 - 3602
  • [26] A Density-based clustering algorithm suitable to various density dataset
    School of Software, Dalian University of Technology, Dalian 116621, China
    J. Comput. Inf. Syst., 2008, 6 (2473-2481):
  • [27] Video abstraction using density-based clustering algorithm
    Fereshteh Falah Chamasemani
    Lilly Suriani Affendey
    Norwati Mustapha
    Fatimah Khalid
    The Visual Computer, 2018, 34 : 1299 - 1314
  • [28] Video abstraction using density-based clustering algorithm
    Chamasemani, Fereshteh Falah
    Affendey, Lilly Suriani
    Mustapha, Norwati
    Khalid, Fatimah
    VISUAL COMPUTER, 2018, 34 (10): : 1299 - 1314
  • [29] An Improved BAT Algorithm Using Density-Based Clustering
    Al-Asadi, Samraa Adnan
    Al-Mamory, Safaa O.
    INTELIGENCIA ARTIFICIAL-IBEROAMERICAL JOURNAL OF ARTIFICIAL INTELLIGENCE, 2023, 26 (72): : 102 - 123
  • [30] A GPU-Accelerated Density-Based Clustering Algorithm
    Loh, Woong-Kee
    Kim, Young-Kuk
    2014 IEEE FOURTH INTERNATIONAL CONFERENCE ON BIG DATA AND CLOUD COMPUTING (BDCLOUD), 2014, : 775 - 776