Community detection in general stochastic block models: fundamental limits and efficient algorithms for recovery

被引:208
|
作者
Abbe, Emmanuel [1 ]
Sandon, Colin [2 ]
机构
[1] Princeton Univ, PACM & EE Dept, Princeton, NJ 08544 USA
[2] Princeton Univ, Dept Math, Princeton, NJ 08544 USA
关键词
Community detection; stochastic block models; phase transitions; clustering algorithms; information measures; graph-based codes; BLOCKMODELS; GRAPHS;
D O I
10.1109/FOCS.2015.47
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
New phase transition phenomena have recently been discovered for the stochastic block model, for the special case of two non-overlapping symmetric communities. This gives raise in particular to new algorithmic challenges driven by the thresholds. This paper investigates whether a general phenomenon takes place for multiple communities, without imposing symmetry. In the general stochastic block model SBM(n,p, W), n vertices are split into k communities of relative size {p(i)}(i is an element of[k]), and vertices in community i and j connect independently with probability {Wi,j}i,je [k]. This paper investigates the partial and exact recovery of communities in the general SBM (in the constant and logarithmic degree regimes), and uses the generality of the results to tackle overlapping communities. The contributions of the paper are: (i) an explicit characterization of the recovery threshold in the general SBM in terms of a new f-divergence function D+, which generalizes the Hellinger and Chernoff divergences, and which provides an operational meaning to a divergence function analog to the KL-divergence in the channel coding theorem, (ii) the development of an algorithm that recovers the communities all the way down to the optimal threshold and runs in quasi-linear time, showing that exact recovery has no information-theoretic to computational gap for multiple communities, (iii) the development of an efficient algorithm that detects communities in the constant degree regime with an explicit accuracy bound that can be made arbitrarily close to 1 when a prescribed signal-to-noise ratio (defined in term of the spectrum of diag(p)W) tends to infinity.
引用
收藏
页码:670 / 688
页数:19
相关论文
共 50 条
  • [31] Information Theoretic Limits of Exact Recovery in Sub-hypergraph Models for Community Detection
    Liang, Jiajun
    Ke, Chuyang
    Honorio, Jean
    2021 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2021, : 2578 - 2583
  • [32] Learning Markov Games with Adversarial Opponents: Efficient Algorithms and Fundamental Limits
    Liu, Qinghua
    Wang, Yuanhao
    Jin, Chi
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [33] CONSISTENCY OF COMMUNITY DETECTION IN NETWORKS UNDER DEGREE-CORRECTED STOCHASTIC BLOCK MODELS
    Zhao, Yunpeng
    Levina, Elizaveta
    Zhu, Ji
    ANNALS OF STATISTICS, 2012, 40 (04): : 2266 - 2292
  • [34] Profile-pseudo likelihood methods for community detection of multilayer stochastic block models
    Fu, Kang
    Hu, Jianwei
    STAT, 2023, 12 (01):
  • [35] A distributed community detection algorithm for large scale networks under stochastic block models
    Wu, Shihao
    Li, Zhe
    Zhu, Xuening
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2023, 187
  • [36] Robust Recovery for Stochastic Block Models, Simplified and Generalized
    Mohanty, Sidhanth
    Raghavendra, Prasad
    Wu, David X.
    PROCEEDINGS OF THE 56TH ANNUAL ACM SYMPOSIUM ON THEORY OF COMPUTING, STOC 2024, 2024, : 367 - 374
  • [37] Efficient Inference in Stochastic Block Models With Vertex Labels
    Stegehuis, Clara
    Massoulie, Laurent
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2020, 7 (03): : 1215 - 1225
  • [38] Community Recovery in the Degree-Heterogeneous Stochastic Block Model
    Cohen-Addad, Vincent
    Mallmann-Trenn, Frederik
    Saulpic, David
    CONFERENCE ON LEARNING THEORY, VOL 178, 2022, 178
  • [39] A SPECTRAL METHOD FOR COMMUNITY DETECTION IN MODERATELY SPARSE DEGREE-CORRECTED STOCHASTIC BLOCK MODELS
    Gulikers, Lennart
    Lelarge, Marc
    Massoulie, Laurent
    ADVANCES IN APPLIED PROBABILITY, 2017, 49 (03) : 686 - 721
  • [40] Distributed Community Detection on Overlapping Stochastic Block Model
    Xu, Jiasheng
    Fu, Luoyi
    Gan, Xiaoying
    Zhu, Bo
    2020 12TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2020, : 201 - 206