A scalable community detection algorithm for large graphs using stochastic block models

被引:4
|
作者
Peng, Chengbin [1 ,2 ]
Zhang, Zhihua [3 ]
Wong, Ka-Chun [4 ]
Zhang, Xiangliang [1 ]
Keyes, David E. [1 ]
机构
[1] King Abdullah Univ Sci & Technol, Post Box 2925, Thuwal 239556900, Saudi Arabia
[2] Ningbo Inst Ind Technol, Ningbo, Zhejiang, Peoples R China
[3] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[4] City Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China
关键词
Stochastic block model; parallel computing; community detection; MULTI;
D O I
10.3233/IDA-163156
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Community detection in graphs is widely used in social and biological networks, and the stochastic block model is a powerful probabilistic tool for describing graphs with community structures. However, in the era of "big data", traditional inference algorithms for such a model are increasingly limited due to their high time complexity and poor scalability. In this paper, we propose a multi-stage maximum likelihood approach to recover the latent parameters of the stochastic block model, in time linear with respect to the number of edges. We also propose a parallel algorithm based on message passing. Our algorithm can overlap communication and computation, providing speedup without compromising accuracy as the number of processors grows. For example, to process a real-world graph with about 1.3 million nodes and 10 million edges, our algorithm requires about 6 seconds on 64 cores of a contemporary commodity Linux cluster. Experiments demonstrate that the algorithm can produce high quality results on both benchmark and real-world graphs. An example of finding more meaningful communities is illustrated consequently in comparison with a popular modularity maximization algorithm.
引用
收藏
页码:1463 / 1485
页数:23
相关论文
共 50 条
  • [21] Scalable Community Detection with the Louvain Algorithm
    Que, Xinyu
    Checconi, Fabio
    Petrini, Fabrizio
    Gunnels, John A.
    2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2015, : 28 - 37
  • [22] A Scalable Distributed Louvain Algorithm for Large-scale Graph Community Detection
    Zeng, Jianping
    Yu, Hongfeng
    2018 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2018, : 268 - 278
  • [23] Community Detection Algorithm Combining Stochastic Block Model and Attribute Data Clustering
    Kataoka, Shun
    Kobayashi, Takuto
    Yasuda, Muneki
    Tanaka, Kazuyuki
    JOURNAL OF THE PHYSICAL SOCIETY OF JAPAN, 2016, 85 (11)
  • [24] Graph Theoretic and Stochastic Block Models Integrated with Matrix Factorization for Community Detection
    McGarry, Ken
    ADVANCES IN COMPUTATIONAL INTELLIGENCE SYSTEMS, UKCI 2022, 2024, 1454 : 297 - 311
  • [25] Dynamic stochastic block models: parameter estimation and detection of changes in community structure
    Ludkin, Matthew
    Eckley, Idris
    Neal, Peter
    STATISTICS AND COMPUTING, 2018, 28 (06) : 1201 - 1213
  • [26] Dynamic stochastic block models: parameter estimation and detection of changes in community structure
    Matthew Ludkin
    Idris Eckley
    Peter Neal
    Statistics and Computing, 2018, 28 : 1201 - 1213
  • [27] Rate optimal Chernoff bound and application to community detection in the stochastic block models
    Zhou, Zhixin
    Li, Ping
    ELECTRONIC JOURNAL OF STATISTICS, 2020, 14 (01): : 1302 - 1347
  • [28] A Sequential and Scalable Approach to Community Detection in Dynamic Graphs
    Beckus, Andre
    Atia, George K.
    2019 57TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2019, : 298 - 302
  • [29] Accurate and scalable social recommendation using mixed-membership stochastic block models
    Godoy-Lorite, Antonia
    Guimera, Roger
    Moore, Cristopher
    Sales-Pardo, Marta
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2016, 113 (50) : 14207 - 14212
  • [30] Community Detection in Large Directed Graphs
    Chen, Siqi
    Bhatnagar, Raj
    BIG DATA ANALYTICS, BDA 2022, 2022, 13773 : 172 - 181