Hypothesis testing for automated community detection in networks

Cited by: 112
Authors
Bickel, Peter J. [1]
Sarkar, Purnamrita [2]
Affiliations
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] Univ Texas Austin, Austin, TX 78712 USA
Funding
National Science Foundation (USA);
Keywords
Asymptotic analysis; Community detection; Hypothesis testing; Networks; Stochastic block model; Tracy-Widom distribution; STOCHASTIC BLOCKMODELS; UNIVERSALITY; EIGENVALUES; MODEL; EDGE;
DOI
10.1111/rssb.12117
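Chinese Library Classification number
O21 [Probability theory and mathematical statistics]; C8 [Statistics];
Discipline classification code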
020208 ; 070103 ; 0714 ;
Abstract
Community detection in networks is a key exploratory tool with applications in a diverse set of areas, ranging from finding communities in social and biological networks to identifying link farms in the World Wide Web. The problem of finding communities or clusters in a network has received much attention from statistics, physics and computer science. However, most clustering algorithms assume knowledge of the number of clusters k. We propose to determine k automatically in a graph generated from a stochastic block model by using a hypothesis test of independent interest. Our main contribution is twofold; first, we theoretically establish the limiting distribution of the principal eigenvalue of the suitably centred and scaled adjacency matrix and use that distribution for our test of the hypothesis that a random graph is of Erdős-Rényi (noise) type. Secondly, we use this test to design a recursive bipartitioning algorithm, which naturally uncovers nested community structure. Using simulations and quantifiable classification tasks on real world networks with ground truth, we show that our algorithm outperforms state of the art methods.
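The test statistic described in the abstract can be illustrated with a minimal Python sketch. The abstract only says the adjacency matrix is "suitably centred and scaled", so the specific choices below are assumptions made for illustration rather than the authors' stated procedure: centring by the estimated edge probability `p_hat`, scaling by `sqrt((n - 1) * p_hat * (1 - p_hat))`, and the normalisation `n^(2/3) * (lambda_max - 2)`. The function names `er_test_statistic` and `sample_graph` are hypothetical. A large value of the statistic, relative to an upper quantile of the Tracy-Widom distribution, would be evidence against the Erdős-Rényi null.

```python
import numpy as np


def er_test_statistic(A):
    """Principal-eigenvalue statistic for the Erdos-Renyi null (illustrative).

    Assumed normalisation, not taken from the abstract:
    n^(2/3) * (lambda_max(A_scaled) - 2).
    Requires 0 < p_hat < 1, i.e. a graph that is neither empty nor complete.
    """
    A = np.asarray(A, dtype=float)
    n = A.shape[0]
    # Estimated edge probability over ordered off-diagonal pairs.
    p_hat = A.sum() / (n * (n - 1))
    # Subtract the expected adjacency matrix under the null (zero diagonal).
    centred = A - p_hat * (np.ones((n, n)) - np.eye(n))
    # Scale so the bulk spectrum lies approximately in [-2, 2].
    scaled = centred / np.sqrt((n - 1) * p_hat * (1 - p_hat))
    # eigvalsh returns eigenvalues in ascending order; take the largest.
    lam_max = np.linalg.eigvalsh(scaled)[-1]
    return n ** (2.0 / 3.0) * (lam_max - 2.0)


def sample_graph(P, rng):
    """Sample a symmetric 0/1 adjacency matrix with edge probabilities P."""
    n = P.shape[0]
    upper = np.triu(rng.random((n, n)) < P, k=1)
    return (upper | upper.T).astype(float)


rng = np.random.default_rng(0)
n = 200

# Null case: a single Erdos-Renyi block with edge probability 0.1.
P_er = np.full((n, n), 0.1)
stat_er = er_test_statistic(sample_graph(P_er, rng))

# Alternative: two equal communities, dense within and sparse between.
labels = np.repeat([0, 1], n // 2)
same_block = labels[:, None] == labels[None, :]
P_sbm = np.where(same_block, 0.5, 0.05)
stat_sbm = er_test_statistic(sample_graph(P_sbm, rng))

print("Erdos-Renyi statistic:", stat_er)
print("Two-block statistic:  ", stat_sbm)
```

In a recursive bipartitioning scheme of the kind the abstract mentions, a statistic like this would be recomputed on each candidate subgraph, and splitting would stop once the null is no longer rejected. The rejection threshold and the splitting rule are not specified in the abstract and are left out of this sketch.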
Pages: 253-273
Page count: 21
Related papers
50 in total
  • [31] Distributed Hypothesis Testing: Cooperation and Concurrent Detection
    Escamilla, Pierre
    Wigger, Michele
    Zaidi, Abdellatif
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2020, 66 (12) : 7550 - 7564
  • [32] Bilateral symmetry detection: Testing a 'callosal' hypothesis
    Herbert, AM
    Humphrey, GK
    PERCEPTION, 1996, 25 (04) : 463 - 480
  • [33] Edge and line detection as exercises in hypothesis testing
    Newsam, GN
    ICIP: 2004 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1- 5, 2004, : 2685 - 2688
  • [34] Sequential analysis: hypothesis testing and changepoint detection
    Ober, Pieter Bastiaan
    JOURNAL OF APPLIED STATISTICS, 2015, 42 (10) : 2290 - 2290
  • [35] Hypothesis Testing Framework for Active Object Detection
    Atanasov, Nikolay
    Sankaran, Bharath
    Le Ny, Jerome
    Koletschka, Thomas
    Pappas, George J.
    Daniilidis, Kostas
    2013 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2013, : 4216 - 4222
  • [36] Anomaly detection and community detection in networks
    Hadiseh Safdari
    Caterina De Bacco
    Journal of Big Data, 9
  • [37] Anomaly detection and community detection in networks
    Safdari, Hadiseh
    De Bacco, Caterina
    JOURNAL OF BIG DATA, 2022, 9 (01)
  • [38] Belief consensus and distributed hypothesis testing in sensor networks
    Olfati-Saber, Reza
    Franco, Elisa
    Frazzoli, Emilio
    Shamma, Jeff S.
    NETWORKED EMBEDDED SENSING AND CONTROL, 2006, 331 : 169 - 182
  • [39] Secure localization using hypothesis testing in wireless networks
    AlRoomi, Suood Abdulaziz
    Ahmad, Imtiaz
    Dimitriou, Tassos
    AD HOC NETWORKS, 2018, 74 : 47 - 56
  • [40] Permutation-Based Hypothesis Testing for Neural Networks
    Mandel, Francesca
    Barnett, Ian
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 13, 2024, : 14306 - 14314