STRATEGIES FOR ONLINE INFERENCE OF MODEL-BASED CLUSTERING IN LARGE AND GROWING NETWORKS

Cited by: 19
Authors
Zanghi, Hugo [1]
Picard, Franck [2]
Miele, Vincent [2]
Ambroise, Christophe [3]
Affiliations
[1] Exalead, F-75008 Paris, France
[2] UCB Lyon 1, Lab Biometrie & Biol Evolut, F-69622 Villeurbanne, France
[3] CNRS, INRA, Lab Stat & Genome, UEVE 1152,UMR 8071, F-91000 Evry, France
Source
ANNALS OF APPLIED STATISTICS | 2010 / Vol. 4 / Issue 02
Keywords
Graph clustering; EM Algorithms; online strategies; web graph structure analysis; MIXED MEMBERSHIP; EM ALGORITHM; MIXTURE; CONVERGENCE; PREDICTION;
DOI
10.1214/10-AOAS359
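Chinese Library Classification (CLC) number
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];
Subject classification codes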
020208 ; 070103 ; 0714 ;
Abstract
In this paper we adapt online estimation strategies to perform model-based clustering on large networks. Our work focuses on two algorithms, the first based on the SAEM algorithm, and the second on variational methods. These two strategies are compared with existing approaches on simulated and real data. We use the method to decipher the connexion structure of the political websphere during the US political campaign in 2008. We show that our online EM-based algorithms offer a good trade-off between precision and speed, when estimating parameters for mixture distributions in the context of random graphs.
Pages: 687 - 714
Page count: 28
Related papers
50 items in total
  • [1] MODEL-BASED CLUSTERING OF LARGE NETWORKS
    Vu, Duy Q.
    Hunter, David R.
    Schweinberger, Michael
    ANNALS OF APPLIED STATISTICS, 2013, 7 (02) : 1010 - 1039
  • [2] Robust inference for parsimonious model-based clustering
    Dotto, Francesco
    Farcomeni, Alessio
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2019, 89 (03) : 414 - 442
  • [3] Model-Based Clustering and New Edge Modelling in Large Computer Networks
    Metelli, Silvia
    Heard, Nicholas
    IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS: CYBERSECURITY AND BIG DATA, 2016, : 91 - 96
  • [4] Model-based clustering for populations of networks
    Signorelli, Mirko
    Wit, Ernst C.
    STATISTICAL MODELLING, 2020, 20 (01) : 9 - 29
  • [5] Model-based clustering for social networks
    Handcock, Mark S.
    Raftery, Adrian E.
    Tantrum, Jeremy M.
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2007, 170 : 301 - 322
  • [6] Model-based Clustering with Noise: Bayesian Inference and Estimation
    H. Bensmail
    J. J. Meulman
    Journal of Classification, 2003, 20 : 49 - 76
  • [7] Model-based clustering with noise: Bayesian inference and estimation
    Bensmail, H
    Meulman, JJ
    JOURNAL OF CLASSIFICATION, 2003, 20 (01) : 49 - 76
  • [8] Model-Based Co-Clustering in Customer Targeting Utilizing Large-Scale Online Product Rating Networks
    Chen, Qian
    Agarwal, Amal
    Fong, Duncan K. H.
    DeSarbo, Wayne S.
    Xue, Lingzhou
    JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 2024,
  • [9] Model-based Fraud Detection in Growing Networks
    Moriano, Pablo
    Finke, Jorge
    2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2014, : 6068 - 6073
  • [10] Hierarchical model-based clustering for large datasets
    Posse, C
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2001, 10 (03) : 464 - 486