Sampling in Dirichlet Process Mixture Models for Clustering Streaming Data

被引:0
|
作者
Dinari, Or [1 ]
Freifeld, Oren [1 ]
机构
[1] Ben Gurion Univ Negev, Beer Sheva, Israel
基金
以色列科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Practical tools for clustering streaming data must be fast enough to handle the arrival rate of the observations. Typically, they also must adapt on the fly to possible lack of stationarity; i.e., the data statistics may be time-dependent due to various forms of drifts, changes in the number of clusters, etc. The Dirichlet Process Mixture Model (DPMM), whose Bayesian nonparametric nature allows it to adapt its complexity to the data, seems a natural choice for the streaming-data case. In its classical formulation, however, the DPMM cannot capture common types of drifts in the data statistics. Moreover, and regardless of that limitation, existing methods for online DPMM inference are too slow to handle rapid data streams. In this work we propose adapting both the DPMM and a known DPMM sampling-based non-streaming inference method for streaming-data clustering. We demonstrate the utility of the proposed method on several challenging settings, where it obtains state-of-the-art results while being on par with other methods in terms of speed.
引用
收藏
页码:818 / 835
页数:18
相关论文
共 50 条
  • [31] Dirichlet process mixture models for non-stationary data streams
    Casado, Ioar
    Perez, Aritz
    2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2022, : 873 - 878
  • [32] Dirichlet process mixture models for single-cell RNA-seq clustering
    Adossa, Nigatu A.
    Rytkonen, Kalle T.
    Elo, Laura L.
    BIOLOGY OPEN, 2022, 11 (04):
  • [33] Swendsen-Wang Cuts sampling for spatially constrained Dirichlet process mixture models
    Wang, Xiangrong
    Zhao, Jieyu
    GRAPHICAL MODELS, 2014, 76 : 496 - 506
  • [34] Clustering with label constrained Dirichlet process mixture model
    Burhanuddin, Nurul Afiqah
    Adam, Mohd Bakri
    Ibrahim, Kamarulzaman
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 107
  • [35] Graph Clustering Using Dirichlet Process Mixture Model
    Atastina, Imelda
    Sitohang, Benhard
    Putri, G. A. S.
    Moertini, Veronica S.
    PROCEEDINGS OF 2017 INTERNATIONAL CONFERENCE ON DATA AND SOFTWARE ENGINEERING (ICODSE), 2017,
  • [36] Deep Clustering using Dirichlet Process Gaussian Mixture
    Lim, Kart-Leong
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [37] Massively Distributed Clustering via Dirichlet Process Mixture
    Meguelati, Khadidja
    Fontez, Benedicte
    Hilgert, Nadine
    Masseglia, Florent
    Sanchez, Isabelle
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2020, PT V, 2021, 12461 : 536 - 540
  • [38] Markov chain sampling methods for Dirichlet process mixture
    Neal, RM
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2000, 9 (02) : 249 - 265
  • [39] Data Clustering using Online Variational Learning of Finite Scaled Dirichlet Mixture Models
    Nguyen, Hieu
    Kalra, Meeta
    Azam, Muhammad
    Bouguila, Nizar
    2019 IEEE 20TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE (IRI 2019), 2019, : 267 - 274
  • [40] Dirichlet process mixture models with shrinkage prior
    Ding, Dawei
    Karabatsos, George
    STAT, 2021, 10 (01):