Sampling in Dirichlet Process Mixture Models for Clustering Streaming Data

被引:0
|
作者
Dinari, Or [1 ]
Freifeld, Oren [1 ]
机构
[1] Ben Gurion Univ Negev, Beer Sheva, Israel
基金
以色列科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Practical tools for clustering streaming data must be fast enough to handle the arrival rate of the observations. Typically, they also must adapt on the fly to possible lack of stationarity; i.e., the data statistics may be time-dependent due to various forms of drifts, changes in the number of clusters, etc. The Dirichlet Process Mixture Model (DPMM), whose Bayesian nonparametric nature allows it to adapt its complexity to the data, seems a natural choice for the streaming-data case. In its classical formulation, however, the DPMM cannot capture common types of drifts in the data statistics. Moreover, and regardless of that limitation, existing methods for online DPMM inference are too slow to handle rapid data streams. In this work we propose adapting both the DPMM and a known DPMM sampling-based non-streaming inference method for streaming-data clustering. We demonstrate the utility of the proposed method on several challenging settings, where it obtains state-of-the-art results while being on par with other methods in terms of speed.
引用
收藏
页码:818 / 835
页数:18
相关论文
共 50 条
  • [21] Online Data Clustering Using Variational Learning of a Hierarchical Dirichlet Process Mixture of Dirichlet Distributions
    Fan, Wentao
    Bouguila, Nizar
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2014, 2014, 8505 : 18 - 32
  • [22] Dirichlet process mixture models for unsupervised clustering of symptoms in Parkinson's disease
    White, Nicole
    Johnson, Helen
    Silburn, Peter
    Mengersen, Kerrie
    JOURNAL OF APPLIED STATISTICS, 2012, 39 (11) : 2363 - 2377
  • [23] Research on dirichlet process mixture model for clustering
    Zhang B.
    Zhang K.
    Zhong L.
    Zhang X.
    Ingenierie des Systemes d'Information, 2019, 24 (02): : 183 - 189
  • [24] Sampling from Dirichlet process mixture models with unknown concentration parameter: mixing issues in large data implementations
    Hastie, David I.
    Liverani, Silvia
    Richardson, Sylvia
    STATISTICS AND COMPUTING, 2015, 25 (05) : 1023 - 1037
  • [25] Sampling from Dirichlet process mixture models with unknown concentration parameter: mixing issues in large data implementations
    David I. Hastie
    Silvia Liverani
    Sylvia Richardson
    Statistics and Computing, 2015, 25 : 1023 - 1037
  • [26] Data Clustering using Variational Learning of Finite Scaled Dirichlet Mixture Models
    Hieu Nguyen
    Azam, Muhammad
    Bouguila, Nizar
    2019 IEEE 28TH INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS (ISIE), 2019, : 1391 - 1396
  • [27] Object Clustering With Dirichlet Process Mixture Model for Data Association in Monocular SLAM
    Wei, Songlin
    Chen, Guodong
    Chi, Wenzheng
    Wang, Zhenhua
    Sun, Lining
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2023, 70 (01) : 594 - 603
  • [28] Estimating mixture of Dirichlet process models
    MacEachern, SN
    Muller, P
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 1998, 7 (02) : 223 - 238
  • [29] Deep Dirichlet Process Mixture Models
    Li, Naiqi
    Li, Wenjie
    Jiang, Yong
    Xia, Shu-Tao
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 180, 2022, 180 : 1138 - 1147
  • [30] An Adaptive Dirichlet Multinomial Mixture Model for Short Text Streaming Clustering
    Duan, Ruting
    Li, Chunping
    2018 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2018), 2018, : 49 - 55