Bayesian Semiparametric Local Clustering of Multiple Time Series Data

被引:0
|
作者
Fan, Jingjing [1 ]
Sarkar, Abhra [1 ]
机构
[1] Univ Texas Austin, Dept Stat & Data Sci, Welch 5-216,105 East 24th St D9800, Austin, TX 78705 USA
基金
美国国家科学基金会;
关键词
Change point detection; Hidden Markov model; Local clustering; Time series; MODELS;
D O I
10.1080/00401706.2023.2288324
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In multiple time series data, clustering the component profiles can identify meaningful latent groups while also detecting interesting change points in their trajectories. Conventional time series clustering methods, however, suffer the drawback of requiring the co-clustered units to have the same cluster membership throughout the entire time domain. In contrast to these "global" clustering methods, we develop a Bayesian "local" clustering method that allows the functions to flexibly change their cluster memberships over time. We design a Markov chain Monte Carlo algorithm to implement our method. We illustrate the method in several real-world datasets, where time-varying cluster memberships provide meaningful inferences about the underlying processes. These include a public health dataset to showcase the more detailed inference our method can provide over global clustering alternatives, and a temperature dataset to demonstrate our method's utility as a flexible change point detection method. Supplemental materials for this article, including R codes implementing the method, are available online.
引用
收藏
页码:282 / 294
页数:13
相关论文
共 50 条
  • [31] A semiparametric method for clustering mixed data
    Alex Foss
    Marianthi Markatou
    Bonnie Ray
    Aliza Heching
    Machine Learning, 2016, 105 : 419 - 458
  • [32] Application of Agglomerative Hierarchical Clustering for Clustering of Time Series Data
    Radovanovic, Ana
    Li, Junshi
    Milanovic, Jovica, V
    Milosavljevic, Nina
    Storchi, Riccardo
    2020 IEEE PES INNOVATIVE SMART GRID TECHNOLOGIES EUROPE (ISGT-EUROPE 2020): SMART GRIDS: KEY ENABLERS OF A GREEN POWER SYSTEM, 2020, : 640 - 644
  • [33] Bayesian methods for time series of count data
    Obeidat, Mohammed
    Liu, Juxin
    Osgood, Nathaniel
    Klassen, Geoff
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2022, 51 (02) : 486 - 504
  • [34] Bayesian multiscale analysis for time series data
    Oigard, Tor Arne
    Rue, Havard
    Godtliebsen, Fred
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2006, 51 (03) : 1719 - 1730
  • [35] Bayesian Forecasting for Time Series of Categorical Data
    Angers, Jean-Francois
    Biswas, Atanu
    Maiti, Raju
    JOURNAL OF FORECASTING, 2017, 36 (03) : 217 - 229
  • [36] Bayesian Forecasting for Time Series of Count Data
    Nariswari, Rinda
    Pudjihastuti, Herena
    4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMPUTATIONAL INTELLIGENCE (ICCSCI 2019) : ENABLING COLLABORATION TO ESCALATE IMPACT OF RESEARCH RESULTS FOR SOCIETY, 2019, 157 : 427 - 435
  • [37] Bayesian analysis of time series Poisson data
    Oh, MS
    Lim, YB
    JOURNAL OF APPLIED STATISTICS, 2001, 28 (02) : 259 - 271
  • [38] A semiparametric Bayesian approach to the analysis of financial time series with applications to value at risk estimation
    Concepcion Ausin, M.
    Galeano, Pedro
    Ghosh, Pulak
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2014, 232 (02) : 350 - 358
  • [39] Multiple gene expression profile alignment for microarray time-series data clustering
    Subhani, Numanul
    Rueda, Luis
    Ngom, Alioune
    Burden, Conrad J.
    BIOINFORMATICS, 2010, 26 (18) : 2281 - 2288
  • [40] Bayesian local bandwidths in a flexible semiparametric kernel estimation for multivariate count data with diagnostics
    Sobom M. Somé
    Célestin C. Kokonendji
    Nawel Belaid
    Smail Adjabi
    Rahma Abid
    Statistical Methods & Applications, 2023, 32 : 843 - 865