Bayesian Semiparametric Local Clustering of Multiple Time Series Data

被引:0
|
作者
Fan, Jingjing [1 ]
Sarkar, Abhra [1 ]
机构
[1] Univ Texas Austin, Dept Stat & Data Sci, Welch 5-216,105 East 24th St D9800, Austin, TX 78705 USA
基金
美国国家科学基金会;
关键词
Change point detection; Hidden Markov model; Local clustering; Time series; MODELS;
D O I
10.1080/00401706.2023.2288324
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In multiple time series data, clustering the component profiles can identify meaningful latent groups while also detecting interesting change points in their trajectories. Conventional time series clustering methods, however, suffer the drawback of requiring the co-clustered units to have the same cluster membership throughout the entire time domain. In contrast to these "global" clustering methods, we develop a Bayesian "local" clustering method that allows the functions to flexibly change their cluster memberships over time. We design a Markov chain Monte Carlo algorithm to implement our method. We illustrate the method in several real-world datasets, where time-varying cluster memberships provide meaningful inferences about the underlying processes. These include a public health dataset to showcase the more detailed inference our method can provide over global clustering alternatives, and a temperature dataset to demonstrate our method's utility as a flexible change point detection method. Supplemental materials for this article, including R codes implementing the method, are available online.
引用
收藏
页码:282 / 294
页数:13
相关论文
共 50 条
  • [41] Clustering multivariate time-series data
    Singhal, A
    Seborg, DE
    JOURNAL OF CHEMOMETRICS, 2005, 19 (08) : 427 - 438
  • [42] Clustering of multivariate time-series data
    Singhal, A
    Seborg, DE
    PROCEEDINGS OF THE 2002 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2002, 1-6 : 3931 - 3936
  • [43] Non-parametric Bayesian covariate-dependent multivariate functional clustering: An application to time-series data for multiple air pollutants
    Yang, Daewon
    Choi, Taeryon
    Lavigne, Eric
    Chung, Yeonseung
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2022, 71 (05) : 1521 - 1542
  • [44] Distance and Density Clustering for Time Series Data
    Ma, Ruizhe
    Angryk, Rafal A.
    2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2017), 2017, : 25 - 32
  • [45] Time Series Clustering of Energy Meter Data
    Majumder, Patrali
    Richter, Marc
    Gotze, Jens
    2022 IEEE INTERNATIONAL CONFERENCE ON ENVIRONMENT AND ELECTRICAL ENGINEERING AND 2022 IEEE INDUSTRIAL AND COMMERCIAL POWER SYSTEMS EUROPE (EEEIC / I&CPS EUROPE), 2022,
  • [46] Clustering multimedia data using time series
    Niennattrakul, Vit
    Ratanamahatana, Chotirat Ann
    2006 INTERNATIONAL CONFERENCE ON HYBRID INFORMATION TECHNOLOGY, VOL 1, PROCEEDINGS, 2006, : 372 - +
  • [47] Seasonal clustering technique for time series data
    Inniss, Tasha R.
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2006, 175 (01) : 376 - 384
  • [48] A novel clustering method on time series data
    Zhang, Xiaohang
    Liu, Jiaqi
    Du, Yu
    Lv, Tingjie
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (09) : 11891 - 11900
  • [49] Bayesian local bandwidths in a flexible semiparametric kernel estimation for multivariate count data with diagnostics
    Some, Sobom M.
    Kokonendji, Celestin C.
    Belaid, Nawel
    Adjabi, Smail
    Abid, Rahma
    STATISTICAL METHODS AND APPLICATIONS, 2023, 32 (03): : 843 - 865
  • [50] Semiparametric Bayesian inference in multiple equation models
    Koop, G
    Poirier, DJ
    Tobias, J
    JOURNAL OF APPLIED ECONOMETRICS, 2005, 20 (06) : 723 - 747