Hierarchical clustering of time-series data streams

Cited by: 129
Authors
Rodrigues, Pedro Pereira [1, 2]
Gama, Joao [1, 3]
Pedroso, Joao Pedro [4, 5]
Affiliations
[1] LIAAD INESC Porto LA, P-4050190 Oporto, Portugal
[2] Univ Porto, Fac Sci, P-4050190 Oporto, Portugal
[3] Univ Porto, Fac Econ, P-4050190 Oporto, Portugal
[4] UESP INESC, P-4169007 Oporto, Portugal
[5] Univ Porto, Fac Sci, P-4169007 Oporto, Portugal
Keywords
data stream analysis; clustering streaming time series; incremental hierarchical clustering; change detection
DOI
10.1109/TKDE.2007.190727
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper presents and analyzes an incremental system for clustering streaming time series. The Online Divisive-Agglomerative Clustering (ODAC) system continuously maintains a tree-like hierarchy of clusters that evolves with the data, using a top-down strategy. The splitting criterion is a correlation-based dissimilarity measure among time series, and each node is split by its farthest pair of streams. The system also uses a merge operator that reaggregates a previously split node in order to react to changes in the correlation structure between time series. The split and merge operators are triggered in response to changes in the diameters of existing clusters, assuming that in stationary environments, expanding the structure leads to a decrease in the diameters of the clusters. The system is designed to process thousands of data streams that flow at a high rate. The main features of the system include update time and memory consumption that do not depend on the number of examples in the stream. Moreover, the time and memory required to process an example decrease whenever the cluster structure expands. Experimental results on artificial and real data assess the processing qualities of the system, suggesting competitive performance on clustering streaming time series, and also explore its ability to deal with concept drift.
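The correlation-based split described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The class name `StreamStats`, its methods, and the specific dissimilarity `sqrt((1 - r) / 2)` are assumptions made for illustration, since the abstract does not give the exact formula. The sketch keeps only running sums per stream and per pair, so memory does not grow with the number of examples, and it splits a node around its farthest pair of streams. The rules that decide when to split or merge are not specified in the abstract and are omitted here.

```python
import math
from itertools import combinations


class StreamStats:
    """Running sums for the streams held by one cluster node (hypothetical helper)."""

    def __init__(self, names):
        self.names = list(names)
        self.n = 0
        self.s = {a: 0.0 for a in self.names}    # sum of x
        self.s2 = {a: 0.0 for a in self.names}   # sum of x^2
        self.sp = {p: 0.0 for p in combinations(self.names, 2)}  # sum of x*y

    def update(self, example):
        """Absorb one example: a dict mapping stream name to its current value."""
        self.n += 1
        for a in self.names:
            x = example[a]
            self.s[a] += x
            self.s2[a] += x * x
        for a, b in self.sp:
            self.sp[(a, b)] += example[a] * example[b]

    def corr(self, a, b):
        """Pearson correlation computed from the running sums only."""
        key = (a, b) if (a, b) in self.sp else (b, a)
        cov = self.sp[key] - self.s[a] * self.s[b] / self.n
        va = self.s2[a] - self.s[a] ** 2 / self.n
        vb = self.s2[b] - self.s[b] ** 2 / self.n
        if va <= 0.0 or vb <= 0.0:
            return 0.0  # assumption: treat a constant stream as uncorrelated
        return cov / math.sqrt(va * vb)

    def dissimilarity(self, a, b):
        """Assumed measure: 0 for r = 1, 1 for r = -1."""
        r = max(-1.0, min(1.0, self.corr(a, b)))
        return math.sqrt((1.0 - r) / 2.0)

    def farthest_pair(self):
        return max(self.sp, key=lambda p: self.dissimilarity(*p))

    def diameter(self):
        """Largest pairwise dissimilarity inside the node."""
        return self.dissimilarity(*self.farthest_pair())

    def split(self):
        """Use the farthest pair as pivots; each stream joins the closer pivot."""
        p, q = self.farthest_pair()
        left, right = [p], [q]
        for a in self.names:
            if a in (p, q):
                continue
            closer_to_p = self.dissimilarity(a, p) <= self.dissimilarity(a, q)
            (left if closer_to_p else right).append(a)
        return left, right
```

A caller would feed one dict per time step through `update` and then inspect `diameter()` or call `split()`. Because the pairwise sums grow quadratically with the number of streams in a node, splitting a node into smaller children reduces the work per example, which is consistent with the abstract's remark that cost decreases as the structure expands.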
Pages: 615-627
Number of pages: 13
Related papers
50 records in total
  • [1] ODAC: Hierarchical Clustering of Time Series Data Streams
    Rodrigues, Pedro Pereira
    Gama, Joao
    Pedroso, Joao Pedro
    PROCEEDINGS OF THE SIXTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2006, : 499 - 503
  • [2] An Effective Performance of Fuzzy Hierarchical Clustering Using Time Series Data Streams
    Kavitha, V.
    Punithavalli, M.
    COMPUTER NETWORKS AND INFORMATION TECHNOLOGIES, 2011, 142 : 242 - +
  • [3] Clustering of multivariate time-series data
    Singhal, A
    Seborg, DE
    PROCEEDINGS OF THE 2002 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2002, 1-6 : 3931 - 3936
  • [4] Clustering multivariate time-series data
    Singhal, A
    Seborg, DE
    JOURNAL OF CHEMOMETRICS, 2005, 19 (08) : 427 - 438
  • [5] Clustering to Forecast Sparse Time-Series Data
    Jha, Abhay
    Ray, Shubhankar
    Seaman, Brian
    Dhillon, Inderjit S.
    2015 IEEE 31ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2015, : 1388 - 1399
  • [6] Logical Clustering and Learning for Time-Series Data
    Vazquez-Chanlatte, Marcell
    Deshmukh, Jyotirmoy V.
    Jin, Xiaoqing
    Seshia, Sanjit A.
    COMPUTER AIDED VERIFICATION, CAV 2017, PT I, 2017, 10426 : 305 - 325
  • [7] Time-series data dynamic density clustering
    Chen, Hao
    Xia, Yu
    Pan, Yuekai
    Yang, Qing
    INTELLIGENT DATA ANALYSIS, 2021, 25 (06) : 1487 - 1506
  • [8] Adaptive forecasting method for time-series data streams
    School of Computer Science and Engineering, Southeast University, Nanjing 210096, China
    Unknown
    Zidonghua Xuebao, 2007, 2 (197-201):
  • [9] Hierarchical clustering of functional MRI time-series by deterministic annealing
    Wismüller, A
    Dersch, DR
    Lipinski, B
    Hahn, K
    Auer, D
    MEDICAL DATA ANALYSIS, PROCEEDINGS, 2000, 1933 : 49 - 54
  • [10] Application of Agglomerative Hierarchical Clustering for Clustering of Time Series Data
    Radovanovic, Ana
    Li, Junshi
    Milanovic, Jovica, V
    Milosavljevic, Nina
    Storchi, Riccardo
    2020 IEEE PES INNOVATIVE SMART GRID TECHNOLOGIES EUROPE (ISGT-EUROPE 2020): SMART GRIDS: KEY ENABLERS OF A GREEN POWER SYSTEM, 2020, : 640 - 644