Clustering of time series data - a survey

被引:1643
|
作者
Liao, TW [1 ]
机构
[1] Louisiana State Univ, Dept Ind & Mfg Syst Engn, Baton Rouge, LA 70803 USA
关键词
time series data; clustering; distance measure; data mining;
D O I
10.1016/j.patcog.2005.01.025
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Time series clustering has been shown effective in providing useful information in various domains. There seems to be an increased interest in time series clustering as part of the effort in temporal data mining research. To provide an overview, this paper surveys and summarizes previous works that investigated the clustering of time series data in various application domains. The basics of time series clustering are presented, including general-purpose clustering algorithms commonly used in time series clustering studies, the criteria for evaluating the performance of the clustering results, and the measures to determine the similarity/dissimilarity between two time series being compared, either in the forms of raw data, extracted features, or some model parameters. The past researchs are organized into three groups depending upon whether they work directly with the raw data either in the time or frequency domain, indirectly with features extracted from the raw data, or indirectly with models built from the raw data. The uniqueness and limitation of previous research are discussed and several possible topics for future research are identified. Moreover, the areas that time series clustering have been applied to are also summarized, including the sources of data used. It is hoped that this review will serve as the steppingstone for those interested in advancing this area of research. (c) 2005 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:1857 / 1874
页数:18
相关论文
共 50 条
  • [21] Logical Clustering and Learning for Time-Series Data
    Vazquez-Chanlatte, Marcell
    Deshmukh, Jyotirmoy V.
    Jin, Xiaoqing
    Seshia, Sanjit A.
    COMPUTER AIDED VERIFICATION, CAV 2017, PT I, 2017, 10426 : 305 - 325
  • [22] Clustering to Forecast Sparse Time-Series Data
    Jha, Abhay
    Ray, Shubhankar
    Seaman, Brian
    Dhillon, Inderjit S.
    2015 IEEE 31ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2015, : 1388 - 1399
  • [23] Clustering short time series gene expression data
    Ernst, J
    Nau, GJ
    Bar-Joseph, Z
    BIOINFORMATICS, 2005, 21 : I159 - I168
  • [24] Time Series Clustering from High Dimensional Data
    Drago, Carlo
    Scepi, Germana
    CLUSTERING HIGH-DIMENSIONAL DATA, CHDD 2012, 2015, 7627 : 72 - 86
  • [25] Optimized Data Acquisition by Time Series Clustering in OPC
    Huang, Tze-Haw
    Song, XingXing
    Huang, Mao Lin
    2011 6TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2011, : 2486 - 2492
  • [26] Encoding Time Series Data for Better Clustering Results
    Barton, Tomas
    Kordik, Pavel
    INTERNATIONAL JOINT CONFERENCE CISIS'12 - ICEUTE'12 - SOCO'12 SPECIAL SESSIONS, 2013, 189 : 467 - 475
  • [27] Using independent component for clustering of time series data
    Safadi, Thelma
    APPLIED MATHEMATICS AND COMPUTATION, 2014, 243 : 522 - 527
  • [28] Time-series data dynamic density clustering
    Chen, Hao
    Xia, Yu
    Pan, Yuekai
    Yang, Qing
    INTELLIGENT DATA ANALYSIS, 2021, 25 (06) : 1487 - 1506
  • [29] Characteristic-based clustering for time series data
    Wang, Xiaozhe
    Smith, Kate
    Hyndman, Rob
    DATA MINING AND KNOWLEDGE DISCOVERY, 2006, 13 (03) : 335 - 364
  • [30] Persistent homology for time series and spatial data clustering
    Pereira, Cassio M. M.
    de Mello, Rodrigo F.
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (15-16) : 6026 - 6038