Cluster-based stability evaluation in time series data sets

被引:3
|
作者
Klassen, Gerhard [1 ]
Tatusch, Martha [1 ]
Conrad, Stefan [1 ]
机构
[1] Heinrich Heine Univ Dusseldorf, Dusseldorf, Germany
关键词
Time series clustering; Over-time stability evaluation; Evolutionary clustering; Anomalous subsequences; SUBSEQUENCES; VALIDATION;
D O I
10.1007/s10489-022-04231-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In modern data analysis, time is often considered just another feature. Yet time has a special role that is regularly overlooked. Procedures are usually only designed for time-independent data and are therefore often unsuitable for the temporal aspect of the data. This is especially the case for clustering algorithms. Although there are a few evolutionary approaches for time-dependent data, the evaluation of these and therefore the selection is difficult for the user. In this paper, we present a general evaluation measure that examines clusterings with respect to their temporal stability and thus provides information about the achieved quality. For this purpose, we examine the temporal stability of time series with respect to their cluster neighbors, the temporal stability of clusters with respect to their composition, and finally conclude on the temporal stability of the entire clustering. We summarise these components in a parameter-free toolkit that we call Cluster Over-Time Stability Evaluation (CLOSE). In addition to that we present a fuzzy variant which we call FCSETS (Fuzzy Clustering Stability Evaluation of Time Series). These toolkits enable a number of advanced applications. One of these is parameter selection for any type of clustering algorithm. We demonstrate parameter selection as an example and evaluate results of classical clustering algorithms against a well-known evolutionary clustering algorithm. We then introduce a method for outlier detection in time series data based on CLOSE. We demonstrate the practicality of our approaches on three real world data sets and one generated data set.
引用
收藏
页码:16606 / 16629
页数:24
相关论文
共 50 条
  • [21] Evaluation of a cluster-based system for the OLTP application
    Hahn, WJ
    Yoon, SH
    Lee, K
    Dubois, M
    ETRI JOURNAL, 1998, 20 (04) : 301 - 326
  • [22] Cluster-Based Cooperative Data Service for VANETs
    Shi, Yongyue
    Peng, Xiao-Hong
    Shen, Hang
    Bai, Guangwei
    WIRELESS INTERNET (WICON 2017), 2018, 230 : 119 - 129
  • [23] Cluster-Based Prediction for Batteries in Data Centers
    Haider, Syed Naeem
    Zhao, Qianchuan
    Li, Xueliang
    ENERGIES, 2020, 13 (05)
  • [24] Cluster-based sampling of multiclass imbalanced data
    Prachuabsupakij, Wanthanee
    Soonthornphisaj, Nuanwan
    INTELLIGENT DATA ANALYSIS, 2014, 18 (06) : 1109 - 1135
  • [25] Cluster-based IP router: Implementation and evaluation
    Ye, Qinghua
    MacGregor, Mike H.
    2006 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING, VOLS 1 AND 2, 2006, : 21 - +
  • [26] Evaluation of a programmable cluster-based IP router
    Pradhan, P
    Chiueh, TC
    NINTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, PROCEEDINGS, 2002, : 321 - 326
  • [27] Localization techniques for cluster-based data grid
    Hsu, CH
    Lin, GH
    Li, KC
    Yang, CT
    DISTRIBUTED AND PARALLEL COMPUTING, 2005, 3719 : 83 - 92
  • [28] A Cluster-Based Cooperative Data Transmission in VANETs
    Fu, Qi
    Chen, Anhua
    Jiang, Yunxia
    Tang, Mingdong
    COLLABORATE COMPUTING: NETWORKING, APPLICATIONS AND WORKSHARING, COLLABORATECOM 2016, 2017, 201 : 563 - 568
  • [29] Cluster-based Data Reduction for Persistent Homology
    Moitra, Anindya
    Malott, Nicholas O.
    Wilsey, Philip A.
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 327 - 334
  • [30] Evaluation of a cluster-based system for the OLTP application
    AIT, Cupertino, CA, United States
    不详
    不详
    ETRI J, 4 (301-326):