Cluster-based stability evaluation in time series data sets

Cited by: 3
Authors
Klassen, Gerhard [1]
Tatusch, Martha [1]
Conrad, Stefan [1]
Affiliations
[1] Heinrich Heine Univ Dusseldorf, Dusseldorf, Germany
Keywords
Time series clustering; Over-time stability evaluation; Evolutionary clustering; Anomalous subsequences; SUBSEQUENCES; VALIDATION;
DOI
10.1007/s10489-022-04231-7
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In modern data analysis, time is often considered just another feature. Yet time has a special role that is regularly overlooked. Procedures are usually designed only for time-independent data and are therefore often unsuitable for the temporal aspect of the data. This is especially the case for clustering algorithms. Although there are a few evolutionary approaches for time-dependent data, evaluating them, and therefore choosing among them, is difficult for the user. In this paper, we present a general evaluation measure that examines clusterings with respect to their temporal stability and thus provides information about the achieved quality. For this purpose, we examine the temporal stability of time series with respect to their cluster neighbors, the temporal stability of clusters with respect to their composition, and finally conclude on the temporal stability of the entire clustering. We summarise these components in a parameter-free toolkit that we call Cluster Over-Time Stability Evaluation (CLOSE). In addition, we present a fuzzy variant which we call FCSETS (Fuzzy Clustering Stability Evaluation of Time Series). These toolkits enable a number of advanced applications. One of these is parameter selection for any type of clustering algorithm. We demonstrate parameter selection as an example and evaluate results of classical clustering algorithms against a well-known evolutionary clustering algorithm. We then introduce a method for outlier detection in time series data based on CLOSE. We demonstrate the practicality of our approaches on three real-world data sets and one generated data set.
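The idea of scoring a time series by how stable its cluster neighbors remain over time can be illustrated with a small sketch. This is not the CLOSE or FCSETS formulation, which this record does not reproduce; it is a simplified stand-in that assumes every series is clustered at every timestamp and measures neighbor stability with the Jaccard index between consecutive timestamps. The function names and the toy data are hypothetical.

```python
from statistics import mean


def cluster_mates(labels, series_id):
    """Return the set of series sharing a cluster with series_id (itself included)."""
    own = labels[series_id]
    return {s for s, c in labels.items() if c == own}


def jaccard(a, b):
    """Jaccard index of two sets; two empty sets count as identical."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def series_stability(clusterings, series_id):
    """Average neighbor overlap of one series across consecutive timestamps.

    Assumption: clusterings is a list of dicts (one per timestamp) mapping
    series id -> cluster label, and every series appears at every timestamp.
    """
    scores = [
        jaccard(cluster_mates(prev, series_id), cluster_mates(curr, series_id))
        for prev, curr in zip(clusterings, clusterings[1:])
    ]
    return mean(scores) if scores else 1.0


def clustering_stability(clusterings):
    """Average the per-series scores into one score for the whole clustering."""
    ids = list(clusterings[0].keys())
    return mean(series_stability(clusterings, s) for s in ids)


# Toy data: series "b" leaves the cluster of "a" at the last timestamp.
clusterings = [
    {"a": 0, "b": 0, "c": 1, "d": 1},
    {"a": 0, "b": 0, "c": 1, "d": 1},
    {"a": 0, "b": 1, "c": 1, "d": 1},
]
scores = {s: series_stability(clusterings, s) for s in clusterings[0]}
overall = clustering_stability(clusterings)
```

Because the comparison uses sets of neighbors rather than label values, cluster labels do not need to be aligned between timestamps. In the toy data, "b" receives the lowest score since it is the series that changes clusters, which hints at how such a score could flag candidates for outlier detection.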
Pages: 16606 - 16629
Page count: 24
Related papers
50 in total
  • [1] Cluster-based stability evaluation in time series data sets
    Gerhard Klassen
    Martha Tatusch
    Stefan Conrad
    Applied Intelligence, 2023, 53 : 16606 - 16629
  • [2] Cluster-Based Similarity Search in Time Series
    Karamitopoulos, Leonidas
    Evangelidis, Georgios
    PROCEEDINGS OF THE 2009 FOURTH BALKAN CONFERENCE IN INFORMATICS, 2009, : 113 - 118
  • [3] Cluster-based genetic segmentation of time series with DWT
    Tseng, Vincent S.
    Chen, Chun-Hao
    Huang, Pai-Chieh
    Hong, Tzung-Pei
    PATTERN RECOGNITION LETTERS, 2009, 30 (13) : 1190 - 1197
  • [4] A CLUSTER-BASED APPROACH TO CONTENT BASED TIME SERIES RETRIEVAL (CBTSR)
    Bovolo, Francesca
    Demir, Beguem
    Bruzzone, Lorenzo
    2015 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2015, : 2793 - 2796
  • [5] A Cluster-based Genetic Approach for Segmentation of Time Series and Pattern Discovery
    Tseng, Vincent S.
    Chen, Chun-Hao
    Huang, Pai-Chieh
    Hong, Tzung-Pei
    2008 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-8, 2008, : 1949 - +
  • [6] From Cluster-Based Outlier Detection to Time Series Discord Discovery
    Nguyen Huy Kha
    Duong Tuan Anh
    TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2015, 2015, 9441 : 16 - 28
  • [7] Recognition of signed expressions using cluster-based segmentation of time series
    Oszust M.
    Wysocki M.
    Advances in Intelligent and Soft Computing, 2010, 84 : 167 - 174
  • [8] Cluster-based evaluation in fuzzy-genetic data mining
    Chen, Chun-Hao
    Tseng, Vincent S.
    Hong, Tzung-Pei
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2008, 16 (01) : 249 - 262
  • [9] Cluster-Based Empirical Tropospheric Corrections Applied to InSAR Time Series Analysis
    Murray, Kyle Dennis
    Lohman, Rowena B.
    Bekaert, David P. S.
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (03) : 2204 - 2212
  • [10] A Cluster-Based Data Assimilation Approach to Generate New Daily Gridded Time Series Precipitation Data in the Himalayan River Basins
    Singh, Japjeet
    Singh, Vishal
    Ojha, Chandra Shekhar Prasad
    WATER RESOURCES RESEARCH, 2025, 61 (01)