Cluster-based stability evaluation in time series data sets

Cited by: 3
Authors
Klassen, Gerhard [1]
Tatusch, Martha [1]
Conrad, Stefan [1]
Affiliations
[1] Heinrich Heine Univ Dusseldorf, Dusseldorf, Germany
Keywords
Time series clustering; Over-time stability evaluation; Evolutionary clustering; Anomalous subsequences; SUBSEQUENCES; VALIDATION
DOI
10.1007/s10489-022-04231-7
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In modern data analysis, time is often treated as just another feature, yet its special role is regularly overlooked. Most procedures are designed for time-independent data and are therefore often unsuitable for the temporal aspect of the data. This is especially true of clustering algorithms. Although a few evolutionary approaches for time-dependent data exist, evaluating them, and therefore choosing among them, is difficult for the user. In this paper, we present a general evaluation measure that examines clusterings with respect to their temporal stability and thus provides information about the achieved quality. For this purpose, we examine the temporal stability of time series with respect to their cluster neighbors and the temporal stability of clusters with respect to their composition, and from these we draw conclusions about the temporal stability of the entire clustering. We summarise these components in a parameter-free toolkit that we call Cluster Over-Time Stability Evaluation (CLOSE). In addition, we present a fuzzy variant that we call FCSETS (Fuzzy Clustering Stability Evaluation of Time Series). These toolkits enable a number of advanced applications, one of which is parameter selection for any type of clustering algorithm. We demonstrate parameter selection as an example and evaluate the results of classical clustering algorithms against a well-known evolutionary clustering algorithm. We then introduce a method for outlier detection in time series data based on CLOSE. We demonstrate the practicality of our approaches on three real-world data sets and one generated data set.
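The idea of judging a time series by how stable its cluster neighbors are over time can be illustrated with a small sketch. This is not the CLOSE measure defined in the paper, whose formulas are not given in this record. It is a simplified stand-in that assumes a hard clustering per timestamp and scores each series by the fraction of its cluster neighbors that stay with it at the next timestamp. The names `neighbor_retention` and `over_time_stability` are hypothetical.

```python
from typing import Dict, Hashable, List

# One clustering per timestamp: series id -> cluster label (hard assignment).
Clustering = Dict[Hashable, int]


def neighbor_retention(prev: Clustering, curr: Clustering, series: Hashable) -> float:
    """Fraction of the series' cluster neighbors in `prev` that share its cluster in `curr`."""
    neighbors = [s for s, label in prev.items() if label == prev[series] and s != series]
    if not neighbors:
        # Assumption: a series that had no neighbors cannot lose any, so it counts as stable.
        return 1.0
    if series not in curr:
        return 0.0
    kept = sum(1 for s in neighbors if s in curr and curr[s] == curr[series])
    return kept / len(neighbors)


def over_time_stability(clusterings: List[Clustering]) -> float:
    """Mean neighbor retention over all series and all consecutive timestamp pairs."""
    scores = [
        neighbor_retention(prev, curr, s)
        for prev, curr in zip(clusterings, clusterings[1:])
        for s in prev
    ]
    # Assumption: with fewer than two timestamps there is nothing to compare.
    return sum(scores) / len(scores) if scores else 1.0


# Illustrative data only: series "c" leaves its neighbor "d" between t0 and t1.
t0 = {"a": 0, "b": 0, "c": 1, "d": 1}
t1 = {"a": 0, "b": 0, "c": 0, "d": 1}
score = over_time_stability([t0, t1])
```

Because only co-membership is compared, the score does not depend on how cluster labels are named at each timestamp. A score like this could be computed for several parameter settings of a clustering algorithm and the most stable setting preferred, which is the kind of parameter selection the abstract mentions.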
Pages: 16606-16629
Page count: 24