Incremental Cluster Validity Index for Predicting Early Signs of Change in Data Streams

Authors
Ibrahim, Omar A. [1 ]
Reformat, Marek [1 ]
Musilek, Petr [1 ]
Affiliations
[1] Univ Alberta, Dept Elect & Comp Engn, Edmonton, AB, Canada
Keywords
Online clustering; change detection; incremental; internal cluster validity; incremental SD index; incremental Davies-Bouldin index
DOI
10.1109/FUZZ52849.2023.10309720
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we propose an incremental version of the SD cluster validity index for monitoring streaming data and predicting early signs of change. The proposed incremental SD (iSD) index monitors the data stream alongside the MU Streaming Clustering (MUSC) algorithm. We investigate the use of iSD for detecting early signs of change in multiple data streams arriving at the same time. Synthetic and real-life datasets are used to demonstrate the effectiveness of the proposed index. Valuable information about the streaming data, such as the appearance of new patterns and cluster size (based on the analysis of outliers), can be captured directly from the index values. The performance of iSD is compared with that of the incremental Davies-Bouldin index (iDB). iSD takes larger values, which makes it more robust for monitoring large data streams than iDB, which tends to flatten over time and approach zero.
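The abstract does not give the iSD or iDB update equations, so the following is only a minimal sketch of the general idea behind an incremental validity index: keep running per-cluster statistics and recompute the index without storing past points. It assumes a Davies-Bouldin-style index with root-mean-square scatter and a Welford-style running update. The names `RunningCluster` and `davies_bouldin` are hypothetical and are not taken from the paper.

```python
import math


class RunningCluster:
    """Running count, centroid, and sum of squared deviations for one cluster.

    Hypothetical helper for illustration; not the paper's formulation.
    """

    def __init__(self, dim):
        self.n = 0
        self.centroid = [0.0] * dim
        self.sq_dev = 0.0  # sum of squared distances to the current centroid

    def add(self, x):
        # Welford-style update: exact, and no past points are stored.
        self.n += 1
        delta = [xi - ci for xi, ci in zip(x, self.centroid)]
        self.centroid = [ci + di / self.n for ci, di in zip(self.centroid, delta)]
        delta_new = [xi - ci for xi, ci in zip(x, self.centroid)]
        self.sq_dev += sum(a * b for a, b in zip(delta, delta_new))

    def scatter(self):
        # Root-mean-square distance of the cluster's points to its centroid.
        return math.sqrt(self.sq_dev / self.n) if self.n else 0.0


def davies_bouldin(clusters):
    """Davies-Bouldin-style index computed from running statistics only."""
    active = [c for c in clusters if c.n > 0]
    if len(active) < 2:
        return 0.0
    total = 0.0
    for i, ci in enumerate(active):
        worst = 0.0
        for j, cj in enumerate(active):
            if i == j:
                continue
            d = math.dist(ci.centroid, cj.centroid)
            if d > 0:
                worst = max(worst, (ci.scatter() + cj.scatter()) / d)
        total += worst
    return total / len(active)


# Two compact, well-separated clusters, then one far-away point joins cluster b.
a, b = RunningCluster(2), RunningCluster(2)
for p in [(0.0, 0.0), (2.0, 0.0)]:
    a.add(p)
for p in [(10.0, 0.0), (12.0, 0.0)]:
    b.add(p)
before = davies_bouldin([a, b])
b.add((30.0, 0.0))
after = davies_bouldin([a, b])
```

In this toy stream the index rises once the distant point is absorbed, which illustrates the kind of signal a monitoring index can provide. How iSD itself reacts to outliers and new patterns is described in the paper, not reproduced here.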
Pages: 7