Incremental Cluster Validity Indices for Online Learning of Hard Partitions: Extensions and Comparative Study

被引:17
|
作者
Brito Da Silva, Leonardo Enzo [1 ,2 ]
Melton, Niklas Max [1 ]
Wunsch, Donald C. [1 ]
机构
[1] Missouri Univ Sci & Technol, Appl Computat Intelligence Lab, Rolla, MO 65409 USA
[2] Minist Educ Brazil, CAPES Fdn, BR-70040020 Brasilia, DF, Brazil
来源
IEEE ACCESS | 2020年 / 8卷
关键词
Clustering; validation; incremental cluster validity index (iCVI); adaptive resonance theory (ART); incremental (online) clustering algorithms; data streams; NEURAL-NETWORK; ALGORITHM; ARTMAP; NUMBER;
D O I
10.1109/ACCESS.2020.2969849
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Validation is one of the most important aspects of clustering, particularly when the user is designing a trustworthy or explainable system. However, most clustering validation approaches require batch calculation. This is an important gap because of the value of clustering in real-time data streaming and other online learning applications. Therefore, interest has grown in providing online alternatives for validation. This paper extends the incremental cluster validity index (iCVI) family by presenting incremental versions of Calinski-Harabasz (iCH), Pakhira-Bandyopadhyay-Maulik (iPBM), WB index (iWB), Silhouette (iSIL), Negentropy Increment (iNI), Representative Cross Information Potential (irCIP), Representative Cross Entropy (irH), and Conn&_Index (iConn&_Index). This paper also provides a thorough comparative study of correct, under- and over-partitioning on the behavior of these iCVIs, the Partition Separation (PS) index as well as four recently introduced iCVIs: incremental Xie-Beni (iXB), incremental Davies-Bouldin (iDB), and incremental generalized Dunn's indices 43 and 53 (iGD43 and iGD53). Experiments were carried out using a framework that was designed to be as agnostic as possible to the clustering algorithms. The results on synthetic benchmark data sets showed that while evidence of most under-partitioning cases could be inferred from the behaviors of the majority of these iCVIs, over-partitioning was found to be a more challenging problem, detected by fewer of them. Interestingly, over-partitioning, rather then under-partitioning, was more prominently detected on the real-world data experiments within this study. The expansion of iCVIs provides significant novel opportunities for assessing and interpreting the results of unsupervised lifelong learning in real-time, wherein samples cannot be reprocessed due to memory and/or application constraints.
引用
收藏
页码:22025 / 22047
页数:23
相关论文
共 50 条
  • [1] Incremental Cluster Validity Indices for Online Learning of Hard Partitions: Extensions and Comparative Study
    Brito Da Silva, Leonardo Enzo
    Melton, Niklas Max
    Wunsch, Donald C.
    IEEE Access, 2020, 8 : 22025 - 22047
  • [2] A COMPARATIVE STUDY OF CLUSTER VALIDITY INDICES
    Kondruk, N. E.
    RADIO ELECTRONICS COMPUTER SCIENCE CONTROL, 2019, (04) : 59 - 67
  • [3] An extensive comparative study of cluster validity indices
    Arbelaitz, Olatz
    Gurrutxaga, Ibai
    Muguerza, Javier
    Perez, Jesus M.
    Perona, Inigo
    PATTERN RECOGNITION, 2013, 46 (01) : 243 - 256
  • [4] Approximating Dunn's Cluster Validity Indices for Partitions of Big Data
    Rathore, Punit
    Ghafoori, Zahra
    Bezdek, James C.
    Palaniswami, Marimuthu
    Leckie, Christopher
    IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (05) : 1629 - 1641
  • [5] Incremental Cluster Validity Index-Guided Online Learning for Performance and Robustness to Presentation Order
    Brito da Silva, Leonardo Enzo
    Rayapati, Nagasharath
    Wunsch, Donald C., II
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (10) : 6686 - 6700
  • [6] Online cluster validity indices for performance monitoring of streaming data clustering
    Moshtaghi, Masud
    Bezdek, James C.
    Erfani, Sarah M.
    Leckie, Christopher
    Bailey, James
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2019, 34 (04) : 541 - 563
  • [7] A comparative study of different cluster validity indexes
    Liu, Ruochen
    Sun, Xiaojuan
    Jiao, Licheng
    Li, Yangyang
    TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2012, 34 (07) : 876 - 890
  • [8] A Study of Cluster Validity Indices for Real-Life Data
    Starczewski, Artur
    Krzyzak, Adam
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2017, PT II, 2017, 10246 : 148 - 158
  • [9] A comparison study of cluster validity indices using a nonhierarchical clustering algorithm
    Shim, Yosung
    Chung, Jiwon
    Choi, In-Chan
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE FOR MODELLING, CONTROL & AUTOMATION JOINTLY WITH INTERNATIONAL CONFERENCE ON INTELLIGENT AGENTS, WEB TECHNOLOGIES & INTERNET COMMERCE, VOL 1, PROCEEDINGS, 2006, : 199 - +
  • [10] Childhood Maltreatment and Psychosis: A Comparative Validity Study of Maltreatment Indices
    Beasley, Rhianna E.
    Kivisto, Aaron J.
    Leonhardt, Bethany L.
    Waldron, Jordan S.
    CHILD MALTREATMENT, 2021, 26 (02) : 228 - 237