Incremental Cluster Validity Indices for Online Learning of Hard Partitions: Extensions and Comparative Study

被引:17
|
作者
Brito Da Silva, Leonardo Enzo [1 ,2 ]
Melton, Niklas Max [1 ]
Wunsch, Donald C. [1 ]
机构
[1] Missouri Univ Sci & Technol, Appl Computat Intelligence Lab, Rolla, MO 65409 USA
[2] Minist Educ Brazil, CAPES Fdn, BR-70040020 Brasilia, DF, Brazil
来源
IEEE ACCESS | 2020年 / 8卷
关键词
Clustering; validation; incremental cluster validity index (iCVI); adaptive resonance theory (ART); incremental (online) clustering algorithms; data streams; NEURAL-NETWORK; ALGORITHM; ARTMAP; NUMBER;
D O I
10.1109/ACCESS.2020.2969849
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Validation is one of the most important aspects of clustering, particularly when the user is designing a trustworthy or explainable system. However, most clustering validation approaches require batch calculation. This is an important gap because of the value of clustering in real-time data streaming and other online learning applications. Therefore, interest has grown in providing online alternatives for validation. This paper extends the incremental cluster validity index (iCVI) family by presenting incremental versions of Calinski-Harabasz (iCH), Pakhira-Bandyopadhyay-Maulik (iPBM), WB index (iWB), Silhouette (iSIL), Negentropy Increment (iNI), Representative Cross Information Potential (irCIP), Representative Cross Entropy (irH), and Conn&_Index (iConn&_Index). This paper also provides a thorough comparative study of correct, under- and over-partitioning on the behavior of these iCVIs, the Partition Separation (PS) index as well as four recently introduced iCVIs: incremental Xie-Beni (iXB), incremental Davies-Bouldin (iDB), and incremental generalized Dunn's indices 43 and 53 (iGD43 and iGD53). Experiments were carried out using a framework that was designed to be as agnostic as possible to the clustering algorithms. The results on synthetic benchmark data sets showed that while evidence of most under-partitioning cases could be inferred from the behaviors of the majority of these iCVIs, over-partitioning was found to be a more challenging problem, detected by fewer of them. Interestingly, over-partitioning, rather then under-partitioning, was more prominently detected on the real-world data experiments within this study. The expansion of iCVIs provides significant novel opportunities for assessing and interpreting the results of unsupervised lifelong learning in real-time, wherein samples cannot be reprocessed due to memory and/or application constraints.
引用
收藏
页码:22025 / 22047
页数:23
相关论文
共 50 条
  • [31] Comparative Study of Cluster Validity Techniques Using K-Mediod Algorithm
    Riyaz, Romana
    Wani, Mohd Arif
    2015 2ND INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2015, : 893 - 898
  • [32] A validity and reliability study of the Online Cooperative Learning Attitude Scale (OCLAS)
    Korkmaz, Ozgen
    COMPUTERS & EDUCATION, 2012, 59 (04) : 1162 - 1169
  • [33] Online adaptive learning: A study of score validity of the adaptive self-regulated learning model
    Harati H.
    Yen C.-J.
    Tu C.-H.
    Cruickshank B.J.
    Armfield S.W.J.
    International Journal of Web-Based Learning and Teaching Technologies, 2020, 15 (04) : 18 - 35
  • [34] A Comparative Study of Textbook Learning and Online Learning among Undergraduate Medical Students
    Prabhu, Sujatha P.
    Bin Mdnasir, Mohdhafizuddin
    Bin Lokman, Adliyunus
    Bin Abdullah, Ahmad Hanif
    Bin Mohdhanafi, Muhammad Farris
    RESEARCH JOURNAL OF PHARMACEUTICAL BIOLOGICAL AND CHEMICAL SCIENCES, 2016, 7 (04): : 2014 - 2017
  • [35] A comparative study on student perceptions of face-to-face learning and online learning
    Can, Gurhan
    Saglam, Mustafa
    Eristi, Bahadir
    Kurum, Dilruba
    PROCEEDINGS OF THE 6TH WSEAS INTERNATIONAL CONFERENCE ON EDUCATION AND EDUCATIONAL TECHNOLOGY (EDU'07): NEW HORIZONS IN EDUCATION AND EDUCATIONAL TECHNOLOGY, 2007, : 41 - +
  • [36] Autonomous Online Learning of Velocity Kinematics on the iCub: a Comparative Study
    Droniou, Alain
    Ivaldi, Serena
    Padois, Vincent
    Sigaud, Olivier
    2012 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2012, : 3577 - 3582
  • [37] A comparative study of ANN online-offline learning algorithm
    Guo, HX
    Zhu, KJ
    Li, CP
    Chen, X
    Proceedings of the 2005 International Conference on Management Science & Engineering (12th), Vols 1- 3, 2005, : 239 - 245
  • [38] A comparative study of simple online learning strategies for streaming data
    Universitat Jaume I, Dept. Llenguatges i Sistemes Informátics, Av. Sos Baynat s/n, 12071 Castelló de la Plana, Spain
    WSEAS Trans. Circuits Syst., 2008, 10 (900-910):
  • [39] A Comparative Study on Cluster Validity Criteria in Linear Fuzzy Clustering and Pareto Optimality Analysis
    Honda, Katsuhiro
    Nomaguchi, Tomonari
    Notsu, Akira
    Ichihashi, Hidetomo
    2009 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, 2009, : 1101 - 1106
  • [40] ADAPTATION OF STUDENTS' ACCEPTANCE OF ONLINE LEARNING SCALE INTO TURKISH: VALIDITY AND RELIABILITY STUDY
    Akyurek, Muhammet Ibrahim
    Battal, Ali
    TURKISH ONLINE JOURNAL OF DISTANCE EDUCATION, 2024, 25 (04): : 97 - 108