Incremental Clustering for Categorical Data Using Clustering Ensemble

被引:0
|
作者
Li Taoying [1 ]
Chne Yan [1 ]
Qu Lili [1 ]
Mu Xiangwei [1 ]
机构
[1] Dalian Maritime Univ, Transportat Management Coll, Dalian 116026, Peoples R China
关键词
DataMining; Clustering; Incremental Clustering; Clustering Ensemble; K-MEANS ALGORITHM;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
More and more data in practice is changing every minute and been collected in incremental mode, and incremental clustering has attracted much of researchers' attention. However, little research now focuses on partitioning categorical data in incremental mode. How to design incremental clustering for categorical data is an urgent problem. We propose an incremental clustering for categorical data using clustering ensemble in this paper. We firstly prune redundant attributes if needed, and then make use of true values of different attributes to form clustering memberships, and next use clustering ensemble to merge or divide clusters to gain optimal clustering. Finally, the proposed algorithm is applied in Yellow- Small dataset, Diagnosis dataset and Zoo dataset and results show that it is effective.
引用
收藏
页码:2519 / 2524
页数:6
相关论文
共 50 条
  • [21] Clustering categorical data streams
    He, Zengyou
    Xu, Xiaofei
    Deng, Shengchun
    Huang, Joshua Zhexue
    JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2011, 11 (04) : 185 - 192
  • [22] Subtractive Clustering for Categorical Data
    Gu, Lei
    2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 1229 - 1232
  • [23] Evaluation of Categorical Data Clustering
    Rezankova, Hana
    Loster, Tomas
    Husek, Dusan
    ADVANCES IN INTELLIGENT WEB MASTERING 3, 2011, 86 : 173 - 182
  • [24] Clustering Categorical Data: A Survey
    Naouali, Sami
    Ben Salem, Semeh
    Chtourou, Zied
    INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2020, 19 (01) : 49 - 96
  • [25] On data labeling for clustering categorical data
    Chen, Hung-Leng
    Chuang, Kun-Ta
    Chen, Ming-Syan
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2008, 20 (11) : 1458 - 1471
  • [26] Incremental entropy-based clustering on categorical data streams with concept drift
    Li, Yanhong
    Li, Deyu
    Wang, Suge
    Zhai, Yanhui
    KNOWLEDGE-BASED SYSTEMS, 2014, 59 : 33 - 47
  • [27] Weighted Delta Factor Cluster Ensemble Algorithm for Categorical Data Clustering in Data Mining
    Sengottaian, Sarumathi
    Natesan, Shanthi
    Mathivanan, Sharmila
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2017, 14 (03) : 275 - 284
  • [28] Fuzzy clustering of categorical data using fuzzy centroids
    Kim, DW
    Lee, KH
    Lee, D
    PATTERN RECOGNITION LETTERS, 2004, 25 (11) : 1263 - 1271
  • [29] Clustering Categorical Data Using an Extended Modularity Measure
    Labiod, Lazhar
    Grozavu, Nistor
    Bennani, Younes
    NEURAL INFORMATION PROCESSING: MODELS AND APPLICATIONS, PT II, 2010, 6444 : 310 - 320
  • [30] Clustering Categorical Data Using Community Detection Techniques
    Huu Hiep Nguyen
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2017, 2017