Incremental Clustering for Categorical Data Using Clustering Ensemble

Cited by: 0
Authors
Li Taoying [1]
Chen Yan [1]
Qu Lili [1]
Mu Xiangwei [1]
Affiliations
[1] Dalian Maritime Univ, Transportat Management Coll, Dalian 116026, Peoples R China
Keywords
Data Mining; Clustering; Incremental Clustering; Clustering Ensemble; K-MEANS ALGORITHM
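DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract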
More and more real-world data changes every minute and is collected incrementally, so incremental clustering has attracted considerable attention from researchers. However, little research currently focuses on partitioning categorical data in incremental mode, and designing an incremental clustering method for categorical data is an urgent problem. In this paper, we propose an incremental clustering algorithm for categorical data based on clustering ensemble. We first prune redundant attributes if needed, then use the true values of the different attributes to form clustering memberships, and finally use clustering ensemble to merge or divide clusters to obtain an optimal clustering. The proposed algorithm is applied to the Yellow-Small, Diagnosis, and Zoo datasets, and the results show that it is effective.
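The steps named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' algorithm, which the abstract does not specify in detail. It assumes that each attribute acts as one base clustering (records sharing a value share a label), that a "redundant" attribute is one with a single distinct value, and that the ensemble consensus is the fraction of attributes on which two records agree (a co-association score). The function names and the `threshold` parameter are hypothetical, and the merge/divide refinement step is omitted.

```python
def kept_attributes(records):
    """Indices of attributes with more than one distinct value.

    Assumption: a constant attribute is treated as redundant.
    """
    n_attrs = len(records[0])
    return [j for j in range(n_attrs) if len({r[j] for r in records}) > 1]


def agreement(a, b):
    """Fraction of attributes (base clusterings) that put a and b together."""
    return sum(x == y for x, y in zip(a, b)) / len(a)


def incremental_cluster(records, threshold=0.6):
    """Assign records one at a time, in arrival order.

    A record joins the existing cluster with the highest average agreement
    if that score reaches `threshold`; otherwise it starts a new cluster.
    """
    clusters = []
    for rec in records:
        best, best_score = None, 0.0
        for members in clusters:
            score = sum(agreement(rec, m) for m in members) / len(members)
            if score > best_score:
                best, best_score = members, score
        if best is not None and best_score >= threshold:
            best.append(rec)
        else:
            clusters.append([rec])
    return clusters


# Toy records (made up for illustration, not taken from the paper's datasets).
raw = [
    ("yellow", "small", "stretch", "adult"),
    ("yellow", "small", "dip", "adult"),
    ("purple", "large", "dip", "adult"),
    ("purple", "large", "stretch", "adult"),
]
keep = kept_attributes(raw)  # the constant last attribute is dropped
pruned = [tuple(r[j] for j in keep) for r in raw]
clusters = incremental_cluster(pruned, threshold=0.6)
```

Because records are processed in arrival order, the result of such a scheme depends on that order and on the threshold; a merge or divide step of the kind the abstract mentions would be one way to correct early assignments.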
Pages: 2519 - 2524
Number of pages: 6
Related papers
50 in total
  • [1] Incremental clustering algorithm of mixed numerical and categorical data based on clustering ensemble
    Li, Tao-Ying
    Chen, Yan
    Zhang, Jin-Song
    Qin, Sheng-Jun
    Kongzhi yu Juece/Control and Decision, 2012, 27 (04): : 603 - 608
  • [2] Clustering Categorical Data: A Cluster Ensemble Approach
    He, Zengyou
    High Technology Letters, 2003, (04) : 8 - 12
  • [3] An Incremental Clustering with Attribute Unbalance Considered for Categorical Data
    Chen, Jize
    Yang, Zhimin
    Yin, Jian
    Yang, Xiaobo
    Huang, Li
    COMPUTATIONAL INTELLIGENCE AND INTELLIGENT SYSTEMS, 2009, 51 : 433 - +
  • [4] Categorical Data Clustering Based on Cluster Ensemble Process
    Veeraiah, D.
    Vasumathi, D.
    PROCEEDINGS OF THE INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2015, VOL 2, 2016, 439 : 101 - 111
  • [5] Fuzzy Clustering Ensemble Algorithm for Partitioning Categorical Data
    Li, Taoying
    Chen, Yan
    2009 INTERNATIONAL CONFERENCE ON BUSINESS INTELLIGENCE AND FINANCIAL ENGINEERING, PROCEEDINGS, 2009, : 170 - 174
  • [6] Ensemble based rough fuzzy clustering for categorical data
    Saha, Indrajit
    Sarkar, Jnanendra Prasad
    Maulik, Ujjwal
    KNOWLEDGE-BASED SYSTEMS, 2015, 77 : 114 - 127
  • [7] Incremental Semi-Supervised Clustering Ensemble for High Dimensional Data Clustering
    Yu, Zhiwen
    Luo, Peinan
    You, Jane
    Wong, Hau-San
    Leung, Hareton
    Wu, Si
    Zhang, Jun
    Han, Guoqiang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (03) : 701 - 714
  • [8] Incremental Semi-supervised Clustering Ensemble for High Dimensional Data Clustering
    Yu, Zhiwen
    Luo, Peinan
    Wu, Si
    Han, Guoqiang
    You, Jane
    Leung, Hareton
    Wong, Hau-San
    Zhang, Jun
    2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 1484 - 1485
  • [9] Improving Quality of Ensemble Technique for Categorical Data Clustering Using Granule Computing
    Brnawy, Rahmah
    Shiri, Nematollaah
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2021, PT I, 2021, 12923 : 261 - 272
  • [10] Incremental learning based multiobjective fuzzy clustering for categorical data
    Saha, Indrajit
    Maulik, Ujjwal
    INFORMATION SCIENCES, 2014, 267 : 35 - 57