A Fast Incremental Clustering Algorithm

Cited by: 0
Authors
Su, Xiaoke [1]
Lan, Yang [2]
Wan, Renxia [1]
Qin, Yuming [1]
Affiliations
[1] Donghua Univ, Coll Informat Sci & Technol, Shanghai 201620, Peoples R China
[2] Xinyang Normal Univ, Sch Comp & Informat Technol, Xinyang 464000, Henan, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
incremental clustering; categorical data; radius threshold value; inter-cluster dissimilarity measure; clustering accuracy; data mining;
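DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract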
Clustering plays an important role in data mining. This paper proposes a fast incremental clustering algorithm that changes the radius threshold value dynamically. The algorithm restricts the number of final clusters and reads the original dataset only once. It also introduces an inter-cluster dissimilarity measure that takes the frequency information of the attribute values into account, so the algorithm can be applied to categorical data. Experimental results on the mushroom dataset show that the proposed algorithm is feasible and effective and that it can be used on large-scale datasets.
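The abstract does not give the paper's formulas, so the following Python is only a minimal sketch of the general idea, not the authors' algorithm. It makes a single pass over categorical records, caps the number of clusters, and enlarges the radius threshold whenever the cap is exceeded. The names `Cluster`, `record_dissimilarity`, `cluster_dissimilarity`, `merge_close` and `incremental_cluster`, the two frequency-based dissimilarity formulas, and the parameters `radius`, `max_clusters` and `growth` are all assumptions chosen for illustration.

```python
from collections import Counter


class Cluster:
    """Cluster summary: size plus per-attribute value frequencies (assumed design)."""

    def __init__(self, n_attrs):
        self.size = 0
        self.freq = [Counter() for _ in range(n_attrs)]

    def add(self, record):
        self.size += 1
        for counter, value in zip(self.freq, record):
            counter[value] += 1

    def merge(self, other):
        self.size += other.size
        for mine, theirs in zip(self.freq, other.freq):
            mine.update(theirs)


def record_dissimilarity(record, cluster):
    # Assumed measure: mean over attributes of
    # 1 - (relative frequency of the record's value inside the cluster).
    return sum(1 - c[v] / cluster.size
               for c, v in zip(cluster.freq, record)) / len(record)


def cluster_dissimilarity(a, b):
    # Assumed measure: mean over attributes of
    # 1 - sum_v p_a(v) * p_b(v), using the value frequencies of both clusters.
    total = 0.0
    for ca, cb in zip(a.freq, b.freq):
        overlap = sum((ca[v] / a.size) * (cb[v] / b.size) for v in ca)
        total += 1 - overlap
    return total / len(a.freq)


def merge_close(clusters, radius):
    # Greedily fold each cluster into the first kept cluster within `radius`.
    merged = []
    for c in clusters:
        for m in merged:
            if cluster_dissimilarity(c, m) <= radius:
                m.merge(c)
                break
        else:
            merged.append(c)
    return merged


def incremental_cluster(records, radius=0.3, max_clusters=10, growth=1.2):
    if radius <= 0 or growth <= 1 or max_clusters < 1:
        raise ValueError("need radius > 0, growth > 1, max_clusters >= 1")
    clusters = []
    for record in records:  # each record is read exactly once
        if clusters:
            best = min(clusters, key=lambda c: record_dissimilarity(record, c))
            if record_dissimilarity(record, best) <= radius:
                best.add(record)
                continue
        new = Cluster(len(record))
        new.add(record)
        clusters.append(new)
        # Over the cap: enlarge the radius and merge until the cap holds.
        # Dissimilarities never exceed 1, so the loop ends once radius >= 1.
        while len(clusters) > max_clusters:
            radius *= growth
            clusters = merge_close(clusters, radius)
    return clusters, radius
```

Under these assumptions, calling `incremental_cluster(rows, max_clusters=k)` on an iterable of equal-length tuples of categorical values returns the cluster summaries together with the final radius. Only the frequency counters are kept in memory, which is what lets a sketch like this read the dataset once.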
Pages: 175 / +
Page count: 2
Related papers
50 in total
  • [41] Incremental Clustering Algorithm for Earth Science Data Mining
    Vatsavai, Ranga Raju
    COMPUTATIONAL SCIENCE - ICCS 2009, 2009, 5545 : 375 - 384
  • [42] Nonparametric incremental clustering: A moderate-grained algorithm
    Chen, Chunlei
    Mu, Dejun
    Zhang, Huixiang
    Hu, Wei
    Journal of Computational Information Systems, 2014, 10 (03): : 1183 - 1193
  • [43] A Fuzzy Density-based Incremental Clustering Algorithm
    Laohakiat, Sirisup
    Ratanajaipan, Photchanan
    Navaravong, Leenhapat
    Ungrangsi, Rachanee
    Maleewong, Krissada
    2018 15TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (JCSSE), 2018, : 211 - 215
  • [45] An incremental irregular grid algorithm for clustering data streams
    College of Computer Science and Technology, Harbin Engineering University, Harbin 150001, China
    Harbin Gongcheng Daxue Xuebao, 2008, 8 (846-850):
  • [46] An incremental clustering algorithm based on swarm intelligence theory
    Chen, Z
    Meng, QC
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1768 - 1772
  • [47] ADIC: an anomaly detection algorithm using incremental clustering
    Ren, Fei
    Hu, Liang
    Zhao, Kuo
    Liang, Hao
    Ren, Weiwu
    Journal of Information and Computational Science, 2009, 6 (02): : 1051 - 1057
  • [48] An incremental clustering algorithm based on compact sets with radius α
    Pons-Porrata, A
    Díaz, GS
    Cortés, ML
    Ramírez, LA
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS, 2005, 3773 : 518 - 527
  • [49] PBIRCH: A scalable parallel clustering algorithm for incremental data
    Garg, Ashwani
    Mangla, Ashish
    Gupta, Neelima
    Bhatnagar, Vasudha
    10TH INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS, 2006, : 315 - +
  • [50] Incremental grid density-based clustering algorithm
    Chen, Ning
    Chen, An
    Zhou, Long-Xiang
    Ruan Jian Xue Bao/Journal of Software, 2002, 13 (01): : 1 - 7