A Fast Incremental Clustering Algorithm

Cited by: 0
Authors
Su, Xiaoke [1]
Lan, Yang [2]
Wan, Renxia [1]
Qin, Yuming [1]
Affiliations
[1] Donghua Univ, Coll Informat Sci & Technol, Shanghai 201620, Peoples R China
[2] Xinyang Normal Univ, Sch Comp & Informat Technol, Xinyang 464000, Henan, Peoples R China
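Funding
National Natural Science Foundation of China
Keywords
incremental clustering; categorical data; radius threshold value; inter-cluster dissimilarity measure; clustering accuracy; data mining
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract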
Clustering plays an important role in data mining. This paper proposes a fast incremental clustering algorithm that adjusts the radius threshold value dynamically. The algorithm restricts the number of final clusters and reads the original dataset only once. It also introduces an inter-cluster dissimilarity measure that takes the frequency of attribute values into account, so it can be applied to categorical data. Experimental results on the mushroom dataset show that the proposed algorithm is feasible and effective. It can also be used on large-scale datasets.
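The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea it describes: a single pass over categorical records, a frequency-based dissimilarity between cluster summaries, and a radius threshold that is raised whenever the cluster count exceeds a limit. All names (`Cluster`, `dissimilarity`, `merge_close`, `incremental_cluster`), the specific dissimilarity formula, and the additive `step` rule for growing the radius are assumptions made for illustration, not the authors' definitions.

```python
from collections import Counter


class Cluster:
    """Summary of a cluster: its size plus per-attribute value counts."""

    def __init__(self, n_attrs):
        self.size = 0
        self.counts = [Counter() for _ in range(n_attrs)]

    def add(self, record):
        self.size += 1
        for counter, value in zip(self.counts, record):
            counter[value] += 1

    def merge(self, other):
        self.size += other.size
        for mine, theirs in zip(self.counts, other.counts):
            mine.update(theirs)


def dissimilarity(a, b):
    """Assumed measure in [0, 1]: per attribute, one minus the sum over
    shared values of freq_a(v) * freq_b(v), averaged over attributes."""
    total = 0.0
    for ca, cb in zip(a.counts, b.counts):
        overlap = sum(n * cb.get(v, 0) for v, n in ca.items())
        total += 1.0 - overlap / (a.size * b.size)
    return total / len(a.counts)


def merge_close(clusters, radius):
    """Greedily merge clusters that lie within `radius` of an earlier one."""
    merged = []
    for c in clusters:
        for m in merged:
            if dissimilarity(c, m) <= radius:
                m.merge(c)
                break
        else:
            merged.append(c)
    return merged


def incremental_cluster(records, radius, max_clusters, step=0.1):
    """Single pass over `records`; the radius grows (assumed additive rule)
    until at most `max_clusters` clusters remain."""
    if max_clusters < 1 or step <= 0:
        raise ValueError("max_clusters must be >= 1 and step must be > 0")
    clusters = []
    for record in records:
        point = Cluster(len(record))
        point.add(record)
        if clusters:
            nearest = min(clusters, key=lambda c: dissimilarity(point, c))
            if dissimilarity(point, nearest) <= radius:
                nearest.merge(point)
                continue
        clusters.append(point)
        # Too many clusters: enlarge the radius and merge close clusters.
        while len(clusters) > max_clusters:
            radius = min(1.0, radius + step)
            clusters = merge_close(clusters, radius)
    return clusters, radius


records = [("a", "x"), ("a", "x"), ("b", "y"), ("b", "y"), ("a", "x")]
clusters, final_radius = incremental_cluster(records, radius=0.3, max_clusters=2)
print(sorted(c.size for c in clusters), final_radius)
```

Each record is read once and only cluster summaries are kept in memory, which is what would make such a scheme practical for large datasets; the merge step trades some accuracy for the bound on the number of clusters.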
Pages: 175 / +
Page count: 2
Related papers
50 in total
  • [31] SIHC: A STABLE INCREMENTAL HIERARCHICAL CLUSTERING ALGORITHM
    Gurrutxaga, Ibai
    Arbelaitz, Olatz
    Martin, Jose I.
    Muguerza, Javier
    Perez, Jesus M.
    Perona, Inigo
    ICEIS 2009 : PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL AIDSS, 2009, : 300 - 304
  • [32] An Incremental Clustering Algorithm Based on Mahalanobis Distance
    Aik, Lim Eng
    Choon, Tan Wee
    INTERNATIONAL CONFERENCE ON QUANTITATIVE SCIENCES AND ITS APPLICATIONS (ICOQSIA 2014), 2014, 1635 : 788 - 793
  • [33] Automatic Topic Detection with an Incremental Clustering Algorithm
    Zhang, Xiaoming
    Li, Zhoujun
    WEB INFORMATION SYSTEMS AND MINING, 2010, 6318 : 344 - 351
  • [34] A Novel Fast Clustering Algorithm
    Li Xia
    Jiang Sheng-yi
    Su Xiao-ke
    2009 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, VOL IV, PROCEEDINGS, 2009, : 284 - +
  • [35] Improved fast partitional clustering algorithm for text clustering
    Bejos, Sebastian
    Feliciano-Avelino, Ivan
    Martinez-Trinidad, J. Fco.
    Carrasco-Ochoa, J. A.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 39 (02) : 2137 - 2145
  • [36] A fast algorithm for incremental principal component analysis
    Hwang, WS
    Zhang, YL
    Hwang, WS
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING, 2003, 2690 : 876 - 881
  • [37] A fast algorithm for unsupervised incremental speaker adaptation
    Schussler, M
    Gallwitz, F
    Harbeck, S
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1019 - 1022
  • [38] A fast growth distance algorithm for incremental motions
    Ong, CJ
    Huang, E
    Hong, SM
    IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, 2000, 16 (06) : 880 - 890
  • [39] A fast incremental algorithm for constructing concept lattices
    Zou, Ligeng
    Zhang, Zuping
    Long, Jun
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (09) : 4474 - 4481
  • [40] FIRLA: a Fast Incremental Record Linkage Algorithm
    Soliman, Ahmed
    Rajasekaran, Sanguthevar
    JOURNAL OF BIOMEDICAL INFORMATICS, 2022, 130