Clustering Categorical Data Using Hierarchies (CLUCDUH)

被引:0
|
作者
Silahtaroglu, Gökhan [1 ]
机构
[1] Beykent University, Department of Mathematics and Computing, Istanbul 34900, Turkey
关键词
Clustering; -; Gini; Pruning; Split; Tree;
D O I
暂无
中图分类号
学科分类号
摘要
Clustering large populations is an important problem when the data contain noise and different shapes. A good clustering algorithm or approach should be efficient enough to detect clusters sensitively. Besides space complexity, time complexity also gains importance as the size grows. Using hierarchies we developed a new algorithm to split attributes according to the values they have and choosing the dimension for splitting so as to divide the database roughly into equal parts as much as possible. At each node we calculate some certain descriptive statistical features of the data which reside and by pruning we generate the natural clusters with a complexity of O(n).
引用
收藏
页码:334 / 339
相关论文
共 50 条
  • [21] A Clustering Method for Categorical Ordinal Data
    Giordan, Marco
    Diana, Giancarlo
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2011, 40 (07) : 1315 - 1334
  • [22] Formulations of fuzzy clustering for categorical data
    Umayahara, Kazutaka
    Miyamoto, Sadaaki
    Nakamori, Yoshiteru
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2005, 1 (01): : 83 - 94
  • [23] HABOS clustering algorithm for categorical data
    Wu, Sen (wusen@manage.ustb.edu.cn), 2016, Science Press (38):
  • [24] Space Structure and Clustering of Categorical Data
    Qian, Yuhua
    Li, Feijiang
    Liang, Jiye
    Liu, Bing
    Dang, Chuangyin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 27 (10) : 2047 - 2059
  • [25] Conceptual clustering categorical data with uncertainty
    Xia, Yuni
    Xi, Bowei
    19TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, VOL I, PROCEEDINGS, 2007, : 329 - +
  • [26] Clustering Categorical Data Based on Representatives
    Aranganayagi, S.
    Thangavel, K.
    THIRD 2008 INTERNATIONAL CONFERENCE ON CONVERGENCE AND HYBRID INFORMATION TECHNOLOGY, VOL 1, PROCEEDINGS, 2008, : 599 - +
  • [27] Fuzzy rough clustering for categorical data
    Xu, Shuliang
    Liu, Shenglan
    Zhou, Jian
    Feng, Lin
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2019, 10 (11) : 3213 - 3223
  • [28] Clustering categorical data in projected spaces
    Mohamed Bouguessa
    Data Mining and Knowledge Discovery, 2015, 29 : 3 - 38
  • [29] Fuzzy rough clustering for categorical data
    Shuliang Xu
    Shenglan Liu
    Jian Zhou
    Lin Feng
    International Journal of Machine Learning and Cybernetics, 2019, 10 : 3213 - 3223
  • [30] Fuzzy clustering for categorical multivariate data
    Oh, CH
    Honda, K
    Ichihashi, H
    JOINT 9TH IFSA WORLD CONGRESS AND 20TH NAFIPS INTERNATIONAL CONFERENCE, PROCEEDINGS, VOLS. 1-5, 2001, : 2154 - 2159