Clustering Categorical Data Using Hierarchies (CLUCDUH)

Author
Silahtaroglu, Gökhan [1]
Affiliation
[1] Beykent University, Department of Mathematics and Computing, Istanbul 34900, Turkey
Keywords
Clustering; Gini; Pruning; Split; Tree
DOI
Not available
Abstract
Clustering large populations is an important problem when the data contain noise and clusters of different shapes. A good clustering algorithm should be efficient and sensitive enough to detect such clusters. Besides space complexity, time complexity becomes more important as the data size grows. Using hierarchies, we developed a new algorithm that splits attributes according to the values they take, choosing the dimension for each split so as to divide the database into parts that are as equal as possible. At each node we calculate certain descriptive statistical features of the data that reside there, and by pruning we generate the natural clusters with a complexity of O(n).
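The splitting step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact criterion, so this example assumes that the Gini index of an attribute's value distribution (suggested only by the keyword list) measures how evenly that attribute divides the records. The function names `gini_balance`, `choose_split_attribute`, and `split` are hypothetical and are not taken from the paper; pruning and the per-node statistics are not shown.

```python
from collections import Counter


def gini_balance(values):
    """Gini index 1 - sum(p_i^2) of a categorical value distribution.

    Assumption: a higher value means the attribute divides the records
    more evenly. The paper's actual criterion may differ.
    """
    n = len(values)
    counts = Counter(values)
    return 1.0 - sum((c / n) ** 2 for c in counts.values())


def choose_split_attribute(records, attributes):
    """Pick the attribute whose values divide the records most evenly."""
    return max(attributes, key=lambda a: gini_balance([r[a] for r in records]))


def split(records, attribute):
    """Partition the records into one child node per attribute value."""
    parts = {}
    for r in records:
        parts.setdefault(r[attribute], []).append(r)
    return parts


# Toy data (illustrative only, not from the paper).
records = [
    {"color": "red", "size": "S"},
    {"color": "red", "size": "M"},
    {"color": "red", "size": "S"},
    {"color": "blue", "size": "M"},
]
best = choose_split_attribute(records, ["color", "size"])
children = split(records, best)
```

In this toy data, `size` splits the four records 2/2 while `color` splits them 3/1, so the sketch selects `size` as the more balanced dimension.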
Pages: 334-339