Clustering Categorical Data Using Hierarchies (CLUCDUH)

被引:0
|
作者
Silahtaroglu, Gökhan [1 ]
机构
[1] Beykent University, Department of Mathematics and Computing, Istanbul 34900, Turkey
关键词
Clustering; -; Gini; Pruning; Split; Tree;
D O I
暂无
中图分类号
学科分类号
摘要
Clustering large populations is an important problem when the data contain noise and different shapes. A good clustering algorithm or approach should be efficient enough to detect clusters sensitively. Besides space complexity, time complexity also gains importance as the size grows. Using hierarchies we developed a new algorithm to split attributes according to the values they have and choosing the dimension for splitting so as to divide the database roughly into equal parts as much as possible. At each node we calculate some certain descriptive statistical features of the data which reside and by pruning we generate the natural clusters with a complexity of O(n).
引用
收藏
页码:334 / 339
相关论文
共 50 条
  • [1] Incremental Clustering for Categorical Data Using Clustering Ensemble
    Li Taoying
    Chne Yan
    Qu Lili
    Mu Xiangwei
    PROCEEDINGS OF THE 29TH CHINESE CONTROL CONFERENCE, 2010, : 2519 - 2524
  • [2] Clustering categorical data using coverage density
    Yan, H
    Zhang, L
    Zhang, Y
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2005, 3584 : 248 - 255
  • [3] Clustering Categorical Data Using Community Detection Techniques
    Huu Hiep Nguyen
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2017, 2017
  • [4] Fuzzy clustering of categorical data using fuzzy centroids
    Kim, DW
    Lee, KH
    Lee, D
    PATTERN RECOGNITION LETTERS, 2004, 25 (11) : 1263 - 1271
  • [5] Clustering Categorical Data Using an Extended Modularity Measure
    Labiod, Lazhar
    Grozavu, Nistor
    Bennani, Younes
    NEURAL INFORMATION PROCESSING: MODELS AND APPLICATIONS, PT II, 2010, 6444 : 310 - 320
  • [6] Clustering Categorical Data Using Rough Membership Function
    Kumar, B. Suresh
    Reddy, H. Venkateswara
    Raju, T. Ankamma
    Vennam, Preethi
    2014 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS, 2014, : 602 - 607
  • [7] Categorical data visualization and clustering using subjective factors
    Chang, CH
    Ding, ZK
    DATA & KNOWLEDGE ENGINEERING, 2005, 53 (03) : 243 - 262
  • [8] Categorical data visualization and clustering using subjective factors
    Chang, CH
    Ding, ZK
    DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2004, 3181 : 229 - 238
  • [9] On data labeling for clustering categorical data
    Chen, Hung-Leng
    Chuang, Kun-Ta
    Chen, Ming-Syan
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2008, 20 (11) : 1458 - 1471
  • [10] Clustering categorical data streams
    He, Zengyou
    Xu, Xiaofei
    Deng, Shengchun
    Huang, Joshua Zhexue
    JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2011, 11 (04) : 185 - 192