A Global K-modes Algorithm for Clustering Categorical Data

Cited by: 0
Authors
Bai Tian [1, 2]
Kulikowski, C. A. [2]
Gong Leiguang [3]
Yang Bin [1]
Huang Lan [1]
Zhou Chunguang [1]
Affiliations
[1] Jilin Univ, Coll Comp Sci & Technol, Changchun 130012, Peoples R China
[2] Rutgers State Univ, Dept Comp Sci, New Brunswick, NJ 08903 USA
[3] IBM Thomas J Watson Res Ctr, Hawthorne, NJ USA
Source
CHINESE JOURNAL OF ELECTRONICS | 2012 / Vol. 21 / No. 03
Funding
National Natural Science Foundation of China;
Keywords
Categorical data; Clustering; Data mining; K-modes algorithm;
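DOI
Not available
Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code
0808; 0809;
Abstract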
In this paper, a new Global k-modes (GKM) algorithm is proposed for clustering categorical data. The new method randomly selects a sufficiently large number of initial modes to account for the global distribution of the data set, and then progressively eliminates the redundant modes using an iterative optimization process with an elimination criterion function. Systematic experiments were carried out with data from the UCI Machine Learning Repository. The results and a comparative evaluation show that the proposed method performs well and consistently, achieving a significant improvement in clustering accuracy over other well-known k-modes-type algorithms.
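The over-initialize-then-eliminate idea in the abstract can be illustrated with a minimal sketch. The abstract does not state the elimination criterion function, so the rule used here (drop the mode whose cluster has the fewest members) is only an assumed stand-in, not the paper's criterion. The function names (`hamming`, `k_modes`, `global_k_modes_sketch`) and the parameters `n_init` and `max_iter` are likewise hypothetical.

```python
import random
from collections import Counter


def hamming(a, b):
    """Simple matching dissimilarity: count of attributes that differ."""
    return sum(x != y for x, y in zip(a, b))


def assign(data, modes):
    """Label each record with the index of its nearest mode (first wins ties)."""
    return [min(range(len(modes)), key=lambda j: hamming(row, modes[j]))
            for row in data]


def update_modes(data, labels, modes):
    """Recompute each mode as the most frequent value of every attribute."""
    new_modes = []
    for j, old in enumerate(modes):
        members = [row for row, lab in zip(data, labels) if lab == j]
        if not members:
            new_modes.append(old)  # an empty cluster keeps its previous mode
            continue
        new_modes.append(tuple(Counter(col).most_common(1)[0][0]
                               for col in zip(*members)))
    return new_modes


def k_modes(data, modes, max_iter=20):
    """Standard k-modes iterations starting from the given modes."""
    for _ in range(max_iter):
        new_modes = update_modes(data, assign(data, modes), modes)
        if new_modes == modes:
            break
        modes = new_modes
    return modes, assign(data, modes)


def global_k_modes_sketch(data, k, n_init, seed=0):
    """Start with n_init random modes, then eliminate modes until k remain."""
    rng = random.Random(seed)
    distinct = sorted(set(data))
    # Over-initialize: draw many distinct records as the starting modes.
    modes = rng.sample(distinct, min(n_init, len(distinct)))
    modes, labels = k_modes(data, modes)
    while len(modes) > k:
        sizes = Counter(labels)
        # ASSUMPTION: the smallest cluster's mode is treated as redundant.
        drop = min(range(len(modes)), key=lambda j: sizes.get(j, 0))
        modes = modes[:drop] + modes[drop + 1:]
        modes, labels = k_modes(data, modes)  # re-optimize after each removal
    return modes, labels


if __name__ == "__main__":
    records = ([("a", "x", "p")] * 5 + [("b", "y", "q")] * 5
               + [("a", "x", "q"), ("b", "y", "p")])
    final_modes, final_labels = global_k_modes_sketch(records, k=2, n_init=4)
    print(final_modes, final_labels)
```

Records are passed as tuples of categorical values. Re-running k-modes after each removal keeps the remaining modes locally optimal before the next elimination step, which is one plausible reading of the "iterative optimization process" the abstract mentions.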
Pages: 460 - 465
Number of pages: 6
Related papers
50 in total
  • [21] CLEKMODES: a modified k-modes clustering algorithm
    Mastrogiannis, N.
    Giannikos, I.
    Boutsinas, B.
    Antzoulatos, G.
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2009, 60 (08) : 1085 - 1095
  • [22] Block Fuzzy K-modes Clustering Algorithm
    Yang, Miin-Shen
    Lin, Chih-Ying
    2009 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, 2009, : 384 - 389
  • [23] K-modes clustering
    Chaturvedi, A
    Green, PE
    Carroll, JD
    JOURNAL OF CLASSIFICATION, 2001, 18 (01) : 35 - 55
  • [24] K-modes Clustering
    Anil Chaturvedi
    Paul E. Green
    J. Douglas Carroll
    Journal of Classification, 2001, 18 : 35 - 55
  • [25] Novel dynamic k-modes clustering of categorical and non categorical dataset with optimized genetic algorithm based feature selection
    Suryanarayana, G.
    Prakash, L. N. C. K.
    Mahesh, P. C. Senthil
    Bhaskar, T.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (17) : 24399 - 24418
  • [26] Novel dynamic k-modes clustering of categorical and non categorical dataset with optimized genetic algorithm based feature selection
    G. Suryanarayana
    LNC Prakash K
    P. C. Senthil Mahesh
    T. Bhaskar
    Multimedia Tools and Applications, 2022, 81 : 24399 - 24418
  • [27] Categorical fuzzy k-modes clustering with automated feature weight learning
    Saha, Arkajyoti
    Das, Swagatam
    NEUROCOMPUTING, 2015, 166 : 422 - 435
  • [28] Rough Set Based Fuzzy K-Modes for Categorical Data
    Saha, Indrajit
    Sarkar, Jnanendra Prasad
    Maulik, Ujjwal
    SWARM, EVOLUTIONARY, AND MEMETIC COMPUTING, (SEMCCO 2012), 2012, 7677 : 323 - 330
  • [29] On the impact of dissimilarity measure in k-modes clustering algorithm
    Ng, Michael K.
    Li, Mark Junjie
    Huang, Joshua Zhexue
    He, Zengyou
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (03) : 503 - 507
  • [30] Cluster center initialization algorithm for K-modes clustering
    Khan, Shehroz S.
    Ahmad, Amir
    EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (18) : 7444 - 7456