An Efficient Dimension Reduction Technique for Basic K-Means Clustering Algorithm

被引:0
|
作者
Usman, Dauda [1 ]
Mohamad, Ismail [1 ]
机构
[1] Univ Teknol Malaysia, Fac Sci, Dept Math Sci, Johor Baharu 81310, Johor Darul Taa, Malaysia
关键词
Decimal Scaling; K-Means Clustering; Min-Max; Principal Component Analysis; Standardization; z-score;
D O I
暂无
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
K-means clustering is being widely studied problem in a variety of application domains. The computational complexity of the basic k-means is very high, the number of distance calculations also increases with the increase of the dimensionality of the data. Several algorithms have been proposed to improve the performance of the basic k-means. Here we investigate the behavior of the basic k-means clustering algorithm and two alternatives to it, we have analyzed the performances of three different standardization methods. Equivalently, we prove that z-score and principal components are the best preprocessing methods that will simplify the analysis and visualize the multidimensional dataset. The analyzed result revealed that the z-score outperform min-max and decimal scaling also principal component analysis picks up the dimensions with the largest variances. Our results also provide effective ways to solve the k-means clustering problems.
引用
收藏
页码:253 / 267
页数:15
相关论文
共 50 条
  • [31] The global k-means clustering algorithm
    Likas, A
    Vlassis, N
    Verbeek, JJ
    PATTERN RECOGNITION, 2003, 36 (02) : 451 - 461
  • [32] Improved K-means clustering algorithm
    Zhang, Zhe
    Zhang, Junxi
    Xue, Huifeng
    CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 5, PROCEEDINGS, 2008, : 169 - 172
  • [33] A k-means based clustering algorithm
    Bloisi, Domenico Daniele
    Locchi, Luca
    COMPUTER VISION SYSTEMS, PROCEEDINGS, 2008, 5008 : 109 - 118
  • [34] An improved K-means clustering algorithm
    Huang, Xiuchang
    Su, Wei
    Journal of Networks, 2014, 9 (01) : 161 - 167
  • [35] An Enhancement of K-means Clustering Algorithm
    Gu, Jirong
    Zhou, Jieming
    Chen, Xianwei
    2009 INTERNATIONAL CONFERENCE ON BUSINESS INTELLIGENCE AND FINANCIAL ENGINEERING, PROCEEDINGS, 2009, : 237 - 240
  • [36] Adaptive K-Means clustering algorithm
    Chen, Hailin
    Wu, Xiuqing
    Hu, Junhua
    MIPPR 2007: PATTERN RECOGNITION AND COMPUTER VISION, 2007, 6788
  • [37] Improved Algorithm for the k-means Clustering
    Zhang, Sheng
    Wang, Shouqiang
    PROCEEDINGS OF THE 10TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA 2012), 2012, : 4717 - 4720
  • [38] k*-means:: A new generalized k-means clustering algorithm
    Cheung, YM
    PATTERN RECOGNITION LETTERS, 2003, 24 (15) : 2883 - 2893
  • [39] A fast K-Means clustering algorithm based on grid data reduction
    Li, Daqi
    Shen, Junyi
    Chen, Hongmin
    2008 IEEE AEROSPACE CONFERENCE, VOLS 1-9, 2008, : 2273 - +
  • [40] An evolutionary technique based on K-Means algorithm for optimal clustering in RN
    Bandyopadhyay, S
    Maulik, U
    INFORMATION SCIENCES, 2002, 146 (1-4) : 221 - 237