An Efficient Dimension Reduction Technique for Basic K-Means Clustering Algorithm

被引:0
|
作者
Usman, Dauda [1 ]
Mohamad, Ismail [1 ]
机构
[1] Univ Teknol Malaysia, Fac Sci, Dept Math Sci, Johor Baharu 81310, Johor Darul Taa, Malaysia
关键词
Decimal Scaling; K-Means Clustering; Min-Max; Principal Component Analysis; Standardization; z-score;
D O I
暂无
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
K-means clustering is being widely studied problem in a variety of application domains. The computational complexity of the basic k-means is very high, the number of distance calculations also increases with the increase of the dimensionality of the data. Several algorithms have been proposed to improve the performance of the basic k-means. Here we investigate the behavior of the basic k-means clustering algorithm and two alternatives to it, we have analyzed the performances of three different standardization methods. Equivalently, we prove that z-score and principal components are the best preprocessing methods that will simplify the analysis and visualize the multidimensional dataset. The analyzed result revealed that the z-score outperform min-max and decimal scaling also principal component analysis picks up the dimensions with the largest variances. Our results also provide effective ways to solve the k-means clustering problems.
引用
收藏
页码:253 / 267
页数:15
相关论文
共 50 条
  • [1] K*-Means: An Effective and Efficient K-means Clustering Algorithm
    Qi, Jianpeng
    Yu, Yanwei
    Wang, Lihong
    Liu, Jinglei
    PROCEEDINGS OF 2016 IEEE INTERNATIONAL CONFERENCES ON BIG DATA AND CLOUD COMPUTING (BDCLOUD 2016) SOCIAL COMPUTING AND NETWORKING (SOCIALCOM 2016) SUSTAINABLE COMPUTING AND COMMUNICATIONS (SUSTAINCOM 2016) (BDCLOUD-SOCIALCOM-SUSTAINCOM 2016), 2016, : 242 - 249
  • [2] A time efficient pattern reduction algorithm for k-means based clustering
    Tsai, Chun-Wei
    Yang, Chu-Sing
    Chiang, Ming-Chao
    2007 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-8, 2007, : 209 - +
  • [3] A time-efficient pattern reduction algorithm for k-means clustering
    Chiang, Ming-Chao
    Tsai, Chun-Wei
    Yang, Chu-Sing
    INFORMATION SCIENCES, 2011, 181 (04) : 716 - 731
  • [4] Efficient enhanced k-means clustering algorithm
    Fahim A.M.
    Salem A.M.
    Torkey F.A.
    Ramadan M.A.
    Journal of Zhejiang University-SCIENCE A, 2006, 7 (10): : 1626 - 1633
  • [5] An efficient enhanced k-means clustering algorithm
    FAHIM A.M
    SALEM A.M
    TORKEY F.A
    RAMADAN M.A
    Journal of Zhejiang University Science A(Science in Engineering), 2006, (10) : 1626 - 1633
  • [6] A more efficient algorithm for K-means clustering
    Wang, Shouqiang
    Zhu, Daming
    Journal of Computational Information Systems, 2007, 3 (05): : 1951 - 1956
  • [7] An Efficient Global K-means Clustering Algorithm
    Xie, Juanying
    Jiang, Shuai
    Xie, Weixin
    Gao, Xinbo
    JOURNAL OF COMPUTERS, 2011, 6 (02) : 271 - 279
  • [8] Far Efficient K-Means Clustering Algorithm
    Mishra, Bikram Keshari
    Nayak, Nihar Ranjan
    Rath, Amiya
    Swain, Sagarika
    PROCEEDINGS OF THE 2012 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI'12), 2012, : 106 - 110
  • [9] An Efficient K-means Clustering Algorithm on MapReduce
    Li, Qiuhong
    Wang, Peng
    Wang, Wei
    Hu, Hao
    Li, Zhongsheng
    Li, Junxian
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2014, PT I, 2014, 8421 : 357 - 371
  • [10] Research on k-means Clustering Algorithm An Improved k-means Clustering Algorithm
    Shi Na
    Liu Xumin
    Guan Yong
    2010 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY AND SECURITY INFORMATICS (IITSI 2010), 2010, : 63 - 67