K-means - a fast and efficient K-means algorithms

被引:0
|
作者
Nguyen C.D. [1 ]
Duong T.H. [2 ]
机构
[1] Faculty of Information Technology, Ton Duc Thang University, HoChiMinh City
[2] Institute of Science and Technology of Industry 4.0, Nguyen Tat Thanh University, HoChiMinh City
关键词
Data clustering; Data mining; IKM; Incremental K-means; K-means; K-means++;
D O I
10.1504/IJIIDS.2018.10012685
中图分类号
学科分类号
摘要
K-means often converges to a local optimum. In improved versions of K-means, k-means++ is well-known for achieving a rather optimum solution with its cluster initialisation strategy and high computational efficiency. Incremental K-means is recognised for its converging to the empirically global optimum but having a high complexity due to its stepping of the number of clusters K. The paper introduces K-means** with a doubling strategy on K. Additional techniques, including only doubling big enough clusters, stepping K for the last few values and searching on other candidates for the last K, are used to help K-means** have a complexity of O(K logK), which is lower than the complexity of incremental K-means, and still converge to empirically global optimum. On a set of synthesis and real datasets, K-means** archive the minimum results in almost of test cases. K-means** is much faster than incremental K-means and comparable with the speed of k-means++. Copyright © 2018 Inderscience Enterprises Ltd.
引用
收藏
页码:27 / 45
页数:18
相关论文
共 50 条
  • [31] Faster Algorithms for the Constrained k-means Problem
    Anup Bhattacharya
    Ragesh Jaiswal
    Amit Kumar
    Theory of Computing Systems, 2018, 62 : 93 - 115
  • [32] The seeding algorithms for spherical k-means clustering
    Min Li
    Dachuan Xu
    Dongmei Zhang
    Juan Zou
    Journal of Global Optimization, 2020, 76 : 695 - 708
  • [33] Faster Algorithms for the Constrained k-means Problem
    Bhattacharya, Anup
    Jaiswal, Ragesh
    Kumar, Amit
    THEORY OF COMPUTING SYSTEMS, 2018, 62 (01) : 93 - 115
  • [34] Competitive K-means
    Esteves, Rui Maximo
    Hacker, Thomas
    Rong, Chunming
    2013 IEEE FIFTH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM), VOL 1, 2013, : 17 - 24
  • [35] Faster Algorithms for the Constrained k-Means Problem
    Bhattacharya, Anup
    Jaiswal, Ragesh
    Kumar, Amit
    33RD SYMPOSIUM ON THEORETICAL ASPECTS OF COMPUTER SCIENCE (STACS 2016), 2016, 47
  • [36] COMPRESSIVE K-MEANS
    Keriven, Nicolas
    Tremblay, Nicolas
    Traonmilin, Yann
    Gribonval, Remi
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 6369 - 6373
  • [37] k-means: A revisit
    Zhao, Wan-Lei
    Deng, Cheng-Hao
    Ngo, Chong-Wah
    NEUROCOMPUTING, 2018, 291 : 195 - 206
  • [38] The seeding algorithms for spherical k-means clustering
    Li, Min
    Xu, Dachuan
    Zhang, Dongmei
    Zou, Juan
    JOURNAL OF GLOBAL OPTIMIZATION, 2020, 76 (04) : 695 - 708
  • [39] Acceleration of K-means and related clustering algorithms
    Phillips, SJ
    ALGORITHM ENGINEERING AND EXPERIMENTS, 2002, 2409 : 166 - 177
  • [40] COMPARATIVE ANALYSIS OF K-MEANS AND DBSCAN ALGORITHMS
    Zurini, Madalina
    INTERNATIONAL CONFERENCE ON INFORMATICS IN ECONOMY, 2013, : 646 - 651