K-means - a fast and efficient K-means algorithms

被引:0
|
作者
Nguyen C.D. [1 ]
Duong T.H. [2 ]
机构
[1] Faculty of Information Technology, Ton Duc Thang University, HoChiMinh City
[2] Institute of Science and Technology of Industry 4.0, Nguyen Tat Thanh University, HoChiMinh City
关键词
Data clustering; Data mining; IKM; Incremental K-means; K-means; K-means++;
D O I
10.1504/IJIIDS.2018.10012685
中图分类号
学科分类号
摘要
K-means often converges to a local optimum. In improved versions of K-means, k-means++ is well-known for achieving a rather optimum solution with its cluster initialisation strategy and high computational efficiency. Incremental K-means is recognised for its converging to the empirically global optimum but having a high complexity due to its stepping of the number of clusters K. The paper introduces K-means** with a doubling strategy on K. Additional techniques, including only doubling big enough clusters, stepping K for the last few values and searching on other candidates for the last K, are used to help K-means** have a complexity of O(K logK), which is lower than the complexity of incremental K-means, and still converge to empirically global optimum. On a set of synthesis and real datasets, K-means** archive the minimum results in almost of test cases. K-means** is much faster than incremental K-means and comparable with the speed of k-means++. Copyright © 2018 Inderscience Enterprises Ltd.
引用
收藏
页码:27 / 45
页数:18
相关论文
共 50 条
  • [1] Fast k-means algorithms with constant approximation
    Song, MJ
    Rajasekaran, S
    ALGORITHMS AND COMPUTATION, 2005, 3827 : 1029 - 1038
  • [2] Empirical Evaluation of K-Means, Bisecting K-Means, Fuzzy C-Means and Genetic K-Means Clustering Algorithms
    Banerjee, Shreya
    Choudhary, Ankit
    Pal, Somnath
    2015 IEEE INTERNATIONAL WIE CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (WIECON-ECE), 2015, : 172 - 176
  • [3] A Modified K-means Algorithms - Bi-Level K-Means Algorithm
    Yu, Shyr-Shen
    Chu, Shao-Wei
    Wang, Ching-Lin
    Chan, Yung-Kuan
    Chuang, Chia-Yi
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON SOFT COMPUTING IN INFORMATION COMMUNICATION TECHNOLOGY, 2014, : 10 - 13
  • [4] How fast is k-means?
    Dasgupta, S
    LEARNING THEORY AND KERNEL MACHINES, 2003, 2777 : 735 - 735
  • [5] Efficient k-Means on GPUs
    Lutz, Clemens
    Bress, Sebastian
    Rabl, Tilmann
    Zeuch, Steffen
    Markl, Volker
    14TH INTERNATIONAL WORKSHOP ON DATA MANAGEMENT ON NEW HARDWARE (DAMON 2018), 2018,
  • [6] K*-Means: An Effective and Efficient K-means Clustering Algorithm
    Qi, Jianpeng
    Yu, Yanwei
    Wang, Lihong
    Liu, Jinglei
    PROCEEDINGS OF 2016 IEEE INTERNATIONAL CONFERENCES ON BIG DATA AND CLOUD COMPUTING (BDCLOUD 2016) SOCIAL COMPUTING AND NETWORKING (SOCIALCOM 2016) SUSTAINABLE COMPUTING AND COMMUNICATIONS (SUSTAINCOM 2016) (BDCLOUD-SOCIALCOM-SUSTAINCOM 2016), 2016, : 242 - 249
  • [7] K-means algorithms for functional data
    Lopez Garcia, Maria Luz
    Garcia-Rodenas, Ricardo
    Gonzalez Gomez, Antonia
    NEUROCOMPUTING, 2015, 151 : 231 - 245
  • [8] A note on constrained k-means algorithms
    Ng, MK
    PATTERN RECOGNITION, 2000, 33 (03) : 515 - 519
  • [9] A Comparative Study of K-Means, K-Means plus plus and Fuzzy C-Means Clustering Algorithms
    Kapoor, Akanksha
    Singhal, Abhishek
    2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE & COMMUNICATION TECHNOLOGY (CICT), 2017,
  • [10] Comparative Study of K-Means, Pam and Rough K-Means Algorithms Using Cancer Datasets
    Kumar, Parvesh
    Wasan, Krishan
    COMPUTING, COMMUNICATION, AND CONTROL, 2011, 1 : 136 - 140