K-means - a fast and efficient K-means algorithms

被引：0

作者：

Nguyen C.D. ^{[1
]}

Duong T.H. ^{[2
]}

机构：

[1] Faculty of Information Technology, Ton Duc Thang University, HoChiMinh City

[2] Institute of Science and Technology of Industry 4.0, Nguyen Tat Thanh University, HoChiMinh City

来源：

Nguyen, Cuong Duc (nguyenduccuong@tdt.edu.vn) | 2018年 / Inderscience Publishers, 29, route de Pre-Bois, Case Postale 856, CH-1215 Geneva 15, CH-1215, Switzerland卷 / 11期

关键词：

Data clustering; Data mining; IKM; Incremental K-means; K-means; K-means++;

D O I：

10.1504/IJIIDS.2018.10012685

中图分类号：

学科分类号：

摘要：

K-means often converges to a local optimum. In improved versions of K-means, k-means++ is well-known for achieving a rather optimum solution with its cluster initialisation strategy and high computational efficiency. Incremental K-means is recognised for its converging to the empirically global optimum but having a high complexity due to its stepping of the number of clusters K. The paper introduces K-means** with a doubling strategy on K. Additional techniques, including only doubling big enough clusters, stepping K for the last few values and searching on other candidates for the last K, are used to help K-means** have a complexity of O(K logK), which is lower than the complexity of incremental K-means, and still converge to empirically global optimum. On a set of synthesis and real datasets, K-means** archive the minimum results in almost of test cases. K-means** is much faster than incremental K-means and comparable with the speed of k-means++. Copyright © 2018 Inderscience Enterprises Ltd.

引用

页码：27 / 45

页数：18

共 50 条

[1] Fast k-means algorithms with constant approximation
Song, MJ
Rajasekaran, S
ALGORITHMS AND COMPUTATION, 2005, 3827 : 1029 - 1038
[2] Empirical Evaluation of K-Means, Bisecting K-Means, Fuzzy C-Means and Genetic K-Means Clustering Algorithms
Banerjee, Shreya
Choudhary, Ankit
Pal, Somnath
2015 IEEE INTERNATIONAL WIE CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (WIECON-ECE), 2015, : 172 - 176
[3] A Modified K-means Algorithms - Bi-Level K-Means Algorithm
Yu, Shyr-Shen
Chu, Shao-Wei
Wang, Ching-Lin
Chan, Yung-Kuan
Chuang, Chia-Yi
PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON SOFT COMPUTING IN INFORMATION COMMUNICATION TECHNOLOGY, 2014, : 10 - 13
[4] How fast is k-means?
Dasgupta, S
LEARNING THEORY AND KERNEL MACHINES, 2003, 2777 : 735 - 735
[5] Efficient k-Means on GPUs
Lutz, Clemens
Bress, Sebastian
Rabl, Tilmann
Zeuch, Steffen
Markl, Volker
14TH INTERNATIONAL WORKSHOP ON DATA MANAGEMENT ON NEW HARDWARE (DAMON 2018), 2018,
[6] K*-Means: An Effective and Efficient K-means Clustering Algorithm
Qi, Jianpeng
Yu, Yanwei
Wang, Lihong
Liu, Jinglei
PROCEEDINGS OF 2016 IEEE INTERNATIONAL CONFERENCES ON BIG DATA AND CLOUD COMPUTING (BDCLOUD 2016) SOCIAL COMPUTING AND NETWORKING (SOCIALCOM 2016) SUSTAINABLE COMPUTING AND COMMUNICATIONS (SUSTAINCOM 2016) (BDCLOUD-SOCIALCOM-SUSTAINCOM 2016), 2016, : 242 - 249
[7] K-means algorithms for functional data
Lopez Garcia, Maria Luz
Garcia-Rodenas, Ricardo
Gonzalez Gomez, Antonia
NEUROCOMPUTING, 2015, 151 : 231 - 245
[8] A note on constrained k-means algorithms
Ng, MK
PATTERN RECOGNITION, 2000, 33 (03) : 515 - 519
[9] A Comparative Study of K-Means, K-Means plus plus and Fuzzy C-Means Clustering Algorithms
Kapoor, Akanksha
Singhal, Abhishek
2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE & COMMUNICATION TECHNOLOGY (CICT), 2017,
[10] Comparative Study of K-Means, Pam and Rough K-Means Algorithms Using Cancer Datasets
Kumar, Parvesh
Wasan, Krishan
COMPUTING, COMMUNICATION, AND CONTROL, 2011, 1 : 136 - 140

← 1 2 3 4 5 →