A K-means Clustering with Optimized Initial Center Based on Hadoop Platform

被引：0

作者：

Lin, Kunhui ^{[1
]}

Li, Xiang ^{[1
]}

Zhang, Zhongnan ^{[1
]}

Chen, Jiahong ^{[1
]}

机构：

[1] Xiamen Univ, Software Sch, Xiamen, Peoples R China

来源：

2014 PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION (ICCSE 2014) | 2014年

关键词：

MapReduce; K-means clustering; Initial center; Density; MAPREDUCE;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

With the explosive growth of data, the traditional clustering algorithms running on separate servers can not meet the demand. To solve the problem, more and more researchers implement the traditional clustering algorithms on the cloud computing platforms, especially for K-means clustering. But, few researchers pay attention to the K-means clustering structure, and most of researchers optimized the model of the cloud computing platform to raise the computing speed of K-means clustering. However the problem of instability caused by the random initial centers still exists. In this paper, we propose a K-means clustering algorithm with optimized initial centers based on data dimensional density. This method avoids the deficiency of the random initial centers and improves the stability of the Kmeans clustering. The experimental results show that the approach achieves a good performance on K-means, and improves the accuracy of K-means clustering on the test set.

引用

页码：263 / 266

页数：4

共 50 条

[41] Research on selecting initial points for k-means clustering
Wang, Shou-Qiang
Zhu, Da-Ming
PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 2673 - 2677
[42] A Method for selecting initial centers of K-means clustering
Xiong, Zhibin
Mou, Jinjun
Du, Hongyan
BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2019, 125 : 147 - 148
[43] Research and Improve on K-means Algorithm Based on Hadoop
Wu, Kehe
Zeng, Wenjing
Wu, Tingting
An, Yanwen
PROCEEDINGS OF 2015 6TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE, 2015, : 334 - 337
[44] A k-means based clustering algorithm
Bloisi, Domenico Daniele
Locchi, Luca
COMPUTER VISION SYSTEMS, PROCEEDINGS, 2008, 5008 : 109 - 118
[45] Graph based k-means clustering
Galluccio, Laurent
Michel, Olivier
Comon, Pierre
Hero, Alfred O., III
SIGNAL PROCESSING, 2012, 92 (09) : 1970 - 1984
[46] Research on Improved K-Means Algorithm Based on Hadoop
Wei Xiaojing
Li Yuanbo
2017 4TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE), 2017, : 593 - 598
[47] An Improved Parallelization of K-means Algorithm based on HADOOP
Guo, Yizhuo
2018 INTERNATIONAL SYMPOSIUM ON POWER ELECTRONICS AND CONTROL ENGINEERING (ISPECE 2018), 2019, 1187
[48] Cluster center initialization algorithm for K-means clustering
Khan, SS
Ahmad, A
PATTERN RECOGNITION LETTERS, 2004, 25 (11) : 1293 - 1302
[49] Fuzzy K-Means Incremental Clustering Based on K-Center and Vector Quantization
Li, Taoying
Chen, Yan
JOURNAL OF COMPUTERS, 2010, 5 (11) : 1670 - 1677
[50] k-Means Clustering Algorithm and Its Simulation Based on Distributed Computing Platform
Wu, Chunqiong
Yan, Bingwen
Yu, Rongrui
Yu, Baoqin
Zhou, Xiukao
Yu, Yanliang
Chen, Na
COMPLEXITY, 2021, 2021

← 1 2 3 4 5 →