A K-means Clustering with Optimized Initial Center Based on Hadoop Platform

Cited by: 0
Authors
Lin, Kunhui [1]
Li, Xiang [1]
Zhang, Zhongnan [1]
Chen, Jiahong [1]
Affiliations
[1] Xiamen Univ, Software Sch, Xiamen, Peoples R China
Source
2014 PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION (ICCSE 2014) | 2014
Keywords
MapReduce; K-means clustering; Initial center; Density
DOI
Not available
Chinese Library Classification number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
With the explosive growth of data, traditional clustering algorithms running on standalone servers cannot meet demand. To solve this problem, more and more researchers implement traditional clustering algorithms, especially K-means clustering, on cloud computing platforms. However, few researchers pay attention to the structure of K-means clustering itself; most optimize the model of the cloud computing platform to raise the computing speed of K-means clustering, so the instability caused by random initial centers remains. In this paper, we propose a K-means clustering algorithm with optimized initial centers based on data dimensional density. This method avoids the deficiency of random initial centers and improves the stability of K-means clustering. The experimental results show that the approach performs well and improves the accuracy of K-means clustering on the test set.
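The abstract does not say how density is computed or how centers are chosen from it, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes a simple neighbor-count density and a hypothetical `radius` parameter: the densest points that lie farther than `radius` from each other become the initial centers, which makes the starting point deterministic. The names `density_init` and `kmeans` are illustrative. The loop in `kmeans` is split into an assignment step and an averaging step to mirror how a map phase and a reduce phase would divide the work on a MapReduce platform; it runs in a single process here.

```python
import math


def density_init(points, k, radius):
    """Pick k initial centers from dense, mutually separated points (assumed scheme)."""
    # Assumed density measure: number of points within `radius` (the point itself included).
    density = [sum(1 for q in points if math.dist(p, q) <= radius) for p in points]
    # Visit points from densest to sparsest; ties keep input order because sorted() is stable.
    order = sorted(range(len(points)), key=lambda i: -density[i])
    centers = []
    for i in order:
        # Accept a candidate only if it is farther than `radius` from every chosen center.
        if all(math.dist(points[i], c) > radius for c in centers):
            centers.append(points[i])
            if len(centers) == k:
                break
    if len(centers) < k:
        raise ValueError("radius too large to find k separated dense points")
    return centers


def kmeans(points, centers, max_iter=100):
    """Standard Lloyd iterations starting from the given centers."""
    centers = [tuple(c) for c in centers]
    for _ in range(max_iter):
        # "Map"-like step: group each point under the index of its nearest center.
        groups = {j: [] for j in range(len(centers))}
        for p in points:
            nearest = min(range(len(centers)), key=lambda j: math.dist(p, centers[j]))
            groups[nearest].append(p)
        # "Reduce"-like step: average each group into a new center.
        new_centers = []
        for j, members in groups.items():
            if members:
                new_centers.append(tuple(sum(x) / len(members) for x in zip(*members)))
            else:
                new_centers.append(centers[j])  # an empty cluster keeps its old center
        if new_centers == centers:
            break  # converged: no center moved
        centers = new_centers
    return centers


# Two well-separated groups of four points each (made-up illustration data).
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (1.0, 1.0),
          (10.0, 10.0), (10.0, 11.0), (11.0, 10.0), (11.0, 11.0)]
init = density_init(points, k=2, radius=2.0)
final = kmeans(points, init)
print(init)
print(final)
```

Because no random number generator is involved, repeated runs on the same data start from the same centers, which is the stability property the abstract emphasizes. The quadratic neighbor count is only suitable for small inputs; a distributed version would need a cheaper density estimate.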
Pages: 263-266
Page count: 4
Related papers
50 in total
  • [41] Research on selecting initial points for k-means clustering
    Wang, Shou-Qiang
    Zhu, Da-Ming
    PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008 : 2673 - 2677
  • [42] A Method for selecting initial centers of K-means clustering
    Xiong, Zhibin
    Mou, Jinjun
    Du, Hongyan
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2019, 125 : 147 - 148
  • [43] Research and Improve on K-means Algorithm Based on Hadoop
    Wu, Kehe
    Zeng, Wenjing
    Wu, Tingting
    An, Yanwen
    PROCEEDINGS OF 2015 6TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE, 2015 : 334 - 337
  • [44] A k-means based clustering algorithm
    Bloisi, Domenico Daniele
    Iocchi, Luca
    COMPUTER VISION SYSTEMS, PROCEEDINGS, 2008, 5008 : 109 - 118
  • [45] Graph based k-means clustering
    Galluccio, Laurent
    Michel, Olivier
    Comon, Pierre
    Hero, Alfred O., III
    SIGNAL PROCESSING, 2012, 92 (09) : 1970 - 1984
  • [46] Research on Improved K-Means Algorithm Based on Hadoop
    Wei Xiaojing
    Li Yuanbo
    2017 4TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE), 2017 : 593 - 598
  • [47] An Improved Parallelization of K-means Algorithm based on HADOOP
    Guo, Yizhuo
    2018 INTERNATIONAL SYMPOSIUM ON POWER ELECTRONICS AND CONTROL ENGINEERING (ISPECE 2018), 2019, 1187
  • [48] Cluster center initialization algorithm for K-means clustering
    Khan, SS
    Ahmad, A
    PATTERN RECOGNITION LETTERS, 2004, 25 (11) : 1293 - 1302
  • [49] Fuzzy K-Means Incremental Clustering Based on K-Center and Vector Quantization
    Li, Taoying
    Chen, Yan
    JOURNAL OF COMPUTERS, 2010, 5 (11) : 1670 - 1677
  • [50] k-Means Clustering Algorithm and Its Simulation Based on Distributed Computing Platform
    Wu, Chunqiong
    Yan, Bingwen
    Yu, Rongrui
    Yu, Baoqin
    Zhou, Xiukao
    Yu, Yanliang
    Chen, Na
    COMPLEXITY, 2021, 2021