An Improved Method Based on the Density and K-means Nearest Neighbor Text Clustering Algorithm

被引:0
|
作者
Fan, Xiaojing [1 ]
Jiang, Mingyang [2 ]
Pei, Zhili [2 ]
Qiao, Shicheng [2 ]
Lian, Jie [2 ]
Wang, Chaoyong [3 ]
机构
[1] Inner Mongolia Univ Nationalities, Coll Mech & Engn, Tongliao, Peoples R China
[2] Inner Mongolia Univ Nationalities, Coll Comp Sci & Technol, Tongliao, Peoples R China
[3] Jilin Teachers Inst Engn & Technol, Changchun 130052, Peoples R China
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
For k-means algorithm to the initial cluster centers sensitive to outliers shortcomings, we propose a density-based method to improve the k-means algorithm. Density-based methods are used, by setting the neighborhood and the neighborhood of the object that contains at least to exclude isolated point, and will not repeat the core point as the initial cluster centers We use the ratio of the distance between the distance and class within the class as a criterion evaluation function, the number of clusters to obtain the minimum value of the criterion function as the best number of clusters. These improvements effectively overcome the shortcomings of K-means algorithm. Finally, a few examples of the improved algorithm introduces specific application examples show that the improved algorithm has a higher accuracy than the original clustering algorithm, can help achieve tight class within the class room away from the clustering effect.
引用
收藏
页码:312 / 315
页数:4
相关论文
共 50 条
  • [41] Load Forecasting Based on Improved K-means Clustering Algorithm
    Wang Yanbo
    Liu Li
    Pang Xinfu
    Fan Enpeng
    2018 CHINA INTERNATIONAL CONFERENCE ON ELECTRICITY DISTRIBUTION (CICED), 2018, : 2751 - 2755
  • [42] An Improved K-means Clustering Algorithm Based on Hadoop Platform
    Hou, Xiangru
    CYBER SECURITY INTELLIGENCE AND ANALYTICS, 2020, 928 : 1101 - 1109
  • [43] Research on Improved K-means Clustering Algorithm
    Zhang, Yinsheng
    Shan, Huilin
    Li, Jiaqiang
    Zhou, Jie
    MEMS, NANO AND SMART SYSTEMS, PTS 1-6, 2012, 403-408 : 1977 - 1980
  • [44] An Improved Kernel K-means Clustering Algorithm
    Liu, Yang
    Yin, Hong Peng
    Chai, Yi
    PROCEEDINGS OF 2016 CHINESE INTELLIGENT SYSTEMS CONFERENCE, VOL I, 2016, 404 : 275 - 280
  • [45] Research on improved K-means clustering algorithm
    Zhang, Yinsheng
    Shan, Huilin
    Li, Jiaqiang
    Zhou, Jie
    Advanced Materials Research, 2012, 403-408 : 1977 - 1980
  • [46] Movie Recommender System Using K-Means Clustering AND K-Nearest Neighbor
    Ahuja, Rishabh
    Solanki, Arun
    Nayyar, Anand
    2019 9TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2019), 2019, : 263 - 268
  • [47] An Improved K-means Clustering Algorithm Based on Normal Matrix
    Tian Shengwen
    Zhao Yongsheng
    Wang Yilei
    PROCEEDINGS OF THE SECOND INTERNATIONAL SYMPOSIUM ON TEST AUTOMATION AND INSTRUMENTATION, VOL 4, 2008, : 2182 - 2185
  • [48] Improved K-means Algorithm Based on the Clustering Reliability Analysis
    Zhang, Hong
    Yu, Hong
    Li, Ying
    Hu, Baofang
    PROCEEDINGS OF THE 2015 INTERNATIONAL SYMPOSIUM ON COMPUTERS & INFORMATICS, 2015, 13 : 2516 - 2523
  • [49] Clustering of College Students Based on Improved K-means Algorithm
    Fan, Zhongxiang
    Yan, Sun
    2016 INTERNATIONAL COMPUTER SYMPOSIUM (ICS), 2016, : 676 - 679
  • [50] The K-means clustering algorithm based on density and ant colony
    Peng, YQ
    Hou, XD
    Liu, S
    PROCEEDINGS OF 2003 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS & SIGNAL PROCESSING, PROCEEDINGS, VOLS 1 AND 2, 2003, : 457 - 460