Study on density peaks clustering based on k-nearest neighbors and principal component analysis

Cited by: 370
Authors
Du, Mingjing [1, 2]
Ding, Shifei [1, 2]
Jia, Hongjie [1, 2]
Affiliations
[1] China Univ Min & Technol, Sch Comp Sci & Technol, Xuzhou 221116, Peoples R China
[2] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100090, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Data clustering; Density peaks; k Nearest neighbors (KNN); Principal component analysis (PCA); ALGORITHM; SEARCH;
DOI
10.1016/j.knosys.2016.02.001
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The density peaks clustering (DPC) algorithm, published in the US journal Science in 2014, is a novel density-based clustering algorithm. It requires neither an iterative process nor additional parameters. However, the original algorithm takes into account only the global structure of the data, which causes it to miss many clusters. In addition, DPC does not perform well on data sets of relatively high dimension; in particular, it generates the wrong number of clusters on real-world data sets. To overcome the first problem, we propose density peaks clustering based on k nearest neighbors (DPC-KNN), which introduces the idea of k nearest neighbors (KNN) into DPC and provides another option for computing the local density. To overcome the second problem, we introduce principal component analysis (PCA) into the DPC-KNN model and propose a PCA-based method (DPC-KNN-PCA) that preprocesses high-dimensional data. Experiments on synthetic data sets demonstrate the feasibility of our algorithms. In experiments on real-world data sets, we compare the proposed algorithm with the k-means algorithm and the spectral clustering (SC) algorithm in terms of accuracy. The experimental results show that our algorithms are feasible and effective. (C) 2016 Elsevier B.V. All rights reserved.
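The abstract names the two ingredients (a KNN-based local density and PCA preprocessing) but gives no formulas, so the following Python sketch is only an illustration under stated assumptions, not the authors' implementation. It assumes the local density of a point is the exponential of the negative mean squared distance to its k nearest neighbors, takes the number of clusters from the caller, and uses the hypothetical function names `pca_reduce` and `dpc_knn`.

```python
import numpy as np


def pca_reduce(X, n_components):
    """Project X onto its top principal components (PCA via SVD)."""
    Xc = X - X.mean(axis=0)
    # Rows of Vt are the principal directions, ordered by decreasing variance.
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    return Xc @ Vt[:n_components].T


def dpc_knn(X, k, n_clusters):
    """Density-peaks clustering with a KNN-based local density (sketch).

    Assumption: rho_i = exp(-mean of squared distances to the k nearest
    neighbours of point i). The abstract does not state the formula.
    """
    n = len(X)
    # Pairwise Euclidean distances, shape (n, n).
    D = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)
    # Column 0 of each sorted row is the point itself (distance 0); skip it.
    knn_d = np.sort(D, axis=1)[:, 1:k + 1]
    rho = np.exp(-np.mean(knn_d ** 2, axis=1))

    # Visit points from densest to sparsest.
    order = np.argsort(-rho, kind="stable")
    rank_of = np.empty(n, dtype=int)
    rank_of[order] = np.arange(n)

    delta = np.empty(n)             # distance to nearest denser point
    parent = np.full(n, -1)         # index of that nearest denser point
    delta[order[0]] = D[order[0]].max()  # convention for the densest point
    for rank in range(1, n):
        i = order[rank]
        higher = order[:rank]
        j = higher[np.argmin(D[i, higher])]
        delta[i] = D[i, j]
        parent[i] = j

    # Centers: largest rho * delta; ties go to the denser point, so the
    # densest point is always selected first.
    gamma = rho * delta
    centers = np.lexsort((rank_of, -gamma))[:n_clusters]

    labels = np.full(n, -1)
    labels[centers] = np.arange(n_clusters)
    # Each remaining point inherits the label of its nearest denser point,
    # which has already been labelled because of the visiting order.
    for i in order:
        if labels[i] == -1:
            labels[i] = labels[parent[i]]
    return labels, centers
```

A typical call under these assumptions would be `labels, centers = dpc_knn(pca_reduce(X, 2), k=5, n_clusters=2)`, which mirrors the DPC-KNN-PCA idea of reducing the dimension before clustering. The values of `k`, the number of components, and the number of clusters are illustrative choices, not values taken from the paper.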
Pages: 135 - 145
Number of pages: 11
Related papers
50 in total
  • [11] Relative density based K-nearest neighbors clustering algorithm
    Liu, QB
    Deng, S
    Lu, CH
    Wang, B
    Zhou, YF
    2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 133 - 137
  • [12] Density peaks clustering based on local fair density and fuzzy k-nearest neighbors membership allocation strategy
    Ren, Chunhua
    Sun, Linfu
    Gao, Yunhui
    Yu, Yang
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (01) : 21 - 34
  • [13] Accelerated k-nearest neighbors algorithm based on principal component analysis for text categorization
    Min DU
    Xing-shu CHEN
    Journal of Zhejiang University-Science C(Computers & Electronics), 2013, 14 (06) : 407 - 416
  • [14] Accelerated k-nearest neighbors algorithm based on principal component analysis for text categorization
    Min DU
    Xing-shu CHEN
    Frontiers of Information Technology & Electronic Engineering, 2013, (06) : 407 - 416
  • [15] Accelerated k-nearest neighbors algorithm based on principal component analysis for text categorization
    Min Du
    Xing-shu Chen
    Journal of Zhejiang University SCIENCE C, 2013, 14 : 407 - 416
  • [16] Accelerated k-nearest neighbors algorithm based on principal component analysis for text categorization
    Du, Min
    Chen, Xing-shu
    JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE C-COMPUTERS & ELECTRONICS, 2013, 14 (06) : 407 - 416
  • [17] Robust clustering by detecting density peaks and assigning points based on fuzzy weighted K-nearest neighbors
    Xie, Juanying
    Gao, Hongchao
    Xie, Weixin
    Liu, Xiaohui
    Grant, Philip W.
    INFORMATION SCIENCES, 2016, 354 : 19 - 40
  • [18] A Fuzzy Density Peaks Clustering Algorithm Based on Improved DNA Genetic Algorithm and K-Nearest Neighbors
    Zhang, Wenqian
    Zang, Wenke
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING, 2018, 11266 : 476 - 487
  • [19] An improved density peaks clustering algorithm using similarity assignment strategy with K-nearest neighbors
    Hu, Wei
    Feng, Ji
    Yang, Degang
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (09) : 12689 - 12706
  • [20] K-nearest neighbors clustering algorithm
    Gauza, Dariusz
    Zukowska, Anna
    Nowak, Robert
    PHOTONICS APPLICATIONS IN ASTRONOMY, COMMUNICATIONS, INDUSTRY, AND HIGH-ENERGY PHYSICS EXPERIMENTS 2014, 2014, 9290