Density peaks clustering based on k-nearest neighbors and self-recommendation

被引:32
|
作者
Sun, Lin [1 ]
Qin, Xiaoying [1 ]
Ding, Weiping [2 ]
Xu, Jiucheng [1 ]
Zhang, Shiguang [1 ]
机构
[1] Henan Normal Univ, Coll Comp & Informat Engn, Xinxiang 453007, Henan, Peoples R China
[2] Nantong Univ, Sch Informat Sci & Technol, Nantong 226019, Peoples R China
基金
中国国家自然科学基金;
关键词
Density peaks clustering; Microcluster; Neighbourhood; Local center; Self-recommendation strategy; ALGORITHM; INFORMATION;
D O I
10.1007/s13042-021-01284-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Density peaks clustering (DPC) model focuses on searching density peaks and clustering data with arbitrary shapes for machine learning. However, it is difficult for DPC to select a cut-off distance in the calculation of a local density of points, and DPC easily ignores the cluster centers with lower density in datasets with variable densities. In addition, for clusters with complex shapes, DPC selects only one cluster center for a cluster, meaning that the structure of the whole cluster is not fully reflected. To overcome these drawbacks, this paper presents a novel DPC model that merges microclusters based on k-nearest neighbors (kNN) and self-recommendation, called DPC-MC for short. First, the kNN-based neighbourhood of point is defined and the mutual neighbour degree of point is presented in this neighbourhood, and then a new local density based on the mutual neighbour degree is proposed. This local density does not need to set the cut-off distance manually. Second, to address the artificial setting of cluster centers, a self-recommendation strategy for local centers is provided. Third, after the selection of multiple local centers, the binding degree of microclusters is developed to quantify the combination degree between a microcluster and its neighbour clusters. After that, homogeneous clusters are found according to the binding degree of microclusters during the process of deleting boundary points layer by layer. The homologous clusters are merged, the points in the abnormal clusters are reallocated, and then the clustering process ends. Finally, the DPC-MC algorithm is designed, and nine synthetic datasets and twenty-seven real-world datasets are used to verify the effectiveness of our algorithm. The experimental results demonstrate that the presented algorithm outperforms other compared algorithms in terms of several evaluation metrics for clustering.
引用
收藏
页码:1913 / 1938
页数:26
相关论文
共 50 条
  • [1] Density peaks clustering based on k-nearest neighbors and self-recommendation
    Lin Sun
    Xiaoying Qin
    Weiping Ding
    Jiucheng Xu
    Shiguang Zhang
    International Journal of Machine Learning and Cybernetics, 2021, 12 : 1913 - 1938
  • [2] Density peaks clustering based on k-nearest neighbors sharing
    Fan, Tanghuai
    Yao, Zhanfeng
    Han, Longzhe
    Liu, Baohong
    Lv, Li
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (05):
  • [3] Local density based on weighted K-nearest neighbors for density peaks clustering
    Ding, Sifan
    Li, Min
    Huang, Tianyi
    Zhu, William
    KNOWLEDGE-BASED SYSTEMS, 2024, 305
  • [4] Density Peaks Clustering Algorithm Based on Representative Points and K-nearest Neighbors
    Zhang Q.-H.
    Zhou J.-P.
    Dai Y.-Y.
    Wang G.-Y.
    Ruan Jian Xue Bao/Journal of Software, 2023, 34 (12): : 5629 - 5648
  • [5] Density peaks clustering algorithm with K-nearest neighbors and weighted similarity
    Zhao J.
    Chen L.
    Wu R.-X.
    Zhang B.
    Han L.-Z.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2022, 39 (12): : 2349 - 2357
  • [6] Study on density peaks clustering based on k-nearest neighbors and principal component analysis
    Du, Mingjing
    Ding, Shifei
    Jia, Hongjie
    KNOWLEDGE-BASED SYSTEMS, 2016, 99 : 135 - 145
  • [7] Density Peaks Clustering Algorithm Based on Weighted k-Nearest Neighbors and Geodesic Distance
    Liu, Lina
    Yu, Donghua
    IEEE ACCESS, 2020, 8 : 168282 - 168296
  • [8] A novel density peaks clustering algorithm for automatic selection of clustering centers based on K-nearest neighbors
    Wang, Zhihe
    Wang, Huan
    Du, Hui
    Chen, Shiyin
    Shi, Xinxin
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (07) : 11875 - 11894
  • [9] Density Peaks Clustering Algorithm Based on K Nearest Neighbors
    Yin, Shihao
    Wu, Runxiu
    Li, Peiwu
    Liu, Baohong
    Fu, Xuefeng
    ADVANCES IN INTELLIGENT SYSTEMS AND COMPUTING (ECC 2021), 2022, 268 : 129 - 144
  • [10] Effective Density Peaks Clustering Algorithm Based on the Layered K-Nearest Neighbors and Subcluster Merging
    Ren, Chunhua
    Sun, Linfu
    Yu, Yang
    Wu, Qishi
    IEEE ACCESS, 2020, 8 : 123449 - 123468