Density peaks clustering based on k-nearest neighbors and self-recommendation

被引:32
|
作者
Sun, Lin [1 ]
Qin, Xiaoying [1 ]
Ding, Weiping [2 ]
Xu, Jiucheng [1 ]
Zhang, Shiguang [1 ]
机构
[1] Henan Normal Univ, Coll Comp & Informat Engn, Xinxiang 453007, Henan, Peoples R China
[2] Nantong Univ, Sch Informat Sci & Technol, Nantong 226019, Peoples R China
基金
中国国家自然科学基金;
关键词
Density peaks clustering; Microcluster; Neighbourhood; Local center; Self-recommendation strategy; ALGORITHM; INFORMATION;
D O I
10.1007/s13042-021-01284-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Density peaks clustering (DPC) model focuses on searching density peaks and clustering data with arbitrary shapes for machine learning. However, it is difficult for DPC to select a cut-off distance in the calculation of a local density of points, and DPC easily ignores the cluster centers with lower density in datasets with variable densities. In addition, for clusters with complex shapes, DPC selects only one cluster center for a cluster, meaning that the structure of the whole cluster is not fully reflected. To overcome these drawbacks, this paper presents a novel DPC model that merges microclusters based on k-nearest neighbors (kNN) and self-recommendation, called DPC-MC for short. First, the kNN-based neighbourhood of point is defined and the mutual neighbour degree of point is presented in this neighbourhood, and then a new local density based on the mutual neighbour degree is proposed. This local density does not need to set the cut-off distance manually. Second, to address the artificial setting of cluster centers, a self-recommendation strategy for local centers is provided. Third, after the selection of multiple local centers, the binding degree of microclusters is developed to quantify the combination degree between a microcluster and its neighbour clusters. After that, homogeneous clusters are found according to the binding degree of microclusters during the process of deleting boundary points layer by layer. The homologous clusters are merged, the points in the abnormal clusters are reallocated, and then the clustering process ends. Finally, the DPC-MC algorithm is designed, and nine synthetic datasets and twenty-seven real-world datasets are used to verify the effectiveness of our algorithm. The experimental results demonstrate that the presented algorithm outperforms other compared algorithms in terms of several evaluation metrics for clustering.
引用
收藏
页码:1913 / 1938
页数:26
相关论文
共 50 条
  • [21] Density Peaks Clustering Based on Label Propagation and K-Mutual-Nearest Neighbors
    Sun, Liping
    Huang, Fan
    Zheng, Xiaoyao
    Guo, Liangmin
    Yu, Qingying
    Chen, Zhenghua
    Luo, Yonglong
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,
  • [22] RETRACTED: Detection of Power Data Outliers Using Density Peaks Clustering Algorithm Based on K-Nearest Neighbors (Retracted Article)
    Li, Qingpeng
    Chen, Lei
    Wang, Yuhan
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [23] Clustering by Detecting Density Peaks and Assigning Points by Similarity-First Search Based on Weighted K-Nearest Neighbors Graph
    Diao, Qi
    Dai, Yaping
    An, Qichao
    Li, Weixing
    Feng, Xiaoxue
    Pan, Feng
    COMPLEXITY, 2020, 2020
  • [24] Density based clustering algorithm for distributed datasets using mutual K-nearest neighbors
    Salim A.
    International Journal of Advanced Computer Science and Applications, 2019, 10 (03): : 620 - 630
  • [26] Density Peak Clustering Algorithm Based on K-nearest Neighbors and Optimized Allocation Strategy
    Sun L.
    Qin X.-Y.
    Xu J.-C.
    Xue Z.-A.
    Ruan Jian Xue Bao/Journal of Software, 2022, 33 (04): : 1390 - 1411
  • [27] AN APPROXIMATE CLUSTERING TECHNIQUE BASED ON THE K-NEAREST NEIGHBORS METHOD
    KOVALENKO, AP
    AUTOMATION AND REMOTE CONTROL, 1992, 53 (10) : 1592 - 1598
  • [28] A novel density peaks clustering algorithm based on K nearest neighbors with adaptive merging strategy
    Xiaoning Yuan
    Hang Yu
    Jun Liang
    Bing Xu
    International Journal of Machine Learning and Cybernetics, 2021, 12 : 2825 - 2841
  • [29] A novel density peaks clustering algorithm based on K nearest neighbors with adaptive merging strategy
    Yuan, Xiaoning
    Yu, Hang
    Liang, Jun
    Xu, Bing
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (10) : 2825 - 2841
  • [30] A novel density peaks clustering algorithm based on k nearest neighbors for improving assignment process
    Jiang, Jianhua
    Chen, Yujun
    Meng, Xianqiu
    Wang, Limin
    Li, Keqin
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2019, 523 : 702 - 713