A novel stratification clustering algorithm based on a new local density estimation method and an improved local inter-cluster distance measure

Authors
Jianfang Qi
Yue Li
Haibin Jin
Jianying Feng
Dong Tian
Weisong Mu
Affiliations
[1] China Agricultural University,College of Information and Electrical Engineering
[2] Ministry of Agriculture,Key Laboratory of Viticulture and Enology
Keywords
Local density estimation; Single-linkage algorithm; Inter-cluster distance measure; Natural neighbor; Stratification clustering
DOI
Not available
Abstract
Clustering datasets that contain clusters of different shapes and densities, together with noise, has recently attracted growing attention. However, most current clustering algorithms improve clustering performance at the expense of simplicity and fail to balance clustering quality against ease of use. To address this problem, we propose a new algorithm called stratification clustering based on density, hierarchy and partition (SDHP), which integrates the advantages of density-based, hierarchical and partition-based clustering. First, a new parameter-free local density estimation strategy based on the bidirectional natural neighbor relationship, named local density based on natural neighbor (NN-LD), is proposed to identify the core part of each sub-cluster. Then, a new stratification strategy based on NN-LD, named Stratification-NN-LD (S-NN-LD), is proposed to divide the entire dataset into two layers, the core layer and the edge layer, which simplifies the dataset structure and makes the algorithm robust to noise. Next, the hierarchical single-linkage algorithm is applied to the core layer to obtain the initial clustering result, since it is well suited to clustering datasets with various shapes and densities. Finally, to improve the clustering accuracy of samples in the edge layer, a new local inter-cluster distance measure based on the average of neighbor distances is combined with partitioning clustering to assign these samples to the sub-clusters in the initial clustering result. Experiments on twenty datasets show that SDHP achieves better clustering accuracy than four popular hierarchical clustering algorithms, four recent density-based clustering algorithms, and a state-of-the-art partitioning clustering algorithm, and that it can be applied well in practice. The source code can be downloaded from https://github.com/qi111678/SDHP.
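The four stages named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation (see the repository linked above for that), and the abstract does not give the formulas, so several pieces are assumptions made only for illustration: the natural-neighbor search is a simplified version of the usual "grow k until the orphan count stabilizes" procedure, the density (number of mutual neighbors divided by the sum of their distances) is a stand-in for NN-LD, the mean-density cutoff is a stand-in for S-NN-LD, and the edge-layer rule (average distance to the `m` nearest core members of each cluster) is a stand-in for the paper's local inter-cluster distance. All function names, `n_clusters`, and `m` are hypothetical.

```python
import math
from itertools import combinations

def pairwise(points):
    """Full Euclidean distance matrix; O(n^2), fine for a toy example."""
    return [[math.dist(p, q) for q in points] for p in points]

def natural_neighbors(dist):
    """Grow k until the number of points that are nobody's k-nearest
    neighbor reaches zero or stops changing; return mutual k-NN sets."""
    n = len(dist)  # assumes n >= 2
    order = [sorted((j for j in range(n) if j != i), key=lambda j: dist[i][j])
             for i in range(n)]
    prev_orphans = -1
    for k in range(1, n):
        knn = [set(order[i][:k]) for i in range(n)]
        orphans = n - len(set().union(*knn))
        if orphans == 0 or orphans == prev_orphans:
            break
        prev_orphans = orphans
    return [{j for j in knn[i] if i in knn[j]} for i in range(n)]

def local_density(dist, nn):
    """ASSUMED stand-in for NN-LD: mutual-neighbor count over summed distance."""
    return [len(nbrs) / (sum(dist[i][j] for j in nbrs) + 1e-12)
            for i, nbrs in enumerate(nn)]

def single_linkage(dist, idx, n_clusters):
    """Single-linkage cut at n_clusters: merge closest pairs via union-find."""
    parent = {i: i for i in idx}
    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i
    components = len(idx)
    for _, a, b in sorted((dist[a][b], a, b) for a, b in combinations(idx, 2)):
        if components <= n_clusters:
            break
        ra, rb = find(a), find(b)
        if ra != rb:
            parent[ra] = rb
            components -= 1
    return {i: find(i) for i in idx}

def assign_edge(dist, core_labels, edge, m):
    """ASSUMED rule: give each edge point to the cluster whose m nearest
    core members are closest on average."""
    clusters = {}
    for i, lab in core_labels.items():
        clusters.setdefault(lab, []).append(i)
    labels = dict(core_labels)
    for e in edge:
        def avg_near(members):
            near = sorted(dist[e][j] for j in members)[:m]
            return sum(near) / len(near)
        labels[e] = min(clusters, key=lambda lab: avg_near(clusters[lab]))
    return labels

def sdhp_sketch(points, n_clusters, m=3):
    dist = pairwise(points)
    dens = local_density(dist, natural_neighbors(dist))
    cutoff = sum(dens) / len(dens)  # ASSUMED stratification threshold
    core = [i for i, d in enumerate(dens) if d >= cutoff]
    edge = [i for i, d in enumerate(dens) if d < cutoff]
    labels = assign_edge(dist, single_linkage(dist, core, n_clusters), edge, m)
    return [labels[i] for i in range(len(points))], core

# Two 3x3 grid blobs plus one stray point beside each blob.
blob_a = [(x, y) for x in range(3) for y in range(3)]
blob_b = [(x + 20, y) for x, y in blob_a]
points = blob_a + blob_b + [(4, 1), (18, 1)]
labels, core = sdhp_sketch(points, n_clusters=2)
print(labels)
print(core)
```

In this toy input the two stray points have no mutual neighbors, so they fall into the edge layer and are attached to the nearer blob only after the core layer has been clustered. A single global cutoff is a simplification that would likely misjudge sparse clusters in data with strongly varying densities.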
Pages: 4251 - 4283
Number of pages: 32
Related papers
50 in total
  • [1] A novel stratification clustering algorithm based on a new local density estimation method and an improved local inter-cluster distance measure
    Qi, Jianfang
    Li, Yue
    Jin, Haibin
    Feng, Jianying
    Tian, Dong
    Mu, Weisong
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (12) : 4251 - 4283
  • [2] A novel inter-cluster distance measure combining GLR and ICR for improved agglomerative hierarchical speaker clustering
    Han, Kyu J.
    Narayanan, Shrikanth S.
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4373 - 4376
  • [3] An Improved K-modes Clustering Algorithm Based on Intra-cluster and Inter-cluster Dissimilarity Measure
    Zhou, Hongfang
    Zhang, Yihui
    Liu, Yibin
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING, INFORMATION SCIENCE & APPLICATION TECHNOLOGY (ICCIA 2017), 2017, 74 : 410 - 418
  • [4] Study on cluster centers optimization of max-min distance k-means clustering algorithm based on inter-cluster separation measure
    Xie W.
    Lei L.
    Liu X.
    Liu Y.
    Journal of Intelligent and Fuzzy Systems, 2024, 46 (04): : 7839 - 7857
  • [5] A novel bidirectional clustering algorithm based on local density
    Baicheng Lyu
    Wenhua Wu
    Zhiqiang Hu
    Scientific Reports, 11
  • [6] A novel bidirectional clustering algorithm based on local density
    Lyu, Baicheng
    Wu, Wenhua
    Hu, Zhiqiang
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [7] A new local density and relative distance based spectrum clustering
    Mingzhe Liu
    Mingfu He
    Ruili Wang
    Shaoda Li
    Knowledge and Information Systems, 2019, 61 : 965 - 985
  • [8] A new local density and relative distance based spectrum clustering
    Liu, Mingzhe
    He, Mingfu
    Wang, Ruili
    Li, Shaoda
    KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 61 (02) : 965 - 985
  • [9] A Novel Density Peaks Clustering Algorithm Based on Local Reachability Density
    Wang, Hanqing
    Zhou, Bin
    Zhang, Jianyong
    Cheng, Ruixue
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2020, 13 (01) : 690 - 697
  • [10] A Novel Density Peaks Clustering Algorithm Based on Local Reachability Density
    Hanqing Wang
    Bin Zhou
    Jianyong Zhang
    Ruixue Cheng
    International Journal of Computational Intelligence Systems, 2020, 13 : 690 - 697