A novel stratification clustering algorithm based on a new local density estimation method and an improved local inter-cluster distance measure

Cited by: 0
Authors
Jianfang Qi
Yue Li
Haibin Jin
Jianying Feng
Dong Tian
Weisong Mu
Affiliations
[1] China Agricultural University, College of Information and Electrical Engineering
[2] Ministry of Agriculture, Key Laboratory of Viticulture and Enology
Keywords
Local density estimation; Single-linkage algorithm; Inter-cluster distance measure; Natural neighbor; Stratification clustering;
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
Clustering datasets that contain clusters of different shapes and densities as well as noise has recently attracted growing attention. However, most current clustering algorithms improve clustering performance at the expense of simplicity and do not strike a good balance between clustering quality and ease of use. To address this problem, we propose a new algorithm, stratification clustering based on density, hierarchy and partition (SDHP), which integrates the advantages of density-based, hierarchical and partitioning clustering. First, a new parameter-free local density estimation strategy based on the bidirectional natural neighbor relationship, named local density based on natural neighbor (NN-LD), is proposed to identify the core part of each sub-cluster. Then, a new stratification strategy based on NN-LD, named Stratification-NN-LD (S-NN-LD), is proposed to divide the entire dataset into two layers, the core layer and the edge layer, which simplifies the dataset structure and makes the algorithm robust to noise. Next, the hierarchical single-linkage algorithm is applied to the core layer to obtain the initial clustering result, because it is well suited to clustering datasets with various shapes and densities. Finally, to improve the clustering accuracy of samples in the edge layer, a new local inter-cluster distance measure based on the average of neighbor distances is combined with partitioning clustering to assign these samples to the sub-clusters in the initial clustering result. Experiments on twenty datasets show that SDHP achieves better clustering accuracy than four popular hierarchical clustering algorithms, four recent density-based clustering algorithms, and a state-of-the-art partitioning clustering algorithm, and that it is practical to apply. The source code can be downloaded from https://github.com/qi111678/SDHP.
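The abstract outlines a three-stage pipeline (stratify by local density, run single-linkage on the core layer, then attach edge samples by a local distance to each sub-cluster) without giving the formulas. The sketch below illustrates only that overall flow and is not the authors' implementation. Everything specific in it is an assumption: the density is a plain inverse mean k-nearest-neighbor distance rather than NN-LD, the core layer is simply the `n_core` densest samples rather than the S-NN-LD rule, the edge assignment uses the mean distance to the `m` nearest core members of each cluster, and the names `sdhp_sketch`, `stratify`, `single_linkage`, `assign_edge` and the parameters `k`, `n_core`, `n_clusters` are hypothetical. The published method is described as parameter-free for density estimation; the repository linked above is the reference for the real algorithm.

```python
import math


def pairwise_distances(points):
    """Full Euclidean distance matrix as nested lists."""
    return [[math.dist(p, q) for q in points] for p in points]


def stratify(dist, k, n_core):
    """Split sample indices into a dense core layer and a sparse edge layer.

    Stand-in density (an assumption, not NN-LD): the inverse of the mean
    distance to the k nearest neighbors.
    """
    n = len(dist)
    density = []
    for i in range(n):
        nearest = sorted(dist[i][j] for j in range(n) if j != i)[:k]
        density.append(1.0 / (sum(nearest) / len(nearest) + 1e-12))
    ranked = sorted(range(n), key=lambda i: density[i], reverse=True)
    return sorted(ranked[:n_core]), sorted(ranked[n_core:])


def single_linkage(dist, members, n_clusters):
    """Single-linkage clustering of `members`, stopped at n_clusters.

    Merging pairs in order of increasing distance with union-find (Kruskal
    style) gives the same partition as cutting the single-linkage
    dendrogram, up to ties between equal distances.
    """
    parent = {i: i for i in members}

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    pairs = sorted((dist[a][b], a, b)
                   for idx, a in enumerate(members)
                   for b in members[idx + 1:])
    remaining = len(members)
    for _, a, b in pairs:
        if remaining <= n_clusters:
            break
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a
            remaining -= 1
    roots = {}  # map each component root to a label 0, 1, 2, ...
    return {i: roots.setdefault(find(i), len(roots)) for i in members}


def assign_edge(dist, core_labels, edge, m):
    """Attach each edge sample to the core cluster with the smallest mean
    distance to its m nearest members of that cluster (a stand-in for the
    paper's local inter-cluster distance measure)."""
    clusters = {}
    for i, label in core_labels.items():
        clusters.setdefault(label, []).append(i)
    labels = dict(core_labels)
    for e in edge:
        def local_distance(label):
            nearest = sorted(dist[e][i] for i in clusters[label])[:m]
            return sum(nearest) / len(nearest)
        labels[e] = min(clusters, key=local_distance)
    return labels


def sdhp_sketch(points, k, n_core, n_clusters):
    """Stratify, cluster the core layer, then assign the edge layer."""
    dist = pairwise_distances(points)
    core, edge = stratify(dist, k, n_core)
    core_labels = single_linkage(dist, core, n_clusters)
    labels = assign_edge(dist, core_labels, edge, k)
    return [labels[i] for i in range(len(points))]


# Two tight groups of four points, each with one sparser outlying sample.
points = [(0, 0), (0.1, 0), (0, 0.1), (0.1, 0.1), (0.6, 0.6),
          (5, 5), (5.1, 5), (5, 5.1), (5.1, 5.1), (4.4, 4.4)]
labels = sdhp_sketch(points, k=3, n_core=8, n_clusters=2)
print(labels)  # [0, 0, 0, 0, 0, 1, 1, 1, 1, 1]
```

In this toy input the two outlying samples have the lowest stand-in density, so they land in the edge layer and are attached to the nearer group only after the core layer has been clustered. That ordering is the point of the stratification idea: sparse samples cannot bridge two sub-clusters during the single-linkage step.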
Pages: 4251 - 4283
Number of pages: 32
Related papers
50 in total
  • [31] A new interpoint distance-based clustering algorithm using kernel density estimation
    Modak, Soumita
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2024, 53 (11) : 5323 - 5341
  • [32] An affinity-based new local distance function and similarity measure for kNN algorithm
    Bhattacharya, Gautam
    Ghosh, Koushik
    Chowdhury, Ananda S.
    PATTERN RECOGNITION LETTERS, 2012, 33 (03) : 356 - 363
  • [33] An energy-efficient routing protocol based on particle swarm clustering algorithm and inter-cluster routing algorithm for WSN
    Xia Li
    Wang Gang
    Liu Zongqi
    Zhang Yanyan
    2013 25TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2013, : 4029 - 4033
  • [34] Local Connectivity-Based Density Estimation for Face Clustering
    Shin, Junho
    Lee, Hyo-Jun
    Kim, Hyunseop
    Baek, Jong-Hyeon
    Kim, Daehyun
    Koh, Yeong Jun
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 13621 - 13629
  • [35] Molecular Clump Extraction Algorithm Based on Local Density Clustering
    Luo, Xiaoyu
    Zheng, Sheng
    Huang, Yao
    Zeng, Shuguang
    Zeng, Xiangyun
    Jiang, Zhibo
    Chen, Zhiwei
    RESEARCH IN ASTRONOMY AND ASTROPHYSICS, 2022, 22 (01)
  • [36] A local-density based spatial clustering algorithm with noise
    Duan, Lian
    Xu, Lida
    Guo, Feng
    Lee, Jun
    Yan, Baopin
    INFORMATION SYSTEMS, 2007, 32 (07) : 978 - 986
  • [37] Molecular Clump Extraction Algorithm Based on Local Density Clustering
    Xiaoyu Luo
    Sheng Zheng
    Yao Huang
    Shuguang Zeng
    Xiangyun Zeng
    Zhibo Jiang
    Zhiwei Chen
    Research in Astronomy and Astrophysics, 2022, 22 (01) : 21 - 31
  • [38] Hierarchical clustering algorithm based on natural local density peaks
    Cai, Fapeng
    Feng, Ji
    Yang, Degang
    Chen, Zhongshang
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (11) : 7989 - 8004
  • [39] A new algorithm for clustering based on kernel density estimation
    Matioli, L. C.
    Santos, S. R.
    Kleina, M.
    Leite, E. A.
    JOURNAL OF APPLIED STATISTICS, 2018, 45 (02) : 347 - 366
  • [40] A new measure for assessment of clustering based on kernel density estimation
    Modak, Soumita
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2023, 52 (17) : 5942 - 5951