A novel stratification clustering algorithm based on a new local density estimation method and an improved local inter-cluster distance measure

被引:0
|
作者
Jianfang Qi
Yue Li
Haibin Jin
Jianying Feng
Dong Tian
Weisong Mu
机构
[1] China Agricultural University,College of Information and Electrical Engineering
[2] Ministry of Agriculture,Key Laboratory of Viticulture and Enology
关键词
Local density estimation; Single-linkage algorithm; Inter-cluster distance measure; Natural neighbor; Stratification clustering;
D O I
暂无
中图分类号
学科分类号
摘要
Recently clustering for datasets with different shapes, densities and noises has attracted more and more attention from scholars. However, most current clustering algorithms improve the clustering performance at the expense of the simplicity, and cannot balance well between the clustering quality and the operability for the users. To solve this problem, we propose a new algorithm called stratification clustering based on density, hierarchy and partition (SDHP) by effectively integrating the advantages of the density-based, hierarchical-based and partition-based clustering. First, a new parameter-free local density estimation strategy based on the bidirectional natural neighbor relationship named local density based on natural neighbor (NN-LD) is proposed to identify the core part of each sub-cluster. Then, a new stratification strategy based on the NN-LD Stratification-NN-LD (S-NN-LD) is proposed to divide the entire dataset into two layers, the core layer and the edge layer, to simplify the dataset structure and make the algorithm robust to noises. Next, the hierarchical-based single-linkage algorithm is adopted in the core layer to obtain the initial clustering result since it has advantages on clustering the datasets with various shapes and densities. Finally, to improve the clustering accuracy of samples in the edge layer, a combination of a new local inter-cluster distance measure based on the average of neighbor distances and the partitioning clustering is adopted to match these samples to the sub-clusters in the initial clustering result. The experiments on twenty datasets show that the SDHP has better clustering accuracy, and can be applied in practice well compared with four popular hierarchical clustering algorithms, four recent density-based clustering algorithms, and a state-of-the-art partitioning clustering algorithm. The source code can be downloaded from https://github.com/qi111678/SDHP.
引用
收藏
页码:4251 / 4283
页数:32
相关论文
共 50 条
  • [41] A Clustering Algorithm for Varied Density Clusters based on Similarity of Local Density of Objects
    Fahim, Ahmed
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS 2020), 2020, : 26 - 31
  • [42] K-Modes clustering algorithm based on a new distance measure
    Liang, Jiye
    Bai, Liang
    Cao, Fuyuan
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2010, 47 (10): : 1749 - 1755
  • [43] A novel multi-label clustering method based on distance measure
    Sun, Tie-Li
    Cheng, Si-Ying
    Wu, Di
    Sun, Xiao-Xin
    Yang, Feng-Qin
    ICIC Express Letters, 2012, 6 (05): : 121 - 126
  • [44] A Source Number Estimation Algorithm Based on Data Local Density and Fuzzy C-Means Clustering
    Wu, Na
    Wang, Ke
    Wan, Liangtian
    Liu, Ning
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
  • [45] An Improved Spectral Clustering Algorithm Based on Local Neighbors in Kernel Space
    Liu, Xinyue
    Yong, Xing
    Lin, Hongfei
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2011, 8 (04) : 1143 - 1157
  • [46] Anomaly Detection Algorithm Based on Subspace Local Density Estimation
    Zhang, Chunkai
    Yin, Ao
    INTERNATIONAL JOURNAL OF WEB SERVICES RESEARCH, 2019, 16 (03) : 44 - 58
  • [47] A density-core-based clustering algorithm with local resultant force
    Xiao-Xia Wang
    Yu-Fang Zhang
    Jiang Xie
    Qi-Zhu Dai
    Zhong-Yang Xiong
    Jing-Pei Dan
    Soft Computing, 2020, 24 : 6571 - 6590
  • [48] A density-core-based clustering algorithm with local resultant force
    Wang, Xiao-Xia
    Zhang, Yu-Fang
    Xie, Jiang
    Dai, Qi-Zhu
    Xiong, Zhong-Yang
    Dan, Jing-Pei
    SOFT COMPUTING, 2020, 24 (09) : 6571 - 6590
  • [49] Improved Density Peak Clustering Algorithm Based on Choosing Strategy Automatically for Cut-off Distance and Cluster Centre
    Wang, Limin
    Li, Mingyang
    Han, Xuming
    Zhou, Ruihong
    Zheng, Kaiyue
    Liu, Meihan
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2018, 25 (02): : 536 - 545
  • [50] A Sound Velocity Profile Stratification Method Based on Maximum Density and Maximum Distance Clustering
    Li, Jian
    Pan, Yue
    Li, Rong
    Zhu, Tianlong
    Zhang, Zhen
    Gu, Mingyu
    Han, Guangjie
    Tian, Jie
    APPLIED SCIENCES-BASEL, 2024, 14 (01):