Algorithm to determine ε-distance parameter in density based clustering

被引:63
|
作者
Jahirabadkar, Sunita [1 ]
Kulkarni, Parag [1 ]
机构
[1] Coll Engn, Pune, Maharashtra, India
关键词
Data mining; Clustering; Density based clustering; Subspace clustering; High dimensional data; SPATIAL DATABASES;
D O I
10.1016/j.eswa.2013.10.025
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The well known clustering algorithm DBSCAN is founded on the density notion of clustering. However, the use of global density parameter epsilon-distance makes DBSCAN not suitable in varying density datasets. Also, guessing the value for the same is not straightforward. In this paper, we generalise this algorithm in two ways. First, adaptively determine the key input parameter epsilon-distance, which makes DBSCAN independent of domain knowledge satisfying the unsupervised notion of clustering. Second, the approach of deriving epsilon-distance based on checking the data distribution of each dimension makes the approach suitable for subspace clustering, which detects clusters enclosed in various subspaces of high dimensional data. Experimental results illustrate that our approach can efficiently find out the clusters of varying sizes, shapes as well as varying densities. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:2939 / 2946
页数:8
相关论文
共 50 条
  • [31] A non-distance based clustering algorithm
    Zhu, SG
    Li, T
    PROCEEDING OF THE 2002 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-3, 2002, : 2357 - 2362
  • [32] Density Based Clustering Algorithm for VANETs
    Kuklinski, Slawomir
    Wolny, Grzegorz
    2009 5TH INTERNATIONAL CONFERENCE ON TESTBEDS AND RESEARCH INFRASTRUCTURES FOR THE DEVELOPMENT OF NETWORKS & COMMUNITIES, 2009, : 446 - 451
  • [33] HCDC: A novel hierarchical clustering algorithm based on density-distance cores for data sets with varying density
    Yang, Qi-Fen
    Gao, Wan-Yi
    Han, Gang
    Li, Zi-Yang
    Tian, Meng
    Zhu, Shu-Hua
    Deng, Yu-hui
    INFORMATION SYSTEMS, 2023, 114
  • [34] Density Peaks Clustering Algorithm Based on Weighted k-Nearest Neighbors and Geodesic Distance
    Liu, Lina
    Yu, Donghua
    IEEE ACCESS, 2020, 8 : 168282 - 168296
  • [35] Density-based clustering algorithm for numerical and categorical data with mixed distance measure methods
    Chen, Jin-Yin
    He, Hui-Hao
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2015, 32 (08): : 993 - 1002
  • [36] SFKNN-DPC: Standard deviation weighted distance based density peak clustering algorithm
    Xie, Juanying
    Liu, Xinglin
    Wang, Mingzhao
    INFORMATION SCIENCES, 2024, 653
  • [37] Density Peaks Clustering Algorithm Based on a Divergence Distance and Tissue-Like P System
    Ge, Fuhua
    Liu, Xiyu
    APPLIED SCIENCES-BASEL, 2023, 13 (04):
  • [38] FDBSCAN-APT: A Fuzzy Density-based Clustering Algorithm with Automatic Parameter Tuning
    Bechini, Alessio
    Criscione, Martina
    Ducange, Pietro
    Marcelloni, Francesco
    Renda, Alessandro
    2020 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2020,
  • [39] An Improved DBSCAN, A Density Based Clustering Algorithm with Parameter Selection for High Dimensional Data Sets
    Shah, Glory H.
    3RD NIRMA UNIVERSITY INTERNATIONAL CONFERENCE ON ENGINEERING (NUICONE 2012), 2012,
  • [40] A spectral clustering algorithm based on attribute fluctuation and density peaks clustering algorithm
    Song, Xin
    Li, Shuhua
    Qi, Ziqiang
    Zhu, Jianlin
    APPLIED INTELLIGENCE, 2023, 53 (09) : 10520 - 10534