Algorithm to determine ε-distance parameter in density based clustering

Cited by: 63
Authors
Jahirabadkar, Sunita [1]
Kulkarni, Parag [1]
Affiliation
[1] Coll Engn, Pune, Maharashtra, India
Keywords
Data mining; Clustering; Density based clustering; Subspace clustering; High dimensional data; Spatial databases
DOI
10.1016/j.eswa.2013.10.025
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The well-known clustering algorithm DBSCAN is founded on the density notion of clustering. However, its use of a single global density parameter, the epsilon-distance, makes DBSCAN unsuitable for datasets of varying density. Choosing a value for this parameter is also not straightforward. In this paper, we generalise the algorithm in two ways. First, we adaptively determine the key input parameter epsilon-distance, which makes DBSCAN independent of domain knowledge and so satisfies the unsupervised notion of clustering. Second, deriving the epsilon-distance by checking the data distribution of each dimension makes the approach suitable for subspace clustering, which detects clusters enclosed in various subspaces of high dimensional data. Experimental results illustrate that our approach can efficiently find clusters of varying sizes, shapes and densities. (C) 2013 Elsevier Ltd. All rights reserved.
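To make the parameter-selection problem concrete, the following minimal sketch estimates a single epsilon-distance from the data with the common k-distance heuristic. It is not the method proposed in the paper, whose details are not given in this record. The function names, the choice of `k`, the "largest jump" knee rule, and the sample points are all assumptions made for illustration.

```python
import math

def kth_neighbor_distances(points, k):
    """Return each point's distance to its k-th nearest neighbour, sorted ascending."""
    dists = []
    for i, p in enumerate(points):
        others = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        dists.append(others[k - 1])
    return sorted(dists)

def estimate_eps(points, k=3):
    """Hypothetical helper: take the value just before the largest jump
    in the sorted k-distance curve as a crude 'knee' estimate for eps."""
    kd = kth_neighbor_distances(points, k)
    gaps = [kd[i + 1] - kd[i] for i in range(len(kd) - 1)]
    knee = max(range(len(gaps)), key=gaps.__getitem__)
    return kd[knee]

# Made-up data: two tight groups of four points plus one isolated point.
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (1.0, 1.0),
          (10.0, 10.0), (10.0, 11.0), (11.0, 10.0), (11.0, 11.0),
          (50.0, 50.0)]
eps = estimate_eps(points, k=2)
print(eps)
```

Because this heuristic still yields one global value, it inherits the limitation the abstract describes: if the two groups had very different densities, no single epsilon-distance would suit both.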
Pages: 2939 - 2946
Number of pages: 8
Related papers
50 in total
  • [41] A spectral clustering algorithm based on attribute fluctuation and density peaks clustering algorithm
    Xin Song
    Shuhua Li
    Ziqiang Qi
    Jianlin Zhu
    Applied Intelligence, 2023, 53 : 10520 - 10534
  • [42] Multilayered fuzzy clustering method based on distance and density
    Qiu, XP
    Meng, D
    Tang, YC
    Xu, Y
    2003 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-5, CONFERENCE PROCEEDINGS, 2003, : 1417 - 1422
  • [43] The layered fuzzy clustering method based on distance and density
    Qiu, Xiaoping
    Xu, Yang
    Li, Xiaobing
    FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 2, PROCEEDINGS, 2007, : 282 - +
  • [44] Clustering in very large databases based on distance and density
    Weining Qian
    XueQing Gong
    AoYing Zhou
    Journal of Computer Science and Technology, 2003, 18 : 67 - 76
  • [45] Clustering in very large databases based on distance and density
    Qian, WN
    Gong, XQ
    Zhou, AY
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2003, 18 (01) : 67 - 76
  • [46] Two-phase clustering algorithm with density exploring distance measure
    Ma, Jingjing
    Jiang, Xiangming
    Gong, Maoguo
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2018, 3 (01) : 59 - 64
  • [47] A Density-Based Adaptive Distance Fuzzy Clustering Algorithm Based on the Multi-target Traffic Radar
    Zhang, Xinyi
    Cao, Lin
    Wang, Tao
    2020 13TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2020), 2020, : 511 - 515
  • [48] Hybrid Clustering Algorithm Based on Improved Density Peak Clustering
    Guo, Limin
    Qin, Weijia
    Cai, Zhi
    Su, Xing
    APPLIED SCIENCES-BASEL, 2024, 14 (02):
  • [49] Attribute reduction algorithm based on combined distance in clustering
    Liang, Baohua
    Lu, Zhengyu
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (01) : 1481 - 1496
  • [50] Quantum Clustering Algorithm based on Exponent Measuring Distance
    Zhang Yao
    Wang Peng
    Chen Gao-yun
    Chen Dong-Dong
    Ding Rui
    Zhang Yang
    2008 IEEE INTERNATIONAL SYMPOSIUM ON KNOWLEDGE ACQUISITION AND MODELING WORKSHOP PROCEEDINGS, VOLS 1 AND 2, 2008, : 436 - 439