Density-based clustering with non-continuous data

被引:2
|
作者
Azzalini, Adelchi [1 ]
Menardi, Giovanna [1 ]
机构
[1] Univ Padua, Dipartimento Sci Stat, Padua, Italy
关键词
Density estimation; Mixed variables; Modal clustering; Model-based clustering; Multidimensional scaling; DISCRIMINANT-ANALYSIS; MODEL; TREE;
D O I
10.1007/s00180-016-0644-8
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Density-based clustering relies on the idea of associating groups with regions of the sample space characterized by high density of the probability distribution underlying the observations. While this approach to cluster analysis exhibits some desirable properties, its use is necessarily limited to continuous data only. The present contribution proposes a simple but working way to circumvent this problem, based on the identification of continuous components underlying the non-continuous variables. The basic idea is explored in a number of variants applied to simulated data, confirming the practical effectiveness of the technique and leading to recommendations for its practical usage. Some illustrations using real data are also presented.
引用
收藏
页码:771 / 798
页数:28
相关论文
共 50 条
  • [21] Hierarchical density-based clustering of categorical data and a simplification
    Andreopoulos, Bill
    An, Aijun
    Wang, Xiaogang
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2007, 4426 : 11 - +
  • [22] Effective Density-Based Clustering Algorithms for Incomplete Data
    Zhonghao Xue
    Hongzhi Wang
    Big Data Mining and Analytics, 2021, 4 (03) : 183 - 194
  • [23] Density-based clustering for bivariate-flow data
    Shu, Hua
    Pei, Tao
    Song, Ci
    Chen, Jie
    Chen, Xiao
    Guo, Sihui
    Liu, Yaxi
    Wang, Xi
    Wang, Xuyang
    Zhou, Chenghu
    INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2022, 36 (09) : 1809 - 1829
  • [24] Density-based clustering for evolving uncertain data stream
    He, Haitao
    Zhao, Jintian
    Journal of Computational Information Systems, 2014, 10 (01): : 419 - 426
  • [25] On Density-Based Data Streams Clustering Algorithms: A Survey
    Amineh Amini
    Teh Ying Wah
    Hadi Saboohi
    Journal of Computer Science & Technology, 2014, 29 (01) : 116 - 141
  • [26] Density-based clustering for road accident data analysis
    Alotaibi, Abdullah S.
    INTERNATIONAL JOURNAL OF ADVANCED AND APPLIED SCIENCES, 2018, 5 (08): : 113 - 121
  • [27] On Density-Based Data Streams Clustering Algorithms: A Survey
    Amini, Amineh
    Teh, Ying Wah
    Saboohi, Hadi
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2014, 29 (01) : 116 - 141
  • [28] Density-based clustering algorithm for mixture data sets
    Huang, De-Cai
    Wu, Tian-Hong
    Kongzhi yu Juece/Control and Decision, 2010, 25 (03): : 416 - 421
  • [29] A Density-Based Clustering of Spatio-Temporal Data
    Zaghlool, Ehab
    ElKaffas, Saleh
    Saad, Amani
    NEW CONTRIBUTIONS IN INFORMATION SYSTEMS AND TECHNOLOGIES, VOL 2, 2015, 354 : 41 - 50
  • [30] Effective Density-Based Clustering Algorithms for Incomplete Data
    Xue, Zhonghao
    Wang, Hongzhi
    BIG DATA MINING AND ANALYTICS, 2021, 4 (03) : 183 - 194