Density-based clustering with non-continuous data

Cited by: 2
Authors
Azzalini, Adelchi [1]
Menardi, Giovanna [1]
Affiliations
[1] Univ Padua, Dipartimento Sci Stat, Padua, Italy
Keywords
Density estimation; Mixed variables; Modal clustering; Model-based clustering; Multidimensional scaling; DISCRIMINANT-ANALYSIS; MODEL; TREE;
DOI
10.1007/s00180-016-0644-8
Chinese Library Classification
O21 [Probability theory and mathematical statistics]; C8 [Statistics];
Discipline classification codes
020208 ; 070103 ; 0714 ;
Abstract
Density-based clustering relies on the idea of associating groups with regions of the sample space characterized by high density of the probability distribution underlying the observations. While this approach to cluster analysis exhibits some desirable properties, its use is necessarily limited to continuous data only. The present contribution proposes a simple but working way to circumvent this problem, based on the identification of continuous components underlying the non-continuous variables. The basic idea is explored in a number of variants applied to simulated data, confirming the practical effectiveness of the technique and leading to recommendations for its practical usage. Some illustrations using real data are also presented.
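The abstract does not spell out the procedure, so the following is only a minimal sketch of the general idea, under stated assumptions: categorical records are turned into pairwise dissimilarities, classical multidimensional scaling (one of the listed keywords) recovers continuous coordinates, and a kernel density estimate is then evaluated on those coordinates. The simple-matching dissimilarity, the bandwidth, the toy data, and all function names are illustrative choices, not the authors' method.

```python
import numpy as np

def simple_matching_dissimilarity(X):
    """Fraction of attributes on which each pair of rows disagrees (assumed choice)."""
    X = np.asarray(X)
    return (X[:, None, :] != X[None, :, :]).mean(axis=2)

def classical_mds(D, n_components=2):
    """Classical (Torgerson) MDS: continuous coordinates whose distances approximate D."""
    D = np.asarray(D, dtype=float)
    n = D.shape[0]
    J = np.eye(n) - np.ones((n, n)) / n           # centering matrix
    B = -0.5 * J @ (D ** 2) @ J                   # double-centered squared dissimilarities
    eigvals, eigvecs = np.linalg.eigh(B)          # eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1][:n_components]
    lam = np.clip(eigvals[order], 0.0, None)      # drop negative (non-Euclidean) parts
    return eigvecs[:, order] * np.sqrt(lam)

def gaussian_kde_at_points(Y, bandwidth):
    """Gaussian kernel density estimate evaluated at each row of Y."""
    sq = ((Y[:, None, :] - Y[None, :, :]) ** 2).sum(axis=2)
    d = Y.shape[1]
    norm = (2 * np.pi * bandwidth ** 2) ** (d / 2)
    return np.exp(-sq / (2 * bandwidth ** 2)).mean(axis=1) / norm

# Hypothetical toy data: six records, three categorical variables, two evident groups.
records = np.array([
    ["a", "x", "low"],
    ["a", "x", "low"],
    ["a", "y", "low"],
    ["b", "z", "high"],
    ["b", "z", "high"],
    ["b", "w", "high"],
])

D = simple_matching_dissimilarity(records)
Y = classical_mds(D, n_components=2)              # continuous surrogate coordinates
density = gaussian_kde_at_points(Y, bandwidth=0.3)
```

Any density-based clustering routine designed for continuous data could then be run on `Y`; which dissimilarity and how many components work best is exactly the kind of question the abstract says the paper examines through simulation.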
Pages: 771-798
Number of pages: 28
Related papers
50 in total
  • [1] Density-based clustering with non-continuous data
    Adelchi Azzalini
    Giovanna Menardi
    Computational Statistics, 2016, 31 : 771 - 798
  • [2] Novel density-based and hierarchical density-based clustering algorithms for uncertain data
    Zhang, Xianchao
    Liu, Han
    Zhang, Xiaotong
    NEURAL NETWORKS, 2017, 93 : 240 - 255
  • [3] An Efficient Density-Based Algorithm for Data Clustering
    Theljani, Foued
    Laabidi, Kaouther
    Zidi, Salah
    Ksouri, Moufida
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2017, 26 (04)
  • [4] Anytime density-based clustering of complex data
    Son T. Mai
    Xiao He
    Jing Feng
    Claudia Plant
    Christian Böhm
    Knowledge and Information Systems, 2015, 45 : 319 - 355
  • [5] Geometric algorithms for density-based data clustering
    Chen, DZ
    Smid, M
    Xu, B
    ALGORITHMS-ESA 2002, PROCEEDINGS, 2002, 2461 : 284 - 296
  • [6] Density-based clustering for exploration of analytical data
    Daszykowski, M
    Walczak, B
    Massart, DL
    ANALYTICAL AND BIOANALYTICAL CHEMISTRY, 2004, 380 (03) : 370 - 372
  • [7] Share density-based clustering of income data
    Condino, Francesca
    STATISTICAL ANALYSIS AND DATA MINING, 2023, 16 (04) : 336 - 347
  • [8] Geometric algorithms for density-based data clustering
    Chen, DZ
    Smid, M
    Xu, B
    INTERNATIONAL JOURNAL OF COMPUTATIONAL GEOMETRY & APPLICATIONS, 2005, 15 (03) : 239 - 260
  • [9] Anytime density-based clustering of complex data
    Mai, Son T.
    He, Xiao
    Feng, Jing
    Plant, Claudia
    Boehm, Christian
    KNOWLEDGE AND INFORMATION SYSTEMS, 2015, 45 (02) : 319 - 355
  • [10] Density-based hierarchical clustering for streaming data
    Tu, Q.
    Lu, J. F.
    Yuan, B.
    Tang, J. B.
    Yang, J. Y.
    PATTERN RECOGNITION LETTERS, 2012, 33 (05) : 641 - 645