Enhancing principal direction divisive clustering

被引:23
|
作者
Tasoulis, S. K. [1 ]
Tasoulis, D. K. [2 ]
Plagianakos, V. P. [1 ]
机构
[1] Univ Cent Greece, Dept Comp Sci & Biomed Informat, Lamia 35100, Greece
[2] Univ London Imperial Coll Sci Technol & Med, Dept Math, London SW7 2AZ, England
关键词
Clustering; Principal component analysis; Kernel density estimation; ALGORITHM; CLASSIFICATION; SELECTION;
D O I
10.1016/j.patcog.2010.05.025
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While data clustering has a long history and a large amount of research has been devoted to the development of numerous clustering techniques, significant challenges still remain. One of the most important of them is associated with high data dimensionality. A particular class of clustering algorithms has been very successful in dealing with such datasets, utilising information driven by the principal component analysis. In this work, we try to deepen our understanding on what can be achieved by this kind of approaches. We attempt to theoretically discover the relationship between true clusters in the data and the distribution of their projection onto the principal components. Based on such findings, we propose appropriate criteria for the various steps involved in hierarchical divisive clustering and develop compilations of them into new algorithms. The proposed algorithms require minimal user-defined parameters and have the desirable feature of being able to provide approximations for the number of clusters present in the data. The experimental results indicate that the proposed techniques are effective in simulated as well as real data scenarios. (C) 2010 Elsevier Ltd. All rights reserved.
引用
收藏
页码:3391 / 3411
页数:21
相关论文
共 50 条
  • [31] A KPSO-based divisive hierarchical clustering algorithm
    Zhang Yanduo
    Liu Leyuan
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE: 50 YEARS' ACHIEVEMENTS, FUTURE DIRECTIONS AND SOCIAL IMPACTS, 2006, : 676 - 678
  • [32] A study of divisive clustering with Hausdorff distances for interval data
    Chen, Yi
    Billard, L.
    PATTERN RECOGNITION, 2019, 96
  • [33] Clustering work and family trajectories by using a divisive algorithm
    Piccarreta, Raffaella
    Billari, Francesco C.
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2007, 170 : 1061 - 1078
  • [34] Divisive Hierarchical Clustering Based on Adaptive Resonance Theory
    Yamada, Yuna
    Masuyama, Naoki
    Amako, Narito
    Nojima, Yusuke
    Loo, Chu Kiong
    Ishibuchi, Hisao
    2020 INTERNATIONAL SYMPOSIUM ON COMMUNITY-CENTRIC SYSTEMS (CCS), 2020,
  • [35] A Novel Divisive Hierarchical Clustering Algorithm for Geospatial Analysis
    Li, Shaoning
    Li, Wenjing
    Qiu, Jia
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2017, 6 (01):
  • [36] A Comparative Study of Divisive and Agglomerative Hierarchical Clustering Algorithms
    Roux, Maurice
    JOURNAL OF CLASSIFICATION, 2018, 35 (02) : 345 - 366
  • [37] A Divisive Hierarchical Clustering Approach to Hyperspectral Band Selection
    Ji, Haochen
    Zuo, Zongyu
    Han, Qing-Long
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [38] Clustering on the Basis of a Divisive Approach by the Method of Alternative Adaptation
    Veselov, Gennady E.
    Lebedev, Boris K.
    Lebedev, Oleg B.
    PROCEEDINGS OF THE FOURTH INTERNATIONAL SCIENTIFIC CONFERENCE INTELLIGENT INFORMATION TECHNOLOGIES FOR INDUSTRY (IITI'19), 2020, 1156 : 384 - 392
  • [39] A Comparative Study of Divisive and Agglomerative Hierarchical Clustering Algorithms
    Maurice Roux
    Journal of Classification, 2018, 35 : 345 - 366
  • [40] Accelerating the Divisive Information-Theoretic Clustering of Visual Words
    Zhang, Jianjia
    Wang, Lei
    Liu, Lingqiao
    Zhou, Luping
    Li, Wanqing
    2013 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES & APPLICATIONS (DICTA), 2013, : 141 - 148