Hierarchical Clustering Using Non-Greedy Principal Direction Divisive Partitioning

被引:0
|
作者
Martin Nilsson
机构
[1] Los Alamos National Laboratory,
来源
Information Retrieval | 2002年 / 5卷
关键词
clustering; taxonomy; PCA; classification;
D O I
暂无
中图分类号
学科分类号
摘要
We present a non-greedy version of the recently published Principal Direction Divisive Partitioning (PDDP) algorithm. The PDDP algorithm creates a hierarchical taxonomy of a data set by successively splitting the data into sub-clusters. At each level the cluster with largest variance is split by a hyper-plane orthogonal to its leading principal component. The PDDP algorithm is known to produce high quality clusters, especially when applied to high dimensional data, such as document-word feature matrices. It also scales well with both the size and the dimensionality of the data set. However, at each level only the locally optimal choice of spitting is considered. At a later stage this often leads to a non-optimal global partitioning of the data. The non-greedy version of the PDDP algorithm (NGPDDP) presented in this paper address this problem. At each level multiple alternative splitting strategies are considered. Results from applying the algorithm to generated and real data (feature vectors from sets of text documents) are presented. The results show substantial improvements in the cluster quality.
引用
收藏
页码:311 / 321
页数:10
相关论文
共 50 条
  • [21] pPOP: Fast yet accurate parallel hierarchical clustering using partitioning
    Dash, Manoranjan
    Petrutiu, Simona
    Scheuermann, Peter
    DATA & KNOWLEDGE ENGINEERING, 2007, 61 (03) : 563 - 578
  • [22] A Three-stage Scheme for Consumers' Partitioning Using Hierarchical Clustering Algorithm
    Nasiakou, Antonia
    Alamaniotis, Miltiadis
    Tsoukalas, Lefteri H.
    Karagiannis, Georgios
    2017 8TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS & APPLICATIONS (IISA), 2017, : 370 - 375
  • [23] Classification of bioinformatics workflows using weighted versions of partitioning and hierarchical clustering algorithms
    Lord, Etienne
    Diallo, Abdoulaye Banire
    Makarenkov, Vladimir
    BMC BIOINFORMATICS, 2015, 16
  • [24] Classification of bioinformatics workflows using weighted versions of partitioning and hierarchical clustering algorithms
    Etienne Lord
    Abdoulaye Baniré Diallo
    Vladimir Makarenkov
    BMC Bioinformatics, 16
  • [25] Spectral Images Browsing using Principal Component Analysis and Set Partitioning in Hierarchical Tree
    Ma, Long
    Zhao, Deping
    MIPPR 2011: REMOTE SENSING IMAGE PROCESSING, GEOGRAPHIC INFORMATION SYSTEMS, AND OTHER APPLICATIONS, 2011, 8006
  • [26] Towards a personalized health care using a divisive hierarchical clustering approach for comorbidity and the prediction of conditioned group risks
    Navarro-Cerdan, J. Ramon
    Sanchez-Gomis, Manuel
    Pons, Patricia
    Galvez-Settier, Santiago
    Valverde, Francisco
    Ferrer-Albero, Ana
    Sauri, Inmaculada
    Fernandez, Antonio
    Redon, Josep
    HEALTH INFORMATICS JOURNAL, 2023, 29 (04)
  • [27] Hierarchical and Non-hierarchical Medoid Clustering Using Asymmetric Similarity Measures
    Miyamoto, Sadaaki
    Kaizu, Yousuke
    Endo, Yasunori
    2016 JOINT 8TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS (SCIS) AND 17TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (ISIS), 2016, : 400 - 403
  • [28] Determination of cultural areas based on medieval pottery using an original divisive hierarchical clustering method with geographical constraint (MapClust)
    Bellanger, Lise
    Coulon, Arthur
    Husi, Philippe
    JOURNAL OF ARCHAEOLOGICAL SCIENCE, 2021, 132
  • [29] Regionalization of Precipitation Regimes in Iran Using Principal Component Analysis and Hierarchical Clustering Analysis
    Darand, Mohammad
    Daneshvar, Mohammad Reza Mansouri
    ENVIRONMENTAL PROCESSES-AN INTERNATIONAL JOURNAL, 2014, 1 (04): : 517 - 532
  • [30] Interpretation Method of Nonlinear Multilayer Principal Component Analysis by using Sparsity and Hierarchical Clustering
    Koda, Natsuki
    Watanabe, Sumio
    2016 15TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2016), 2016, : 1063 - 1066