Optimized leaf ordering with class labels for hierarchical clustering

Cited by: 3
Authors
Novoselova, Natalia [1]
Wang, Junxi [2]
Klawonn, Frank [2, 3]
Affiliations
[1] United Inst Informat Problems, Dept Bioinformat, Surganova Str 6, Minsk 220012, BELARUS
[2] Helmholtz Ctr Infect Res, Biostat, D-38124 Braunschweig, Germany
[3] Ostfalia Univ Appl Sci, Dept Comp Sci, D-38302 Wolfenbuttel, Germany
Keywords
Hierarchical clustering; dendrogram; leaf ordering; dynamic programming; biomedical data
DOI
10.1142/S0219720015500122
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Hierarchical clustering is extensively used in the bioinformatics community to analyze biomedical data. These data are often tagged with class labels, such as disease subtypes or gene ontology (GO) terms. Heatmaps combined with dendrograms are the common standard for visualizing the results of hierarchical clustering. The heatmap can be enriched by an additional color bar at the side that indicates, for each instance in the data set, the class to which it belongs. In the ideal case, when the clustering matches the classes perfectly, one would expect instances from the same class to cluster together and the color bar to consist of well-separated color blocks without frequent alternation of colors (classes). But even when instances from the same class cluster perfectly together, the dendrogram might not reflect this important aspect, because its representation is not unique. In this paper, we propose a leaf ordering algorithm for the dendrogram that, while preserving the hierarchical clustering result, tries to group instances from the same class together. It is based on dynamic programming, which can efficiently compute the optimal or nearly optimal order consistent with the structure of the tree.
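The non-uniqueness mentioned in the abstract comes from the fact that the two subtrees under any internal node of a dendrogram can be swapped without changing the clustering. The following is a minimal sketch of a dynamic program over such swaps, not the authors' algorithm: it assumes a binary tree encoded as nested tuples, assumes the objective is simply the number of class changes between neighboring leaves, and uses the hypothetical names `order_leaves` and `label`.

```python
def order_leaves(tree, label):
    """Return (changes, leaf_order) with the fewest class changes
    between neighboring leaves, over all flips of internal nodes.

    tree  : a leaf id, or a 2-tuple (left_subtree, right_subtree)
    label : dict mapping each leaf id to its class label
    """
    def solve(node):
        # Leaf: one ordering, no adjacent pairs, same class at both ends.
        if not isinstance(node, tuple):
            c = label[node]
            return {(c, c): (0, [node])}
        left, right = solve(node[0]), solve(node[1])
        # best[(l, r)] = cheapest ordering of this subtree whose first
        # leaf has class l and whose last leaf has class r.
        best = {}
        # Try both orientations of this internal node.
        for first, second in ((left, right), (right, left)):
            for (l, m1), (cost1, ord1) in first.items():
                for (m2, r), (cost2, ord2) in second.items():
                    # One extra change if classes differ at the junction.
                    cost = cost1 + cost2 + (m1 != m2)
                    if (l, r) not in best or cost < best[(l, r)][0]:
                        best[(l, r)] = (cost, ord1 + ord2)
        return best

    table = solve(tree)
    return min(table.values(), key=lambda entry: entry[0])


# Hypothetical toy data: four samples, two classes.
tree = (("a", "b"), ("c", "d"))
label = {"a": 1, "b": 2, "c": 1, "d": 2}
changes, order = order_leaves(tree, label)
print(changes, order)
```

Keeping one table entry per pair of boundary classes, rather than per pair of boundary leaves, keeps the state small when there are few classes; a practical implementation would likely also take the pairwise distances between leaves into account.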
Pages: 19