Weighted rank aggregation of cluster validation measures: a Monte Carlo cross-entropy approach

被引:201
|
作者
Pihur, Vasyl [1 ]
Datta, Susmita [1 ]
Datta, Somnath [1 ]
机构
[1] Univ Louisville, Dept Bioinformat & Biostat, Louisville, KY 40202 USA
关键词
D O I
10.1093/bioinformatics/btm158
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Biologists often employ clustering techniques in the explorative phase of microarray data analysis to discover relevant biological groupings. Given the availability of numerous clustering algorithms in the machine-learning literature, an user might want to select one that performs the best for his/her data set or application. While various validation measures have been proposed over the years to judge the quality of clusters produced by a given clustering algorithm including their biological relevance, unfortunately, a given clustering algorithm can perform poorly under one validation measure while outperforming many other algorithms under another validation measure. A manual synthesis of results from multiple validation measures is nearly impossible in practice, especially, when a large number of clustering algorithms are to be compared using several measures. An automated and objective way of reconciling the rankings is needed. Results: Using a Monte Carlo cross-entropy algorithm, we successfully combine the ranks of a set of clustering algorithms under consideration via a weighted aggregation that optimizes a distance criterion. The proposed weighted rank aggregation allows for a far more objective and automated assessment of clustering results than a simple visual inspection. We illustrate our procedure using one simulated as well as three real gene expression data sets from various platforms where we rank a total of eleven clustering algorithms using a combined examination of 10 different validation measures. The aggregate rankings were found for a given number of clusters k and also for an entire range of k.
引用
收藏
页码:1607 / 1615
页数:9
相关论文
共 50 条
  • [41] Probabilistic linguistic decision-making based on the hybrid entropy and cross-entropy measures
    Bing Fang
    Fuzzy Optimization and Decision Making, 2023, 22 : 415 - 445
  • [42] MINIMUM CROSS-ENTROPY PATTERN-CLASSIFICATION AND CLUSTER-ANALYSIS
    SHORE, JE
    GRAY, RM
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1982, 4 (01) : 11 - 17
  • [43] Weighted Cross-entropy for Low-Resource Languages in Multilingual Speech Recognition
    Pineiro-Martin, Andres
    Garcia-Mateo, Carmen
    Docio-Fernandez, Laura
    Del Carmen Lopez-Perez, Maria
    Rehm, Georg
    INTERSPEECH 2024, 2024, : 1235 - 1239
  • [44] RE-WEIGHTED SOFTMAX CROSS-ENTROPY TO CONTROL FORGETTING IN FEDERATED LEARNING
    Legate, Gwen
    Caccia, Lucas
    Belilovsky, Eugene
    CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 232, 2023, 232 : 764 - 780
  • [45] Class Distance Weighted Cross-Entropy Loss for Ulcerative Colitis Severity Estimation
    Polat, Gorkem
    Ergenc, Ilkay
    Kani, Haluk Tarik
    Alahdab, Yesim Ozen
    Atug, Ozlen
    Temizel, Alptekin
    MEDICAL IMAGE UNDERSTANDING AND ANALYSIS, MIUA 2022, 2022, 13413 : 157 - 171
  • [46] Misclassification-guided loss under the weighted cross-entropy loss framework
    Wu, Yan-Xue
    Du, Kai
    Wang, Xian-Jie
    Min, Fan
    KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (08) : 4685 - 4720
  • [47] Novel hesitant fuzzy linguistic entropy and cross-entropy measures in multiple criteria decision making
    B. Farhadinia
    Zeshui Xu
    Applied Intelligence, 2018, 48 : 3915 - 3927
  • [48] Recognition of imbalanced underwater acoustic datasets with exponentially weighted cross-entropy loss
    Dong, Yafen
    Shen, Xiaohong
    Jiang, Zhe
    Wang, Haiyan
    APPLIED ACOUSTICS, 2021, 174
  • [49] Novel hesitant fuzzy linguistic entropy and cross-entropy measures in multiple criteria decision making
    Farhadinia, B.
    Xu, Zeshui
    APPLIED INTELLIGENCE, 2018, 48 (11) : 3915 - 3927
  • [50] A minimum cross-entropy approach to hidden Markov model adaptation
    Afify, M
    Gong, YF
    Haton, JP
    IEEE SIGNAL PROCESSING LETTERS, 1999, 6 (06) : 132 - 134