Weighted rank aggregation of cluster validation measures: a Monte Carlo cross-entropy approach

被引:201
|
作者
Pihur, Vasyl [1 ]
Datta, Susmita [1 ]
Datta, Somnath [1 ]
机构
[1] Univ Louisville, Dept Bioinformat & Biostat, Louisville, KY 40202 USA
关键词
D O I
10.1093/bioinformatics/btm158
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Biologists often employ clustering techniques in the explorative phase of microarray data analysis to discover relevant biological groupings. Given the availability of numerous clustering algorithms in the machine-learning literature, an user might want to select one that performs the best for his/her data set or application. While various validation measures have been proposed over the years to judge the quality of clusters produced by a given clustering algorithm including their biological relevance, unfortunately, a given clustering algorithm can perform poorly under one validation measure while outperforming many other algorithms under another validation measure. A manual synthesis of results from multiple validation measures is nearly impossible in practice, especially, when a large number of clustering algorithms are to be compared using several measures. An automated and objective way of reconciling the rankings is needed. Results: Using a Monte Carlo cross-entropy algorithm, we successfully combine the ranks of a set of clustering algorithms under consideration via a weighted aggregation that optimizes a distance criterion. The proposed weighted rank aggregation allows for a far more objective and automated assessment of clustering results than a simple visual inspection. We illustrate our procedure using one simulated as well as three real gene expression data sets from various platforms where we rank a total of eleven clustering algorithms using a combined examination of 10 different validation measures. The aggregate rankings were found for a given number of clusters k and also for an entire range of k.
引用
收藏
页码:1607 / 1615
页数:9
相关论文
共 50 条
  • [21] Detecting influential observations by cluster analysis and Monte Carlo cross-validation
    Bian, Xihui
    Cai, Wensheng
    Shao, Xueguang
    Chen, Da
    Grant, Edward R.
    ANALYST, 2010, 135 (11) : 2841 - 2847
  • [22] Cross-Entropy Approach for Computing a Pareto Fronts
    Sebaa, Karim
    UKSIM-AMSS 15TH INTERNATIONAL CONFERENCE ON COMPUTER MODELLING AND SIMULATION (UKSIM 2013), 2013, : 61 - 66
  • [23] Multiobjective Optimization Using Cross-Entropy Approach
    Sebaa, Karim
    Tlemcani, Abdelhalim
    Bouhedda, Mounir
    Henini, Noureddine
    JOURNAL OF OPTIMIZATION, 2013, 2013
  • [24] A cross-entropy algorithm based on Quasi-Monte Carlo estimation and its application in hull form optimization
    Liu, Xin
    Zhang, Heng
    Liu, Qiang
    Dong, Suzhen
    Xiao, Changshi
    INTERNATIONAL JOURNAL OF NAVAL ARCHITECTURE AND OCEAN ENGINEERING, 2021, 13 : 115 - 125
  • [25] Cloud Service Reliability Assessment Approach based on Multi-valued Neutrosophic Cross-entropy and Entropy Measures
    Wang, Yi
    Wang, Xiao-kang
    Wang, Jian-qiang
    FILOMAT, 2018, 32 (08) : 2793 - 2812
  • [26] Novel Pythagorean fuzzy entropy and Pythagorean fuzzy cross-entropy measures and their applications
    Li, Longmei
    Zheng, Tingting
    Yin, Wenjing
    Wu, Qiuyue
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (06) : 6527 - 6546
  • [27] Novel Pythagorean fuzzy entropy and Pythagorean fuzzy cross-entropy measures and their applications
    Li, Longmei
    Zheng, Tingting
    Yin, Wenjing
    Wu, Qiuyue
    Journal of Intelligent and Fuzzy Systems, 2021, 41 (06): : 6527 - 6546
  • [28] Using the Cross-Entropy Method to Re-Rank Search Results
    Roitman, Haggai
    Hummel, Shay
    Kurland, Oren
    SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014, : 839 - 842
  • [29] Ascertainment of the number of samples in the validation set in Monte Carlo cross validation and the selection of model dimension with Monte Carlo cross validation
    Du, Yi Ping
    Kasemsumran, Surnaporn
    Maruo, Katsuhiko
    Nakagawa, Takehiro
    Ozaki, Yukihiro
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2006, 82 (1-2) : 83 - 89
  • [30] Monte-Carlo approximation of minimum entropy measures
    Jourdain, B
    Nguyen, L
    COMPTES RENDUS DE L ACADEMIE DES SCIENCES SERIE I-MATHEMATIQUE, 2001, 332 (04): : 345 - 350