Stability-based model order selection in clustering with applications to gene expression data

被引:0
|
作者
Roth, V [1 ]
Braun, ML [1 ]
Lange, T [1 ]
Buhmann, JM [1 ]
机构
[1] Univ Bonn, Inst Comp Sci, Dept 3, D-53117 Bonn, Germany
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The concept of cluster stability is introduced to assess the validity of data partitionings found by clustering algorithms. It allows us to explicitly quantify the quality of a clustering solution, without being dependent on external information. The principle of maximizing the cluster stability can be interpreted as choosing the most self-consistent data partitioning. We present an empirical estimator for the theoretically derived stability index, based on resampling. Experiments are conducted on well known gene expression data sets, re-analyzing the work by Alon et al. [1] and by Spellman et al. [8].
引用
收藏
页码:607 / 612
页数:6
相关论文
共 50 条
  • [1] Hierarchical Stability-Based Model Selection For Clustering Algorithms
    Yin, Bing
    Hamerly, Greg
    EIGHTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2009, : 217 - 222
  • [2] Clustering stability-based feature selection for unsupervised texture classification
    Klepaczko, Artur
    Materka, Andrzej
    Machine Graphics and Vision, 2009, 18 (02): : 125 - 141
  • [3] A model selection criterion for model-based clustering of annotated gene expression data
    Gallopin, Melina
    Celeux, Gilles
    Jaffrezic, Florence
    Rau, Andrea
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2015, 14 (05) : 413 - 428
  • [4] Algorithmic paradigms for stability-based cluster validity and model selection statistical methods, with applications to microarray data analysis
    Giancarlo, R.
    Utro, F.
    THEORETICAL COMPUTER SCIENCE, 2012, 428 : 58 - 79
  • [5] Stability-based biomarker selection
    Wehrens, Ron
    Franceschi, Pietro
    Vrhovsek, Urska
    Mattivi, Fulvio
    ANALYTICA CHIMICA ACTA, 2011, 705 (1-2) : 15 - 23
  • [6] Stability-based validation of clustering solutions
    Lange, T
    Roth, V
    Braun, ML
    Buhmann, JM
    NEURAL COMPUTATION, 2004, 16 (06) : 1299 - 1323
  • [7] Gene Selection for Cancer Clustering Analysis Based on Expression Data
    Xu, Taosheng
    Su, Ning
    Wang, Rujing
    Song, Liangtu
    PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 516 - 519
  • [8] PSO Based Feature Selection for Clustering Gene Expression Data
    Deepthi, P. S.
    Thampi, Sabu M.
    2015 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, INFORMATICS, COMMUNICATION AND ENERGY SYSTEMS (SPICES), 2015,
  • [9] A kernel-based clustering method for gene selection with gene expression data
    Chen, Huihui
    Zhang, Yusen
    Gutman, Ivan
    JOURNAL OF BIOMEDICAL INFORMATICS, 2016, 62 : 12 - 20
  • [10] Model-based clustering and data transformations for gene expression data
    Yeung, KY
    Fraley, C
    Murua, A
    Raftery, AE
    Ruzzo, WL
    BIOINFORMATICS, 2001, 17 (10) : 977 - 987