Stability-based model order selection in clustering with applications to gene expression data

被引:0
|
作者
Roth, V [1 ]
Braun, ML [1 ]
Lange, T [1 ]
Buhmann, JM [1 ]
机构
[1] Univ Bonn, Inst Comp Sci, Dept 3, D-53117 Bonn, Germany
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The concept of cluster stability is introduced to assess the validity of data partitionings found by clustering algorithms. It allows us to explicitly quantify the quality of a clustering solution, without being dependent on external information. The principle of maximizing the cluster stability can be interpreted as choosing the most self-consistent data partitioning. We present an empirical estimator for the theoretically derived stability index, based on resampling. Experiments are conducted on well known gene expression data sets, re-analyzing the work by Alon et al. [1] and by Spellman et al. [8].
引用
收藏
页码:607 / 612
页数:6
相关论文
共 50 条
  • [31] Model order selection for bio-molecular data clustering
    Alberto Bertoni
    Giorgio Valentini
    BMC Bioinformatics, 8
  • [32] Model order selection for bio-molecular data clustering
    Bertoni, Alberto
    Valentini, Giorgio
    BMC BIOINFORMATICS, 2007, 8 (Suppl 2)
  • [33] Attribute clustering for grouping, selection, and classification of gene expression data
    Au, WH
    Chan, KCC
    Wong, AKC
    Wang, Y
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2005, 2 (02) : 83 - 101
  • [34] MODEL-BASED CLUSTERING WITH DATA CORRECTION FOR REMOVING ARTIFACTS IN GENE EXPRESSION DATA
    Young, William Chad
    Raftery, Adrian E.
    Yeung, Ka Yee
    ANNALS OF APPLIED STATISTICS, 2017, 11 (04): : 1998 - 2026
  • [35] Neighbor stability-based VANET clustering for urban vehicular environments
    Jung-Hyok Kwon
    Hyun Soo Chang
    Taeshik Shon
    Jai-Jin Jung
    Eui-Jik Kim
    The Journal of Supercomputing, 2016, 72 : 161 - 176
  • [36] A model for gene selection and classification of gene expression data
    Mohamad M.S.
    Omatu S.
    Deris S.
    Hashim S.Z.M.
    Artificial Life and Robotics, 2007, 11 (2) : 219 - 222
  • [37] Projection Based Clustering of Gene Expression Data
    Tasoulis, Sotiris K.
    Plagianakos, Vassilis P.
    Tasoulis, Dimitris K.
    COMPUTATIONAL INTELLIGENCE METHODS FOR BIOINFORMATICS AND BIOSTATISTICS, 2010, 6160 : 228 - +
  • [38] An Ensemble Filtering and Supervised Clustering based Informative Gene Selection Algorithm in Microarray Gene Expression Data
    Bose, Shilpi
    Das, Chandra
    Banerjee, Abhik
    Chattopadhyay, Matangini
    Chattopadhyay, Samiran
    2020 4TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND NETWORKS (CINE 2020), 2020,
  • [39] Learning mixed graphical models with separate sparsity parameters and stability-based model selection
    Sedgewick, Andrew J.
    Shi, Ivy
    Donovan, Rory M.
    Benos, Panayiotis V.
    BMC BIOINFORMATICS, 2016, 17
  • [40] A Bayesian hierarchical hidden Markov model for clustering and gene selection: Application to kidney cancer gene expression data
    Chekouo, Thierry
    Mukherjee, Himadri
    BIOMETRICAL JOURNAL, 2024, 66 (04)