Identifying cluster number for subspace projected functional data clustering

被引:23
|
作者
Li, Pai-Ling [1 ]
Chiou, Jeng-Min [2 ]
机构
[1] Tamkang Univ, New Taipei City 25137, Taiwan
[2] Acad Sinica, Taipei 11529, Taiwan
关键词
Bootstrapping; Cluster analysis; Functional data analysis; Functional principal components; Gene expression profiles; Hypothesis test; GENE-EXPRESSION; DATA SET; CLASSIFICATION; VALIDATION; ALGORITHM; MODEL;
D O I
10.1016/j.csda.2011.01.001
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We propose a new approach, the forward functional testing (FFT) procedure, to cluster number selection for functional data clustering. We present a framework of subspace projected functional data clustering based on the functional multiplicative random-effects model, and propose to perform functional hypothesis tests on equivalence of cluster structures to identify the number of clusters. The aim is to find the maximum number of distinctive clusters while retaining significant differences between cluster structures. The null hypotheses comprise equalities between the cluster mean functions and between the sets of cluster eigenfunctions of the covariance kernels. Bootstrap resampling methods are developed to construct reference distributions of the derived test statistics. We compare several other cluster number selection criteria, extended from methods of multivariate data, with the proposed FFT procedure. The performance of the proposed approaches is examined by simulation studies, with applications to clustering gene expression profiles. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:2090 / 2103
页数:14
相关论文
共 50 条
  • [1] Iterative projected clustering by subspace mining
    Yiu, ML
    Mamoulis, N
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (02) : 176 - 189
  • [2] Cluster Validation for Subspace Clustering on High Dimensional Data
    Chen, Lifei
    Jiang, Qingshan
    Wang, Shengrui
    2008 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS (APCCAS 2008), VOLS 1-4, 2008, : 225 - +
  • [3] Simultaneous Subspace Clustering and Cluster Number Estimating Based on Triplet Relationship
    Liang, Jie
    Yang, Jufeng
    Cheng, Ming-Ming
    Rosin, Paul L.
    Wang, Liang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (08) : 3973 - 3985
  • [4] Subspace and projected clustering: experimental evaluation and analysis
    Gabriela Moise
    Arthur Zimek
    Peer Kröger
    Hans-Peter Kriegel
    Jörg Sander
    Knowledge and Information Systems, 2009, 21 : 299 - 326
  • [5] Subspace and projected clustering: experimental evaluation and analysis
    Moise, Gabriela
    Zimek, Arthur
    Kroeger, Peer
    Kriegel, Hans-Peter
    Sander, Joerg
    KNOWLEDGE AND INFORMATION SYSTEMS, 2009, 21 (03) : 299 - 326
  • [6] Clustering of functional data in a low-dimensional subspace
    Michio Yamamoto
    Advances in Data Analysis and Classification, 2012, 6 : 219 - 247
  • [7] Clustering of functional data in a low-dimensional subspace
    Yamamoto, Michio
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2012, 6 (03) : 219 - 247
  • [8] Determination of cluster number in clustering microarray data
    Shen, JD
    Chang, SI
    Lee, ES
    Deng, YP
    Brown, SJ
    APPLIED MATHEMATICS AND COMPUTATION, 2005, 169 (02) : 1172 - 1185
  • [9] Subspace Clustering of Categorical and Numerical Data With an Unknown Number of Clusters
    Jia, Hong
    Cheung, Yiu-Ming
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (08) : 3308 - 3325
  • [10] Hierarchical Clustering of Projected Data Streams Using Cluster Validity Index
    Pardeshi, Bharat
    Toshniwal, Durga
    ADVANCES IN COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, PT I, 2011, 131 : 551 - 559