Complementary hierarchical clustering

Cited by: 20
Authors
Nowak, Gen [1]
Tibshirani, Robert [1, 2]
Affiliations
[1] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
[2] Stanford Univ, Dept Hlth Res & Policy, Stanford, CA 94305 USA
Keywords
hierarchical clustering; microarray; principal components; relative gene importance
DOI
10.1093/biostatistics/kxm046
Chinese Library Classification number
Q [Biological sciences]
Discipline classification codes
07; 0710; 09
Abstract
When applying hierarchical clustering algorithms to cluster patient samples from microarray data, the clustering patterns generated by most algorithms tend to be dominated by groups of highly differentially expressed genes that have closely related expression patterns. Sometimes, these genes may not be relevant to the biological process under study or their functions may already be known. The problem is that these genes can potentially drown out the effects of other genes that are relevant or have novel functions. We propose a procedure called complementary hierarchical clustering that is designed to uncover the structures arising from these novel genes that are not as highly expressed. Simulation studies show that the procedure is effective when applied to a variety of examples. We also define a concept called relative gene importance that can be used to identify the influential genes in a given clustering. Finally, we analyze a microarray data set from 295 breast cancer patients, using clustering with the correlation-based distance measure. The complementary clustering reveals a grouping of the patients that is uncorrelated with a number of known prognostic signatures and that shows significantly different distant metastasis-free probabilities.
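
The idea in the abstract can be illustrated with a small sketch. The abstract does not say how the dominant structure is removed, so the code below assumes one simple option: subtract each gene's mean within each initial sample cluster, then cluster the residuals. The function names, the toy data, and the choice of two clusters with average linkage are all hypothetical and not taken from the paper; only the correlation-based distance is mentioned in the abstract.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster


def cluster_samples(X, k=2):
    """Average-linkage clustering of samples (columns) with correlation distance."""
    Z = linkage(X.T, method="average", metric="correlation")
    return fcluster(Z, t=k, criterion="maxclust")


def remove_cluster_structure(X, labels):
    """Assumed removal step: subtract each gene's mean within each sample cluster."""
    R = X.copy()
    for c in np.unique(labels):
        cols = labels == c
        R[:, cols] -= X[:, cols].mean(axis=1, keepdims=True)
    return R


# Hypothetical toy data: 40 genes (rows) x 20 samples (columns).
rng = np.random.default_rng(0)
X = rng.normal(scale=0.3, size=(40, 20))
strong = np.repeat([0, 1], 10)  # samples 0-9 vs samples 10-19
weak = np.tile([0, 1], 10)      # even-indexed vs odd-indexed samples
X[:20] += np.where(strong == 0, 3.0, -3.0)  # highly differentially expressed genes
X[20:] += np.where(weak == 0, 1.0, -1.0)    # weaker genes with a different grouping

initial = cluster_samples(X)
complementary = cluster_samples(remove_cluster_structure(X, initial))
```

On this toy data the initial clustering should follow the dominant genes, while the clustering of the residuals should recover the weaker even/odd grouping, which cuts across the first one. Real expression data would need a considered choice of cut height and linkage.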
Pages: 467-483
Page count: 17