Complementary hierarchical clustering

被引:20
|
作者
Nowak, Gen [1 ]
Tibshirani, Robert [1 ,2 ]
机构
[1] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
[2] Stanford Univ, Dept Hlth Res & Policy, Stanford, CA 94305 USA
关键词
hierarchical clustering; microarray; principal components; relative gene importance;
D O I
10.1093/biostatistics/kxm046
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
When applying hierarchical clustering algorithms to cluster patient samples from microarray data, the clustering patterns generated by most algorithms tend to be dominated by groups of highly differentially expressed genes that have closely related expression patterns. Sometimes, these genes may not be relevant to the biological process under study or their functions may already be known. The problem is that these genes can potentially drown out the effects of other genes that are relevant or have novel functions. We propose a procedure called complementary hierarchical clustering that is designed to uncover the structures arising from these novel genes that are not as highly expressed. Simulation studies show that the procedure is effective when applied to a variety of examples. We also define a concept called relative gene importance that can be used to identify the influential genes in a given clustering. Finally, we analyze a microarray data set from 295 breast cancer patients, using clustering with the correlation-based distance measure. The complementary clustering reveals a grouping of the patients which is uncorrelated with a number of known prognostic signatures and significantly differing distant metastasis-free probabilities.
引用
收藏
页码:467 / 483
页数:17
相关论文
共 50 条
  • [1] Robust complementary hierarchical clustering for gene expression data analysis by β-divergence
    Badsha, Md. Bahadur
    Mollah, Md. Nurul Hague
    Jahan, Nusrat
    Kurata, Hiroyuki
    JOURNAL OF BIOSCIENCE AND BIOENGINEERING, 2013, 116 (03) : 397 - 407
  • [2] Hierarchical Clustering via Sketches and Hierarchical Correlation Clustering
    Vainstein, Danny
    Chatziafratis, Vaggos
    Citovsky, Gui
    Rajagopalan, Anand
    Mahdian, Mohammad
    Azar, Yossi
    24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130 : 559 - +
  • [3] Incremental Clustering for Hierarchical Clustering
    Narita, Kakeru
    Hochin, Teruhisa
    Nomiya, Hiroki
    2018 5TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE/ INTELLIGENCE AND APPLIED INFORMATICS (CSII 2018), 2018, : 102 - 107
  • [4] A new hierarchical multiple criteria ordered clustering approach as a complementary tool for sorting and ranking problems
    Diaz, Raymundo
    Fernandez, Eduardo
    Figueira, Jose-Rui
    Navarro, Jorge
    Solares, Efrain
    OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 2023, 117
  • [5] Affinity Clustering: Hierarchical Clustering at Scale
    Bateni, MohammadHossein
    Behnezhad, Soheil
    Derakhshan, Mahsa
    Hajiaghayi, MohammadTaghi
    Kiveris, Raimondas
    Lattanzi, Silvio
    Mirrokni, Vahab
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [6] Comparisons of clustering SSCI journals by emerging hierarchical clustering, hierarchical clustering and minimum spanning tree
    Chang, Yunfeng
    Zhao, Yuan
    Feng, Shengqin
    Proceedings - 2012 9th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2012, 2012, : 2898 - 2901
  • [7] Hierarchical spherical clustering
    Torra, V
    Miyamoto, S
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2002, 10 (02) : 157 - 172
  • [8] On validation of hierarchical clustering
    Mucha, Hans-Joachim
    ADVANCES IN DATA ANALYSIS, 2007, : 115 - 122
  • [9] TANGLES AND HIERARCHICAL CLUSTERING
    Fluck, Eva
    SIAM JOURNAL ON DISCRETE MATHEMATICS, 2024, 38 (01) : 75 - 92
  • [10] Robust Hierarchical Clustering
    Balcan, Maria-Florina
    Liang, Yingyu
    Gupta, Pramod
    JOURNAL OF MACHINE LEARNING RESEARCH, 2014, 15 : 3831 - 3871