Judging the quality of gene expression-based clustering methods using gene annotation

被引:220
|
作者
Gibbons, FD [1 ]
Roth, FP [1 ]
机构
[1] Harvard Univ, Sch Med, Dept Biol Chem & Mol Pharmacol, Boston, MA 02115 USA
关键词
D O I
10.1101/gr.397002
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We compare several commonly used expression-based gene clustering algorithms using a figure of merit based on the mutual information between cluster membership and known gene attributes. By studying various publicly available expression data sets we conclude that enrichment of clusters for biological function is, in general, highest at rather low cluster numbers. As a measure of dissimilarity between the expression patterns of two genes, no method outperforms Euclidean distance for ratio-based measurements, or Pearson distance for non-ratio-based measurements at the optimal choice of cluster number. We show the self-organized-map approach to be best for both measurement types at higher numbers of clusters. Clusters of genes derived from single- and average-linkage hierarchical clustering tend to produce worse-than-random results.
引用
收藏
页码:1574 / 1581
页数:8
相关论文
共 50 条
  • [1] Assisted gene expression-based clustering with AWNCut
    Li, Yang
    Bie, Ruofan
    Hidalgo, Sebastian J. Teran
    Qin, Yichen
    Wu, Mengyun
    Ma, Shuangge
    STATISTICS IN MEDICINE, 2018, 37 (29) : 4386 - 4403
  • [2] Multi-View Gene Clustering using Gene Ontology and Expression-based Similarities
    Giri, Swagarika Jaharlal
    Saha, Sriparna
    2020 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2020,
  • [3] Validation and functional annotation of expression-based clusters based on gene ontology
    Steuer, Ralf
    Humburg, Peter
    Selbig, Joachim
    BMC BIOINFORMATICS, 2006, 7 (1)
  • [4] Validation and functional annotation of expression-based clusters based on gene ontology
    Ralf Steuer
    Peter Humburg
    Joachim Selbig
    BMC Bioinformatics, 7
  • [5] eMBI: Boosting Gene Expression-based Clustering for Cancer Subtypes
    Chang, Zheng
    Wang, Zhenjia
    Ashby, Cody
    Zhou, Chuan
    Li, Guojun
    Zhang, Shuzhong
    Huang, Xiuzhen
    CANCER INFORMATICS, 2014, 13 : 105 - 112
  • [6] Computational methods for gene expression-based tumor classification
    Xiong, MM
    Jin, L
    Li, WJ
    Boerwinkle, E
    BIOTECHNIQUES, 2000, 29 (06) : 1264 - +
  • [7] Gene expression-based approaches to beef quality research
    Lehnert, SA
    Wang, YH
    Tan, SH
    Reverter, A
    AUSTRALIAN JOURNAL OF EXPERIMENTAL AGRICULTURE, 2006, 46 (02): : 165 - 172
  • [8] Feature (gene) selection in gene expression-based tumor classification
    Xiong, MM
    Li, WJ
    Zhao, JY
    Jin, L
    Boerwinkle, E
    MOLECULAR GENETICS AND METABOLISM, 2001, 73 (03) : 239 - 247
  • [9] Accurate Gene Expression-Based Biodosimetry Using a Minimal Set of Human Gene Transcripts
    Tucker, James D.
    Joiner, Michael C.
    Thomas, Robert A.
    Grever, William E.
    Bakhmutsky, Marina V.
    Chinkhota, Chantelle N.
    Smolinski, Joseph M.
    Divine, George W.
    Auner, Gregory W.
    INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2014, 88 (04): : 933 - 939
  • [10] Gene Expression Data clustering using Unsupervised Methods
    Chandrasekhar, T.
    Thangavel, K.
    Elayaraja, E.
    2011 THIRD INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (ICOAC), 2011, : 146 - 150