The local maximum clustering method and its application in microarray gene expression data analysis

被引:18
|
作者
Wu, XW [1 ]
Chen, YD
Brooks, BR
Su, YA
机构
[1] Natl Heart Lung & Blood Inst, Lab Biophys Chem, NIH, Bethesda, MD 20892 USA
[2] Natl Human Genome Res Inst, NIH, Bethesda, MD 20892 USA
[3] Loyola Univ, Ctr Med, Dept Pathol, Maywood, IL 60153 USA
关键词
data cluster; clustering method; microarray; gene expression; classification; model data sets;
D O I
10.1155/S1110865704309145
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
An unsupervised data clustering method, called the local maximum clustering (LMC) method, is proposed for identifying clusters in experiment data sets based on research interest. A magnitude property is defined according to research purposes, and data sets are clustered around each local maximum of the magnitude property. By properly defining a magnitude property, this method can overcome many difficulties in microarray data clustering such as reduced projection in similarities, noises, and arbitrary gene distribution. To critically evaluate the performance of this clustering method in comparison with other methods, we designed three model data sets with known cluster distributions and applied the LMC method as well as the hierarchic clustering method, the K-mean clustering method, and the self-organized map method to these model data sets. The results show that the LMC method produces the most accurate clustering results. As an example of application, we applied the method to cluster the leukemia samples reported in the microarray study of Golub et al. (1999).
引用
收藏
页码:53 / 63
页数:11
相关论文
共 50 条
  • [41] PCGEN: A practical approach to projected clustering and its application to gene expression data
    Bouguessa, Mohamed
    Wang, Shengrui
    2007 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING, VOLS 1 AND 2, 2007, : 661 - 667
  • [42] Interrelated two-way clustering and its application on gene expression data
    Tang, C
    Zhang, AD
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2005, 14 (04) : 577 - 597
  • [43] Incorporating biological knowledge into distance-based clustering analysis of microarray gene expression data
    Huang, DS
    Pan, W
    BIOINFORMATICS, 2006, 22 (10) : 1259 - 1268
  • [44] Clustering microarray gene expression data using enhanced harmony search
    Pandi, M.
    Premalatha, K.
    INTERNATIONAL JOURNAL OF BIO-INSPIRED COMPUTATION, 2015, 7 (05) : 296 - 306
  • [45] Comparisons and validation of statistical clustering techniques for microarray gene expression data
    Datta, S
    Datta, S
    BIOINFORMATICS, 2003, 19 (04) : 459 - 466
  • [46] Local Modeling Classifier for Microarray Gene-Expression Data
    Porto-Diaz, Iago
    Bolon-Canedo, Veronica
    Alonso-Betanzos, Amparo
    Fontenla-Romero, Oscar
    ARTIFICIAL NEURAL NETWORKS (ICANN 2010), PT III, 2010, 6354 : 11 - 20
  • [47] Beyond Gene Clustering of Microarray Data
    Zheng, W. Jim
    Mao, Linyong
    Moussa, Omar
    Yordy, John S.
    Wang, Suiquan
    Kraft, Andrew S.
    Watson, Dennis K.
    IN VITRO CELLULAR & DEVELOPMENTAL BIOLOGY-ANIMAL, 2009, 45 : S3 - S3
  • [48] Gene Ontology Assisted Exploratory Microarray Clustering and Its Application to Cancer
    Macintyre, Geoff
    Bailey, James
    Gustafsson, Daniel
    Boussioutas, Alex
    Haviv, Izhak
    Kowalczyk, Adam
    PATTERN RECOGNITION IN BIOINFORMATICS, PROCEEDINGS, 2008, 5265 : 400 - +
  • [49] A moment-based method for estimating the proportion of true null hypotheses and its application to microarray gene expression data
    Lai, Yinglei
    BIOSTATISTICS, 2007, 8 (04) : 744 - 755
  • [50] Consensus clustering: A resampling-based method for class discovery and visualization of gene expression microarray data
    Monti, S
    Tamayo, P
    Mesirov, J
    Golub, T
    MACHINE LEARNING, 2003, 52 (1-2) : 91 - 118