Finding Correlated Biclusters from Gene Expression Data

被引:33
|
作者
Yang, Wen-Hui [1 ,2 ]
Dai, Dao-Qing [1 ,2 ]
Yan, Hong [3 ,4 ]
机构
[1] Sun Yat Sen Zhongshan Univ, Ctr Comp Vis, Guangzhou 510275, Guangdong, Peoples R China
[2] Sun Yat Sen Zhongshan Univ, Dept Math, Fac Math & Comp, Guangzhou 510275, Guangdong, Peoples R China
[3] City Univ Hong Kong, Dept Elect Engn, Kowloon, Hong Kong, Peoples R China
[4] Univ Sydney, Sch Elect & Informat Engn, Sydney, NSW 2006, Australia
关键词
Biclustering; pattern classification; gene expression data; singular-value decomposition; data mining; biology computing; SINGULAR-VALUE DECOMPOSITION; MICROARRAY DATA; DISCRIMINANT-ANALYSIS; CLUSTER-ANALYSIS; PATTERNS; MODELS;
D O I
10.1109/TKDE.2010.150
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Extracting biologically relevant information from DNA microarrays is a very important task for drug development and test, function annotation, and cancer diagnosis. Various clustering methods have been proposed for the analysis of gene expression data, but when analyzing the large and heterogeneous collections of gene expression data, conventional clustering algorithms often cannot produce a satisfactory solution. Biclustering algorithm has been presented as an alternative approach to standard clustering techniques to identify local structures from gene expression data set. These patterns may provide clues about the main biological processes associated with different physiological states. In this paper, different from existing bicluster patterns, we first introduce a more general pattern: correlated bicluster, which has intuitive biological interpretation. Then, we propose a novel transform technique based on singular value decomposition so that identifying correlated-bicluster problem from gene expression matrix is transformed into two global clustering problems. The Mixed-Clustering algorithm and the Lift algorithm are devised to efficiently produce delta-corBiclusters. The biclusters obtained using our method from gene expression data sets of multiple human organs and the yeast Saccharomyces cerevisiae demonstrate clear biological meanings.
引用
收藏
页码:568 / 584
页数:17
相关论文
共 50 条
  • [21] Quality Measures for Gene Expression Biclusters
    Pontes, Beatriz
    Girldez, Ral
    Aguilar-Ruiz, Jess S.
    PLOS ONE, 2015, 10 (03):
  • [22] Discovering Non-Redundant Overlapping Biclusters on Gene Expression Data
    Duy Tin Truong
    Battiti, Roberto
    Brunato, Mauro
    2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2013, : 747 - 756
  • [23] Evolving Coherent and Non-trivial Biclusters from Gene Expression Data: An Evolutionary Approach
    Mukhopadhyay, Anirban
    Maulik, Ujjwal
    Bandyopadhyay, Sanghamitra
    2008 IEEE REGION 10 CONFERENCE: TENCON 2008, VOLS 1-4, 2008, : 2471 - +
  • [24] An Evolutionary Algorithm for Discovering Biclusters in Gene Expression Data of Breast Cancer
    Huang, Qinghua
    Lu, Minhua
    Yan, Hong
    2008 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-8, 2008, : 829 - +
  • [25] Mining Negative Correlation Biclusters from Gene Expression Data using Generic Association Rules
    Houari, Amina
    Ayadi, Wassim
    Ben Yahia, Sadok
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS, 2017, 112 : 278 - 287
  • [26] Noise-robust algorithm for identifying functionally associated biclusters from gene expression data
    Ahn, Jaegyoon
    Yoon, Youngmi
    Park, Sanghyun
    INFORMATION SCIENCES, 2011, 181 (03) : 435 - 449
  • [27] A new FCA-based method for identifying biclusters in gene expression data
    Houari, Amina
    Ayadi, Wassim
    Ben Yahia, Sadok
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2018, 9 (11) : 1879 - 1893
  • [28] Discovering Biclusters by Iteratively Sorting with Weighted Correlation Coefficient in Gene Expression Data
    Li Teng
    Laiwan Chan
    Journal of Signal Processing Systems, 2008, 50 : 267 - 280
  • [29] Discovering biclusters by iteratively sorting with weighted correlation coefficient in gene expression data
    Teng, Li
    Chan, Laiwan
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2008, 50 (03): : 267 - 280
  • [30] Multi-objective Optimization Approach to find Biclusters in Gene Expression Data
    Dale, Jeffrey
    Zhao, Junya
    Obafemi-Ajayi, Tayo
    2019 16TH IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY - CIBCB 2019, 2019, : 77 - 84