Gene Expression Data Analysis Using a Novel Approach to Biclustering Combining Discrete and Continuous Data

被引:7
|
作者
Christinat, Yann [1 ]
Wachmann, Bernd [2 ]
Zhang, Lei [2 ]
机构
[1] Ecole Polytech Fed Lausanne, Sch Comp & Commun Sci, Lab Computat Biol & Bioinformat, CH-1015 Lausanne, Switzerland
[2] Siemens Corp Res, Princeton, NJ 08540 USA
关键词
Data mining; biclustering algorithm; gene expression data; discrete data; simultaneous clustering; microarray analysis;
D O I
10.1109/TCBB.2007.70251
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Many different methods exist for pattern detection in gene expression data. In contrast to classical methods, biclustering has the ability to cluster a group of genes together with a group of conditions (replicates, set of patients, or drug compounds). However, since the problem is NP-complex, most algorithms use heuristic search functions and, therefore, might converge toward local maxima. By using the results of biclustering on discrete data as a starting point for a local search function on continuous data, our algorithm avoids the problem of heuristic initialization. Similar to Order-Preserving Submatrices (OPSM), our algorithm aims to detect biclusters whose rows and columns can be ordered such that row values are growing across the bicluster's columns and vice versa. Results have been generated on the yeast genome (Saccharomyces cerevisiae), a human cancer data set, and random data. Results on the yeast genome showed that 89 percent of the 100 biggest nonoverlapping biclusters were enriched with Gene Ontology annotations. A comparison with the methods OPSM and Iterative Signature Algorithm (ISA, a generalization of singular value decomposition) demonstrated a better efficiency when using gene and condition orders. We present results on random and real data sets that show the ability of our algorithm to capture statistically significant and biologically relevant biclusters.
引用
收藏
页码:583 / 593
页数:11
相关论文
共 50 条
  • [21] DNA microarray data analysis: A novel biclustering algorithm approach
    Tchagang, Alain B.
    Tewfik, Ahmed H.
    Eurasip Journal on Applied Signal Processing, 2006, 2006
  • [22] BICLUSTERING ANALYSIS OF GENE EXPRESSION DATA USING MULTI-OBJECTIVE EVOLUTIONARY ALGORITHMS
    Golchin, Maryam
    Davarpanah, Seyed Hashem
    Liew, Alan Wee-Chung
    PROCEEDINGS OF 2015 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOL. 2, 2015, : 505 - 510
  • [23] DNA microarray data analysis: A novel biclustering algorithm approach
    Tchagang, Alain B.
    Tewfik, Ahmed H.
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2006, 2006 (1)
  • [24] Biclustering of Linear Patterns In Gene Expression Data
    Gao, Qinghui
    Ho, Christine
    Jia, Yingmin
    Li, Jingyi Jessica
    Huang, Haiyan
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2012, 19 (06) : 619 - 631
  • [25] Evolutionary Biclustering Algorithm of Gene Expression Data
    Ayadi, Wassim
    Maatouk, Ons
    Bouziri, Hend
    2012 23RD INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA), 2012, : 206 - 210
  • [26] Rough overlapping biclustering of gene expression data
    Wang, Ruizhi
    Miao, Duoqian
    Li, Gang
    Zhang, Hongyun
    PROCEEDINGS OF THE 7TH IEEE INTERNATIONAL SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, VOLS I AND II, 2007, : 828 - 834
  • [27] Semi-possibilistic Biclustering Applied to Discrete and Continuous Data
    Mahfouz, Mohamed A.
    Ismail, Mohamed A.
    ADVANCED MACHINE LEARNING TECHNOLOGIES AND APPLICATIONS, 2012, 322 : 327 - 338
  • [28] Biclustering gene expression data in the presence of noise
    Abdullah, A
    Hussain, A
    ARTIFICIAL NEURAL NETWORKS: BIOLOGICAL INSPIRATIONS - ICANN 2005, PT 1, PROCEEDINGS, 2005, 3696 : 611 - 616
  • [29] An improved biclustering algorithm for gene expression data
    Jin, Sheng-Hua
    Hua, Li
    Open Cybernetics and Systemics Journal, 2014, 8 : 1141 - 1144
  • [30] An EA framework for biclustering of gene expression data
    Bleuler, S
    Preli, A
    Zitzler, E
    CEC2004: PROCEEDINGS OF THE 2004 CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1 AND 2, 2004, : 166 - 173