Statistical identification of biclusters in gene expression data

被引:0
|
作者
Chakraborty, A [1 ]
机构
[1] Indian Inst Technol, Dept Comp Sci & Engn, Kharagpur 721302, W Bengal, India
关键词
gene expression data; kmeans clustering; biclustering of expression data; p-value;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A bicluster of a gene expression dataset is a subset of genes which exhibit similar expression patterns along a subset of conditions. Given a gene expression matrix, we search for submatrices that are tightly coregulated according to some scoring criterion. We do not require the identified submatrices to be disjoint or to cover the entire matrix; instead we wish to build a diverse collection of submatrices that will capture all the significant signals in gene expression data. We believe that the size of the bicluster should be small compared to the size of the gene expression data matrix. So our approach finds biclusters by starting from small tightly co-regulated submatrices and adding more rows and columns to them. Our algorithm has three steps. First, we generate a set of high quality bicluster seeds based on a partition based clustering technique. In the second phase, these bicluster seeds are enlarged by adding more genes and conditions. In the third phase, we find the p-values of the biclusters produced for statistical validation.
引用
收藏
页码:1185 / 1190
页数:6
相关论文
共 50 条
  • [1] Discovering significant biclusters in gene expression data
    Zalik, Krista Rizman
    WSEAS Transactions on Information Science and Applications, 2005, 2 (09): : 1454 - 1461
  • [2] Mining deterministic biclusters in gene expression data
    Zhang, ZH
    Teo, A
    Ooi, BC
    Tan, KL
    BIBE 2004: FOURTH IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, PROCEEDINGS, 2004, : 283 - 290
  • [3] Constructing gene network based on biclusters of expression data
    Liu, F.
    Yang, L.
    Tian, Z. Z.
    Wu, P.
    Sun, S. L.
    GENETICS AND MOLECULAR RESEARCH, 2016, 15 (02)
  • [4] Exhaustive search of maximal biclusters in gene expression data
    Okada, Yoshifumi
    Fujibuchi, Wataru
    Horton, Paul
    IMECS 2007: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2007, : 307 - +
  • [5] Finding Correlated Biclusters from Gene Expression Data
    Yang, Wen-Hui
    Dai, Dao-Qing
    Yan, Hong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (04) : 568 - 584
  • [6] Extraction of Optimal Biclusters from Gene Expression Data
    Bagyamani, J.
    Thangavel, K.
    Rathipriya, R.
    INFORMATION AND COMMUNICATION TECHNOLOGIES, 2010, 101 : 380 - +
  • [7] Mining Functional Biclusters of DNA Microarray Gene Expression Data
    Zhao, Hongya
    Huang, Qing-Hua
    Chan, Kwok Leung
    Cheng, Lee-Ming
    Yan, Hong
    2008 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), VOLS 1-6, 2008, : 1736 - 1741
  • [8] Parallelized Evolutionary Learning for Detection of Biclusters in Gene Expression Data
    Huang, Qinghua
    Tao, Dacheng
    Li, Xuelong
    Liew, Alan Wee-Chung
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2012, 9 (02) : 560 - 570
  • [9] An evaluation study of biclusters visualization techniques of gene expression data
    Aouabed, Haithem
    Elloumi, Mourad
    Santamaria, Rodrigo
    JOURNAL OF INTEGRATIVE BIOINFORMATICS, 2021, 18 (04)
  • [10] Finding k-Biclusters from Gene Expression Data
    Xu, Xiaohua
    He, Ping
    Lu, Lin
    Xi, Yanqiu
    Pan, Zhoujin
    INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, ICIC 2012, 2012, 7390 : 433 - 439