iBBiG: iterative binary bi-clustering of gene sets

被引:35
|
作者
Gusenleitner, Daniel [1 ]
Howe, Eleanor A. [1 ,2 ]
Bentink, Stefan [1 ,3 ]
Quackenbush, John [1 ,3 ,4 ]
Culhane, Aedin C. [1 ,3 ]
机构
[1] Dana Farber Canc Inst, Dept Biostat & Computat Biol, Boston, MA 02115 USA
[2] Univ Oxford, Dept Stat, Oxford OX1 3TG, England
[3] Harvard Univ, Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
[4] Dana Farber Canc Inst, Dept Canc Biol, Boston, MA 02115 USA
基金
美国国家卫生研究院;
关键词
ENRICHMENT ANALYSIS; BIOLOGICAL PROCESSES; MICROARRAY DATA; EXPRESSION DATA; DISEASES; CCL5;
D O I
10.1093/bioinformatics/bts438
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Meta-analysis of genomics data seeks to identify genes associated with a biological phenotype across multiple datasets; however, merging data from different platforms by their features (genes) is challenging. Meta-analysis using functionally or biologically characterized gene sets simplifies data integration is biologically intuitive and is seen as having great potential, but is an emerging field with few established statistical methods. Results: We transform gene expression profiles into binary gene set profiles by discretizing results of gene set enrichment analyses and apply a new iterative bi-clustering algorithm (iBBiG) to identify groups of gene sets that are coordinately associated with groups of phenotypes across multiple studies. iBBiG is optimized for meta-analysis of large numbers of diverse genomics data that may have unmatched samples. It does not require prior knowledge of the number or size of clusters. When applied to simulated data, it outperforms commonly used clustering methods, discovers overlapping clusters of diverse sizes and is robust in the presence of noise. We apply it to meta-analysis of breast cancer studies, where iBBiG extracted novel gene set-phenotype association that predicted tumor metastases within tumor subtypes.
引用
收藏
页码:2484 / 2492
页数:9
相关论文
共 50 条
  • [41] A collaborative filtering recommendation algorithm based on information theory and bi-clustering
    Mingyang Jiang
    Zhifeng Zhang
    Jingqing Jiang
    Qinghu Wang
    Zhili Pei
    Neural Computing and Applications, 2019, 31 : 8279 - 8287
  • [42] A Principal Component Analysis Based Microarray Data Bi-clustering Method
    Zhang Yanpei
    Prinet, Veronique
    Wu Shuanhu
    PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS, VOLS 1-4, 2009, : 500 - +
  • [43] An approximation polynomial-time algorithm for a sequence bi-clustering problem
    Kel'manov, A. V.
    Khamidullin, S. A.
    COMPUTATIONAL MATHEMATICS AND MATHEMATICAL PHYSICS, 2015, 55 (06) : 1068 - 1076
  • [44] A collaborative filtering recommendation algorithm based on information theory and bi-clustering
    Jiang, Mingyang
    Zhang, Zhifeng
    Jiang, Jingqing
    Wang, Qinghu
    Pei, Zhili
    NEURAL COMPUTING & APPLICATIONS, 2019, 31 (12): : 8279 - 8287
  • [45] A personalized reinforcement learning recommendation algorithm using bi-clustering techniques
    Waqar, Muhammad
    Ayub, Mubbashir
    PLOS ONE, 2025, 20 (02):
  • [46] A Multi-Level Iterative Bi-Clustering Method for Discovering miRNA Co-regulation Network of Abiotic Stress Tolerance in Soybeans
    Chang, Haowu
    Zhang, Hao
    Zhang, Tianyue
    Su, Lingtao
    Qin, Qing-Ming
    Li, Guihua
    Li, Xueqing
    Wang, Li
    Zhao, Tianheng
    Zhao, Enshuang
    Zhao, Hengyi
    Liu, Yuanning
    Stacey, Gary
    Xu, Dong
    FRONTIERS IN PLANT SCIENCE, 2022, 13
  • [47] Kpax3: Bayesian bi-clustering of large sequence datasets
    Pessia, Alberto
    Corander, Jukka
    BIOINFORMATICS, 2018, 34 (12) : 2132 - 2133
  • [48] Iterative factor clustering of binary data
    Alfonso Iodice D’Enza
    Francesco Palumbo
    Computational Statistics, 2013, 28 : 789 - 807
  • [49] A Novel Hybrid Approach for Multi-Objective Bi-Clustering in Microarray Data
    Trivedi N.
    Kanungo S.
    Recent Advances in Computer Science and Communications, 2021, 14 (8 2563) : 2578
  • [50] Iterative factor clustering of binary data
    D'Enza, Alfonso Iodice
    Palumbo, Francesco
    COMPUTATIONAL STATISTICS, 2013, 28 (02) : 789 - 807