Identification of metagenes and their Interactions through Large-scale Analysis of Arabidopsis Gene Expression Data

被引:9
|
作者
Wilson, Tyler J. [1 ]
Lai, Liming [1 ]
Ban, Yuguang [1 ]
Ge, Steven X. [1 ]
机构
[1] S Dakota State Univ, Dept Math & Stat, Brookings, SD 57007 USA
来源
BMC GENOMICS | 2012年 / 13卷
基金
美国国家卫生研究院;
关键词
BIOINFORMATICS; DISCOVERY; NETWORK; REVEALS; BIOLOGY; TOOLS; LISTS;
D O I
10.1186/1471-2164-13-237
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Many plant genes have been identified through whole genome and deep transcriptome sequencing and other methods; yet our knowledge on the function of many of these genes remains limited. The integration and analysis of large gene-expression datasets gives researchers the ability to formalize hypotheses concerning the functionality and interaction between different groups of correlated genes. Results: We applied the non-negative matrix factorization (NMF) algorithm to the AtGenExpress dataset which consists of 783 microarray samples (29 separate experimental series) conducted on the model plant Arabidopsis thaliana. We identified 15 metagenes, which are groups of genes with correlated expression. Functional roles of these metagenes are established by observing the enriched gene ontology (GO) categories using gene set enrichment analyses (GSEA). Activity levels of these metagenes in various experimental conditions are also analyzed to associate metagenes with stimuli/conditions. A metagene correlation network, constructed based on the results of NMF analysis, revealed many new interactions between the metagenes. Comparison of these metagenes with an earlier large-scale clustering analysis indicates many statistically significant overlaps. Conclusions: This study identifies a network of correlated metagenes composed of Arabidopsis genes acting in a highly correlated fashion across a broad spectrum of experimental stimuli, which may shed some light on the function of many of the un-annotated genes.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Automated Protocol for Large-Scale Modeling of Gene Expression Data
    Hall, Michelle Lynn
    Calkins, David
    Sherman, Woody
    Journal of Chemical Information and Modeling, 2016, 56 (11) : 2216 - 2224
  • [22] Discovery of Genes Essential for Heme Biosynthesis through Large-Scale Gene Expression Analysis
    Nilsson, Roland
    Schultz, Iman J.
    Pierce, Eric L.
    Soltis, Kathleen A.
    Naranuntarat, Amornrat
    Ward, Diane M.
    Baughman, Joshua M.
    Paradkar, Prasad N.
    Kingsley, Paul D.
    Culotta, Valeria C.
    Kaplan, Jerry
    Palis, James
    Paw, Barry H.
    Mootha, Vamsi K.
    CELL METABOLISM, 2009, 10 (02) : 119 - 130
  • [23] Latent network-based representations for large-scale gene expression data analysis
    Wajdi Dhifli
    Julia Puig
    Aurélien Dispot
    Mohamed Elati
    BMC Bioinformatics, 19
  • [24] Latent network-based representations for large-scale gene expression data analysis
    Dhifli, Wajdi
    Puig, Julia
    Dispot, Aurelien
    Elati, Mohamed
    BMC BIOINFORMATICS, 2019, 19 (Suppl 13)
  • [25] GEDI: a user-friendly toolbox for analysis of large-scale gene expression data
    André Fujita
    João R Sato
    Carlos E Ferreira
    Mari C Sogayar
    BMC Bioinformatics, 8
  • [26] GEDI: a user-friendly toolbox for analysis of large-scale gene expression data
    Fujita, Andre
    Sato, Joao R.
    Ferreira, Carlos E.
    Sogayar, Mari C.
    BMC BIOINFORMATICS, 2007, 8 (1)
  • [27] Data Analysis of Large-Scale Glycan-Sample Interactions
    Sese, Jun
    TRENDS IN GLYCOSCIENCE AND GLYCOTECHNOLOGY, 2012, 24 (137) : 122 - 128
  • [28] Cluster analysis of large scale gene expression data
    Erb, RS
    Michaels, GS
    DIMENSION REDUCTION, COMPUTATIONAL COMPLEXITY AND INFORMATION, 1998, 30 : 303 - 308
  • [29] GECKO: a complete large-scale gene expression analysis platform
    Joachim Theilhaber
    Anatoly Ulyanov
    Anish Malanthara
    Jack Cole
    Dapeng Xu
    Robert Nahf
    Michael Heuer
    Christoph Brockel
    Steven Bushnell
    BMC Bioinformatics, 5
  • [30] Large-scale CDNA microarray analysis of gene expression in epilepsy
    Wei, KC
    Wu, T
    Chang, CN
    Shin, JW
    EPILEPSIA, 2005, 46 : 198 - 198