NCBI GEO: archive for gene expression and epigenomics data sets: 23-year update

被引:81
|
作者
Clough, Emily [1 ]
Barrett, Tanya [1 ]
Wilhite, Stephen E. [1 ]
Ledoux, Pierre [1 ]
Evangelista, Carlos [1 ]
Kim, Irene F. [1 ]
Tomashevsky, Maxim [1 ]
Marshall, Kimberly A. [1 ]
Phillippy, Katherine H. [1 ]
Sherman, Patti M. [1 ]
Lee, Hyeseung [1 ]
Zhang, Naigong [1 ]
Serova, Nadezhda [1 ]
Wagner, Lukas [1 ]
Zalunin, Vadim [1 ]
Kochergin, Andrey [1 ]
Soboleva, Alexandra [1 ]
机构
[1] NIH, Natl Ctr Biotechnol Informat, Natl Lib Med, Bethesda, MD 20892 USA
关键词
RNA-SEQ; OMNIBUS; PRINCIPLES; CHROMATIN; MAPS;
D O I
10.1093/nar/gkad965
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The Gene Expression Omnibus (GEO) is an international public repository that archives gene expression and epigenomics data sets generated by next-generation sequencing and microarray technologies. Data are typically submitted to GEO by researchers in compliance with widespread journal and funder mandates to make generated data publicly accessible. The resource handles raw data files, processed data files and descriptive metadata for over 200 000 studies and 6.5 million samples, all of which are indexed, searchable and downloadable. Additionally, GEO offers web-based tools that facilitate analysis and visualization of differential gene expression. This article presents the current status and recent advancements in GEO, including the generation of consistently computed gene expression count matrices for thousands of RNA-seq studies, and new interactive graphical plots in GEO2R that help users identify differentially expressed genes and assess data set quality. The GEO repository is built and maintained by the National Center for Biotechnology Information (NCBI), a division of the National Library of Medicine (NLM), and is publicly accessible at https://www.ncbi.nlm.nih.gov/geo/. Graphical Abstract
引用
收藏
页码:D138 / D144
页数:7
相关论文
共 50 条
  • [21] Analyzing gene expression data in terms of gene sets:: methodological issues
    Goeman, Jelle J.
    Buehlmann, Peter
    BIOINFORMATICS, 2007, 23 (08) : 980 - 987
  • [22] ActiveSVM selects minimal gene sets from gene expression data
    Nature Computational Science, 2022, 2 : 420 - 421
  • [23] ActiveSVM selects minimal gene sets from gene expression data
    Chen, Xiaoqiao
    Thomson, Matt
    NATURE COMPUTATIONAL SCIENCE, 2022, 2 (07): : 420 - 421
  • [24] Efficient gene selection with rough sets from gene expression data
    Sun, Lijun
    Miao, Duoqian
    Zhang, Hongyun
    ROUGH SETS AND KNOWLEDGE TECHNOLOGY, 2008, 5009 : 164 - +
  • [25] Intrinsic bias in breast cancer gene expression data sets
    Mosley, Jonathan D.
    Keri, Ruth A.
    BMC CANCER, 2009, 9
  • [26] ErmineJ: Tool for functional analysis of gene expression data sets
    Homin K Lee
    William Braynen
    Kiran Keshav
    Paul Pavlidis
    BMC Bioinformatics, 6
  • [27] ErmineJ: Tool for functional analysis of gene expression data sets
    Lee, HK
    Braynen, W
    Keshav, K
    Pavlidis, P
    BMC BIOINFORMATICS, 2005, 6 (1)
  • [28] Intrinsic bias in breast cancer gene expression data sets
    Jonathan D Mosley
    Ruth A Keri
    BMC Cancer, 9
  • [29] Semi-automated clustering of gene expression data sets
    Kim, Minho
    Jung, Ho-Youl
    Chung, Myungguen
    Kim, Pora
    Park, Seon-Hee
    Park, Soo-Jun
    2007 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-16, 2007, : 4625 - 4628
  • [30] Evaluating the consistency of gene sets used in the analysis of bacterial gene expression data
    Nathan L Tintle
    Alexandra Sitarik
    Benjamin Boerema
    Kylie Young
    Aaron A Best
    Matthew DeJongh
    BMC Bioinformatics, 13