Comparison and calibration of transcriptome data from RNA-Seq and tiling arrays

被引:78
|
作者
Agarwal, Ashish [1 ]
Koppstein, David [1 ]
Rozowsky, Joel [1 ]
Sboner, Andrea [1 ]
Habegger, Lukas [1 ]
Hillier, LaDeana W. [3 ]
Sasidharan, Rajkumar [1 ]
Reinke, Valerie [4 ]
Waterston, Robert H. [3 ]
Gerstein, Mark [1 ,2 ]
机构
[1] Yale Univ, Dept Mol Biophys & Biochem, New Haven, CT 06520 USA
[2] Yale Univ, Program Computat Biol & Bioinformat, New Haven, CT 06520 USA
[3] Univ Washington, Sch Med, Dept Genome Sci, Seattle, WA 98195 USA
[4] Yale Univ, Sch Med, Dept Genet, New Haven, CT 06520 USA
来源
BMC GENOMICS | 2010年 / 11卷
关键词
GENE-EXPRESSION; EUKARYOTIC TRANSCRIPTOME; MICROARRAY; IDENTIFICATION; NORMALIZATION; ANNOTATION; PREDICTION;
D O I
10.1186/1471-2164-11-383
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Tiling arrays have been the tool of choice for probing an organism's transcriptome without prior assumptions about the transcribed regions, but RNA-Seq is becoming a viable alternative as the costs of sequencing continue to decrease. Understanding the relative merits of these technologies will help researchers select the appropriate technology for their needs. Results: Here, we compare these two platforms using a matched sample of poly(A)-enriched RNA isolated from the second larval stage of C. elegans. We find that the raw signals from these two technologies are reasonably well correlated but that RNA-Seq outperforms tiling arrays in several respects, notably in exon boundary detection and dynamic range of expression. By exploring the accuracy of sequencing as a function of depth of coverage, we found that about 4 million reads are required to match the sensitivity of two tiling array replicates. The effects of cross-hybridization were analyzed using a "nearest neighbor" classifier applied to array probes; we describe a method for determining potential "black list" regions whose signals are unreliable. Finally, we propose a strategy for using RNA-Seq data as a gold standard set to calibrate tiling array data. All tiling array and RNA-Seq data sets have been submitted to the modENCODE Data Coordinating Center. Conclusions: Tiling arrays effectively detect transcript expression levels at a low cost for many species while RNA-Seq provides greater accuracy in several regards. Researchers will need to carefully select the technology appropriate to the biological investigations they are undertaking. It will also be important to reconsider a comparison such as ours as sequencing technologies continue to evolve.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Identification of CNAs from RNA-Seq data
    Iwamoto, Eisuke
    Sanada, Masashi
    Yasuda, Takahiko
    CANCER SCIENCE, 2022, 113 : 1446 - 1446
  • [22] RNA-seq: An assessment of technical reproducibility and comparison with gene expression arrays
    Marioni, John C.
    Mason, Christopher E.
    Mane, Shrikant M.
    Stephens, Matthew
    Gilad, Yoav
    GENOME RESEARCH, 2008, 18 (09) : 1509 - 1517
  • [23] Comprehensive analysis of RNA-Seq data reveals extensive RNA editing in a human transcriptome
    Zhiyu Peng
    Yanbing Cheng
    Bertrand Chin-Ming Tan
    Lin Kang
    Zhijian Tian
    Yuankun Zhu
    Wenwei Zhang
    Yu Liang
    Xueda Hu
    Xuemei Tan
    Jing Guo
    Zirui Dong
    Yan Liang
    Li Bao
    Jun Wang
    Nature Biotechnology, 2012, 30 : 253 - 260
  • [24] Revealing of Mycobacterium marinum Transcriptome by RNA-seq
    Wang, Sen
    Dong, Xinran
    Zhu, Yongqiang
    Wang, Chuan
    Sun, Gang
    Luo, Tao
    Tian, Weidong
    Zheng, Huajun
    Gao, Qian
    PLOS ONE, 2013, 8 (09):
  • [25] Analysis of the Pain Transcriptome Using RNA-Seq
    Goswami, Samridhi
    Mishra, Santosh
    Hoon, Mark
    Mannes, Andrew
    Iadarola, Michael
    NEUROPSYCHOPHARMACOLOGY, 2013, 38 : S352 - S352
  • [26] Comparative profiling of the CHO transcriptome by RNA-Seq
    Jacob, Nitya M.
    Rao, Navneet
    Nissom, Peter M.
    Chin, Ju-Xin
    Yusufi, Faraaz
    Chuah, Song-Hui
    Retzel, Ernest F.
    Loo, Bernard L. W.
    Lee, Dong-Yup
    Karypis, George
    Yap, Miranda
    Hu, Wei-Shou
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2010, 239
  • [27] RNA-SEQ ANALYSIS OF THE HUMAN ALCOHOLIC TRANSCRIPTOME
    Farris, S. P.
    Harris, R. A.
    Mayfield, R. D.
    ALCOHOLISM-CLINICAL AND EXPERIMENTAL RESEARCH, 2013, 37 : 126A - 126A
  • [28] Identifiability of isoform deconvolution from junction arrays and RNA-Seq
    Hiller, David
    Jiang, Hui
    Xu, Weihong
    Wong, Wing Hung
    BIOINFORMATICS, 2009, 25 (23) : 3056 - 3059
  • [29] An empirical strategy to detect bacterial transcript structure from directional RNA-seq transcriptome data
    Wang, Yejun
    MacKenzie, Keith D.
    White, Aaron P.
    BMC GENOMICS, 2015, 16
  • [30] Full-length transcriptome assembly from RNA-Seq data without a reference genome
    Manfred G Grabherr
    Brian J Haas
    Moran Yassour
    Joshua Z Levin
    Dawn A Thompson
    Ido Amit
    Xian Adiconis
    Lin Fan
    Raktima Raychowdhury
    Qiandong Zeng
    Zehua Chen
    Evan Mauceli
    Nir Hacohen
    Andreas Gnirke
    Nicholas Rhind
    Federica di Palma
    Bruce W Birren
    Chad Nusbaum
    Kerstin Lindblad-Toh
    Nir Friedman
    Aviv Regev
    Nature Biotechnology, 2011, 29 : 644 - 652