Detection Theory in Identification of RNA-DNA Sequence Differences Using RNA-Sequencing

被引:6
|
作者
Toung, Jonathan M. [1 ]
Lahens, Nicholas [1 ]
Hogenesch, John B. [2 ,3 ,5 ]
Grant, Gregory [2 ,3 ,4 ]
机构
[1] Univ Penn, Sch Med, Genom & Computat Biol Grad Program, Philadelphia, PA 19104 USA
[2] Univ Penn, Sch Med, Inst Biomed Informat, Philadelphia, PA 19104 USA
[3] Univ Penn, Sch Med, Inst Translat Med & Therapeut, Philadelphia, PA 19104 USA
[4] Univ Penn, Sch Med, Dept Genet, Philadelphia, PA 19104 USA
[5] Univ Penn, Sch Med, Dept Pharmacol, Philadelphia, PA 19104 USA
来源
PLOS ONE | 2014年 / 9卷 / 11期
基金
美国国家卫生研究院;
关键词
EDITING SITES; ALLELIC EXPRESSION; PARALLEL DNA; TRANSCRIPTOME; ALIGNMENT; ORIGIN;
D O I
10.1371/journal.pone.0112040
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Advances in sequencing technology have allowed for detailed analyses of the transcriptome at single-nucleotide resolution, facilitating the study of RNA editing or sequence differences between RNA and DNA genome-wide. In humans, two types of post-transcriptional RNA editing processes are known to occur: A-to-I deamination by ADAR and C-to-U deamination by APOBEC1. In addition to these sequence differences, researchers have reported the existence of all 12 types of RNA-DNA sequence differences (RDDs); however, the validity of these claims is debated, as many studies claim that technical artifacts account for the majority of these non-canonical sequence differences. In this study, we used a detection theory approach to evaluate the performance of RNA-Sequencing (RNA-Seq) and associated aligners in accurately identifying RNA-DNA sequence differences. By generating simulated RNA-Seq datasets containing RDDs, we assessed the effect of alignment artifacts and sequencing error on the sensitivity and false discovery rate of RDD detection. Overall, we found that even in the presence of sequencing errors, false negative and false discovery rates of RDD detection can be contained below 10% with relatively lenient thresholds. We also assessed the ability of various filters to target false positive RDDs and found them to be effective in discriminating between true and false positives. Lastly, we used the optimal thresholds we identified from our simulated analyses to identify RDDs in a human lymphoblastoid cell line. We found approximately 6,000 RDDs, the majority of which are A-to-G edits and likely to be mediated by ADAR. Moreover, we found the majority of non A-to-G RDDs to be associated with poorer alignments and conclude from these results that the evidence for widespread non-canonical RDDs in humans is weak. Overall, we found RNA-Seq to be a powerful technique for surveying RDDs genome-wide when coupled with the appropriate thresholds and filters.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Investigating the virome of crops and trees using RNA-sequencing
    Jones, S.
    MacFarlane, S.
    Cowan, G.
    Beaton, J.
    Mukoye, B.
    Mangeni, B. C.
    Iason, G.
    Were, H.
    Torrance, L.
    MOLECULAR PLANT-MICROBE INTERACTIONS, 2019, 32 (10) : 57 - 57
  • [32] Detection of RNA-DNA binding sites in long noncoding RNAs
    Kuo, Chao-Chung
    Haenzelmann, Sonja
    Cetin, Nevcin Sentuerk
    Frank, Stefan
    Zajzon, Barna
    Derks, Jens-Peter
    Akhade, Vijay Suresh
    Ahuja, Gaurav
    Kanduri, Chandrasekhar
    Grummt, Ingrid
    Kurian, Leo
    Costa, Ivan G.
    NUCLEIC ACIDS RESEARCH, 2019, 47 (06)
  • [33] Virus expression detection reveals RNA-sequencing contamination in TCGA
    Selitsky, Sara R.
    Marron, David
    Hollern, Daniel
    Mose, Lisle E.
    Hoadley, Katherine A.
    Jones, Corbin
    Parker, Joel S.
    Dittmer, Dirk P.
    Perou, Charles M.
    BMC GENOMICS, 2020, 21 (01)
  • [34] Virus expression detection reveals RNA-sequencing contamination in TCGA
    Sara R. Selitsky
    David Marron
    Daniel Hollern
    Lisle E. Mose
    Katherine A. Hoadley
    Corbin Jones
    Joel S. Parker
    Dirk P. Dittmer
    Charles M. Perou
    BMC Genomics, 21
  • [35] Identification of miRNA expression profile in middle ear cholesteatoma using small RNA-sequencing
    Xie, Mengyao
    Tang, Qi
    Wang, Shu
    Huang, Xiaowu
    Wu, Zhiyuan
    Han, Zhijin
    Li, Chen
    Wang, Bin
    Shang, Yingying
    Yang, Hua
    BMC MEDICAL GENOMICS, 2024, 17 (01)
  • [36] RNAmut: robust identification of somatic mutations in acute myeloid leukemia using RNA-sequencing
    Gu, Muxin
    Zwiebel, Maximillian
    Ong, Swee Hoe
    Boughton, Nick
    Nomdedeu, Josep
    Basheer, Faisal
    Nannya, Yasuhito
    Quiros, Pedro M.
    Ogawa, Seishi
    Cazzola, Mario
    Rad, Roland
    Butler, Adam P.
    Vijayabaskar, M. S.
    Vassiliou, George S.
    HAEMATOLOGICA, 2020, 105 (06) : E290 - E293
  • [37] Systematic investigation of insertional and deletional RNA-DNA differences in the human transcriptome
    Cai Chen
    Ralf Bundschuh
    BMC Genomics, 13
  • [38] FORMATION AND DETECTION OF RNA-DNA HYBRID MOLECULES IN CYTOLOGICAL PREPARATIONS
    GALL, JG
    PARDUE, ML
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1969, 63 (02) : 378 - &
  • [39] Reverse Transcription Errors and RNA-DNA Differences at Short Tandem Repeats
    Fungtammasan, Arkarachai
    Tomaszkiewicz, Marta
    Campos-Sanchez, Rebeca
    Eckert, Kristin A.
    DeGiorgio, Michael
    Makova, Kateryna D.
    MOLECULAR BIOLOGY AND EVOLUTION, 2016, 33 (10) : 2744 - 2758
  • [40] Efficient experimental design and analysis strategies for the detection of differential expression using RNA-Sequencing
    José A Robles
    Sumaira E Qureshi
    Stuart J Stephen
    Susan R Wilson
    Conrad J Burden
    Jennifer M Taylor
    BMC Genomics, 13