Hierarchical analysis of RNA-seq reads improves the accuracy of allele-specific expression

被引:52
|
作者
Raghupathy, Narayanan [1 ]
Choi, Kwangbom [1 ]
Vincent, Matthew J. [1 ]
Beane, Glen L. [1 ]
Sheppard, Keith S. [1 ]
Munger, Steven C. [1 ]
Korstanje, Ron [1 ]
Pardo-Manual de Villena, Fernando [2 ]
Churchill, Gary A. [1 ]
机构
[1] Jackson Lab, 600 Main St, Bar Harbor, ME 04609 USA
[2] Univ N Carolina, Dept Genet, Chapel Hill, NC 27514 USA
关键词
ALIGNMENT;
D O I
10.1093/bioinformatics/bty078
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Allele-specific expression (ASE) refers to the differential abundance of the allelic copies of a transcript. RNA sequencing (RNA-seq) can provide quantitative estimates of ASE for genes with transcribed polymorphisms. When short-read sequences are aligned to a diploid transcriptome, read-mapping ambiguities confound our ability to directly count reads. Multi-mapping reads aligning equally well to multiple genomic locations, isoforms or alleles can comprise the majority (>85%) of reads. Discarding them can result in biases and substantial loss of information. Methods have been developed that use weighted allocation of read counts but these methods treat the different types of multi-reads equivalently. We propose a hierarchical approach to allocation of read counts that first resolves ambiguities among genes, then among isoforms, and lastly between alleles. We have implemented our model in EMASE software (Expectation-Maximization for Allele Specific Expression) to estimate total gene expression, isoform usage and ASE based on this hierarchical allocation. Results: Methods that align RNA-seq reads to a diploid transcriptome incorporating known genetic variants improve estimates of ASE and total gene expression compared to methods that use reference genome alignments. Weighted allocation methods outperform methods that discard multi-reads. Hierarchical allocation of reads improves estimation of ASE even when data are simulated from a non-hierarchical model. Analysis of RNA-seq data from F1 hybrid mice using EMASE reveals widespread ASE associated with cis-acting polymorphisms and a small number of parent-of-origin effects.
引用
收藏
页码:2177 / 2184
页数:8
相关论文
共 50 条
  • [1] Fully Bayesian analysis of allele-specific RNA-seq data
    Alvarez-Castro, Ignacio
    Niemi, Jarad
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2019, 16 (06) : 7751 - 7770
  • [2] RNA-Seq Reveals Allele-Specific Expression of Somatic Mutations in Neuroblastoma
    Sun, Lan
    Liu, Qian
    Tu, Lingli
    Li, Xiaoqing
    Wang, Jingyi
    Wang, Kai
    Zhong, Jiang F.
    MOLECULAR THERAPY, 2021, 29 (04) : 194 - 195
  • [3] Targeted RNA-seq improves efficiency, resolution, and accuracy of allele specific expression for human term placentas
    Wu, Weisheng
    Lovett, Jennie L.
    Shedden, Kerby
    Strassmann, Beverly, I
    Vincenz, Claudius
    G3-GENES GENOMES GENETICS, 2021, 11 (08):
  • [4] Recommendations for Accurate Resolution of Gene and Isoform Allele-Specific Expression in RNA-Seq Data
    Wood, David L. A.
    Nones, Katia
    Steptoe, Anita
    Christ, Angelika
    Harliwong, Ivon
    Newell, Felicity
    Bruxner, Timothy J. C.
    Miller, David
    Cloonan, Nicole
    Grimmond, Sean M.
    PLOS ONE, 2015, 10 (05):
  • [5] Estimates of allele-specific expression in Drosophila with a single genome sequence and RNA-seq data
    Quinn, Andrew
    Juneja, Punita
    Jiggins, Francis M.
    BIOINFORMATICS, 2014, 30 (18) : 2603 - 2610
  • [6] Analysis of allele-specific expression using RNA-seq of the Korean native pig and Landrace reciprocal cross
    Ahn, Byeongyong
    Choi, Min-Kyeung
    Yum, Joori
    Cho, In-Cheol
    Kim, Jin-Hoi
    Park, Chankyu
    ASIAN-AUSTRALASIAN JOURNAL OF ANIMAL SCIENCES, 2019, 32 (12): : 1816 - 1825
  • [7] Proper Use of Allele-Specific Expression Improves Statistical Power for cis-eQTL Mapping with RNA-Seq Data
    Hu, Yi-Juan
    Sun, Wei
    Tzeng, Jung-Ying
    Perou, Charles M.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2015, 110 (511) : 962 - 974
  • [8] Allele-specific RNA-seq expression profiling of imprinted genes in mouse isogenic pluripotent states
    René A. M. Dirks
    Guido van Mierlo
    Hindrik H. D. Kerstens
    Andreia S. Bernardo
    Julianna Kobolák
    István Bock
    Julien Maruotti
    Roger A. Pedersen
    András Dinnyés
    Martijn A. Huynen
    Alice Jouneau
    Hendrik Marks
    Epigenetics & Chromatin, 12
  • [9] A Bayesian approach for estimating allele-specific expression from RNA-Seq data with diploid genomes
    Naoki Nariai
    Kaname Kojima
    Takahiro Mimori
    Yosuke Kawai
    Masao Nagasaki
    BMC Genomics, 17
  • [10] Allele-specific RNA-seq expression profiling of imprinted genes in mouse isogenic pluripotent states
    Dirks, Rene A. M.
    van Mierlo, Guido
    Kerstens, Hindrik H. D.
    Bernardo, Andreia S.
    Kobolak, Julianna
    Bock, Istvan
    Maruotti, Julien
    Pedersen, Roger A.
    Dinnyes, Andras
    Huynen, Martijn A.
    Jouneau, Alice
    Marks, Hendrik
    EPIGENETICS & CHROMATIN, 2019, 12 (1)