PBSeq: Modeling base-level bias to estimate gene and isoform expression for RNA-seq data

Authors
Li Zhang
Xuejun Liu
Affiliation
[1] Nanjing University of Aeronautics and Astronautics, College of Computer Science and Technology
Source
International Journal of Machine Learning and Cybernetics | 2017 / Volume 8, Issue 4
Keywords
RNA-seq; Base-level bias; Gene and isoform expression level; Expression of uncertainty
DOI
Not available
Abstract
Owing to its unprecedented throughput and resolution, RNA-seq has rapidly become a revolutionary and powerful technology for transcriptome analysis. However, RNA-seq library preparation introduces non-uniformity in the distribution of reads across the represented genes. When estimating gene and isoform expression levels, this non-uniformity must be accounted for and corrected to improve estimation accuracy. In this paper, we propose PBSeq, a Poisson model that uses a base-level bias correction strategy to estimate gene and isoform expression. The strategy simultaneously considers the positional and sequence-specific biases at the starting positions of reads mapped to the genes of interest. PBSeq not only provides expression values but also estimates the uncertainty associated with each expression estimate, which represents the variation across replicates and is useful for downstream analysis. We use a simulated dataset and three real RNA-seq datasets to validate the PBSeq model. Results show that PBSeq can accurately estimate gene and isoform expression levels and is computationally efficient compared with other state-of-the-art methods.
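To make the idea of a bias-weighted Poisson model concrete, the following is a minimal sketch, not the PBSeq implementation. It assumes that the number of reads starting at base i of a single-isoform gene follows Poisson(theta * b_i), where b_i is an already estimated bias weight for that base and theta is the expression level. The function name `estimate_expression` and the example numbers are hypothetical; how PBSeq actually estimates the weights and handles multiple isoforms is not described in this record.

```python
import math


def estimate_expression(counts, bias):
    """Maximum-likelihood theta for counts[i] ~ Poisson(theta * bias[i]).

    counts: read-start counts per base (hypothetical input)
    bias:   positive per-base bias weights (assumed already estimated)
    Returns (theta, std_err); std_err comes from the inverse Fisher
    information, Var(theta_hat) = theta / sum(bias).
    """
    if len(counts) != len(bias):
        raise ValueError("counts and bias must have the same length")
    total_bias = float(sum(bias))
    if total_bias <= 0:
        raise ValueError("bias weights must sum to a positive value")
    # Setting the derivative of the Poisson log-likelihood to zero
    # gives theta_hat = sum(counts) / sum(bias).
    theta = sum(counts) / total_bias
    std_err = math.sqrt(theta / total_bias)
    return theta, std_err


# Illustrative numbers only: four bases with unequal bias weights.
theta, std_err = estimate_expression([4, 9, 2, 5], [1.0, 2.0, 0.5, 1.5])
print(theta, std_err)
```

With uniform weights (all b_i equal to 1) the estimate reduces to the mean count per base, so the weights are what let a non-uniform read distribution be corrected rather than averaged over. The standard error is one simple way to attach an uncertainty to the estimate.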
Pages: 1247-1258
Page count: 11