PBSeq: Modeling base-level bias to estimate gene and isoform expression for RNA-seq data

被引:0
|
作者
Li Zhang
Xuejun Liu
机构
[1] Nanjing University of Aeronautics and Astronautics,College of Computer Science and Technology
来源
International Journal of Machine Learning and Cybernetics | 2017年 / 8卷
关键词
RNA-seq; Base-level bias; Gene and isoform expression level; Expression of uncertainty;
D O I
暂无
中图分类号
学科分类号
摘要
Due to its unprecedented high-throughput and high-resolution, RNA-seq rapidly becomes a revolutionary and powerful technology for transcriptome analysis. However, RNA-seq library preparation results in non-uniformity of read distribution in the represented genes. When estimating gene and isoform expression level, the non-uniformity needs to be accounted and corrected to improve the estimation accuracy. In this paper, we propose PBSeq, a Poisson model utilizing a base-level bias correction strategy to estimate gene and isoform expression. The base-level bias correction strategy simultaneously considers the positional and sequence-specific biases at starting position of reads mapped to the genes of interest. The PBSeq not only provides the expression values but also estimates the uncertainty associated with expression estimation, which represents the variation across replicates and is useful for downstream analysis. We utilize a simulated dataset and three real RNA-seq datasets to validate the PBSeq model. Results show that PBseq can accurately estimate gene and isoform expression levels and is computationally efficient compared with other state-of-art methods.
引用
收藏
页码:1247 / 1258
页数:11
相关论文
共 50 条
  • [21] Comparative evaluation of isoform-level gene expression estimation algorithms for RNA-seq and exon-array platforms
    Dapas, Matthew
    Kandpal, Manoj
    Bi, Yingtao
    Davuluri, Ramana V.
    BRIEFINGS IN BIOINFORMATICS, 2017, 18 (02) : 260 - 269
  • [22] Bias and Correction in RNA-seq Data for Marine Species
    Kai Song
    Li Li
    Guofan Zhang
    Marine Biotechnology, 2017, 19 : 541 - 550
  • [23] Bias and Correction in RNA-seq Data for Marine Species
    Song, Kai
    Li, Li
    Zhang, Guofan
    MARINE BIOTECHNOLOGY, 2017, 19 (05) : 541 - 550
  • [24] Towards Reliable Isoform Quantification Using RNA-Seq Data
    Howard, Brian E.
    Heber, Steffen
    2009 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2009, : 130 - 135
  • [25] Towards reliable isoform quantification using RNA-SEQ data
    Brian E Howard
    Steffen Heber
    BMC Bioinformatics, 11
  • [26] A structured sparse regression method for estimating isoform expression level from multi-sample RNA-seq data
    Zhang, L.
    Liu, X. J.
    GENETICS AND MOLECULAR RESEARCH, 2016, 15 (02)
  • [27] Towards reliable isoform quantification using RNA-SEQ data
    Howard, Brian E.
    Heber, Steffen
    BMC BIOINFORMATICS, 2010, 11
  • [28] Differential expression analysis of RNA-seq data at single-base resolution
    Frazee, Alyssa C.
    Sabunciyan, Sarven
    Hansen, Kasper D.
    Irizarry, Rafael A.
    Leek, Jeffrey T.
    BIOSTATISTICS, 2014, 15 (03) : 413 - 426
  • [29] Joint estimation of isoform expression and isoform-specific read distribution using multisample RNA-Seq data
    Suo, Chen
    Calza, Stefano
    Salim, Agus
    Pawitan, Yudi
    BIOINFORMATICS, 2014, 30 (04) : 506 - 513
  • [30] Differential gene expression analysis using coexpression and RNA-Seq data
    Yang, Ei-Wen
    Girke, Thomas
    Jiang, Tao
    BIOINFORMATICS, 2013, 29 (17) : 2153 - 2161