A Semi-parametric Bayesian Approach for Differential Expression Analysis of RNA-seq Data

被引:5
|
作者
Liu, Fangfang [1 ]
Wang, Chong [1 ,2 ]
Liu, Peng [1 ]
机构
[1] Iowa State Univ, Dept Stat, Ames, IA 50011 USA
[2] Iowa State Univ, Dept Vet Diagnost & Prod Anim Med, Ames, IA 50011 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
Bayesian nonparametric method; Differential expression; Dirichlet process; Posterior probability; RNA-seq; DEVELOPMENTAL DYNAMICS; STATISTICAL-METHODS; DISTRIBUTIONS; NORMALIZATION; MIXTURES; MODEL;
D O I
10.1007/s13253-015-0227-0
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
RNA-sequencing (RNA-seq) technologies have revolutionized the way that agricultural biologists study gene expression as well as generated a tremendous amount of data waiting for analysis. Detecting differentially expressed genes is one of the fundamental steps in RNA-seq data analysis. In this paper, we model the count data from RNA-seq experiments with a Poisson-Gamma hierarchical model, or equivalently, a negative binomial model. We derive a semi-parametric Bayesian approach with a Dirichlet process as the prior model for the distribution of fold changes between the two treatment means. An inference strategy using Gibbs algorithm is developed for differential expression analysis. The results of several simulation studies show that our proposed method outperforms other methods including the popularly applied edgeR and DESeq methods. We also discuss an application of our method to a dataset that compares gene expression between bundle sheath and mesophyll cells in maize leaves. Supplementary materials accompanying this paper appear online.
引用
收藏
页码:555 / 576
页数:22
相关论文
共 50 条
  • [41] rmRNAseq: differential expression analysis for repeated-measures RNA-seq data
    Nguyen, Yet
    Nettleton, Dan
    BIOINFORMATICS, 2020, 36 (16) : 4432 - 4439
  • [42] Error estimates for the analysis of differential expression from RNA-seq count data
    Burden, Conrad J.
    Qureshi, Sumaira E.
    Wilson, Susan R.
    PEERJ, 2014, 2
  • [43] Measuring differential gene expression with RNA-seq: challenges and strategies for data analysis
    Finotello, Francesca
    Di Camillo, Barbara
    BRIEFINGS IN FUNCTIONAL GENOMICS, 2015, 14 (02) : 130 - 142
  • [44] contamDE: differential expression analysis of RNA-seq data for contaminated tumor samples
    Shen, Qi
    Hu, Jiyuan
    Jiang, Ning
    Hu, Xiaohua
    Luo, Zewei
    Zhang, Hong
    BIOINFORMATICS, 2016, 32 (05) : 705 - 712
  • [45] An iteration normalization and test method for differential expression analysis of RNA-seq data
    Zhou, Yan
    Lin, Nan
    Zhang, Baoxue
    BIODATA MINING, 2014, 7
  • [46] Comprehensive evaluation of differential gene expression analysis methods for RNA-seq data
    Rapaport, Franck
    Khanin, Raya
    Liang, Yupu
    Pirun, Mono
    Krek, Azra
    Zumbo, Paul
    Mason, Christopher E.
    Socci, Nicholas D.
    Betel, Doron
    GENOME BIOLOGY, 2013, 14 (09):
  • [47] Finding consistent patterns: A nonparametric approach for identifying differential expression in RNA-Seq data
    Li, Jun
    Tibshirani, Robert
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2013, 22 (05) : 519 - 536
  • [48] BADGE: A novel Bayesian model for accurate abundance quantification and differential analysis of RNA-Seq data
    Gu, Jinghua
    Wang, Xiao
    Halakivi-Clarke, Leena
    Clarke, Robert
    Xuan, Jianhua
    BMC BIOINFORMATICS, 2014, 15
  • [49] BADGE: A novel Bayesian model for accurate abundance quantification and differential analysis of RNA-Seq data
    Jinghua Gu
    Xiao Wang
    Leena Halakivi-Clarke
    Robert Clarke
    Jianhua Xuan
    BMC Bioinformatics, 15
  • [50] A Bayesian semi-parametric approach to the ordinal calibration problem
    Paz Casanova, Maria
    Orellana, Yasna
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2016, 45 (22) : 6596 - 6610