Length Bias Correction in Gene Ontology Enrichment Analysis Using Logistic Regression

被引:23
|
作者
Mi, Gu [1 ]
Di, Yanming [1 ,2 ]
Emerson, Sarah [1 ]
Cumbie, Jason S. [2 ,3 ]
Chang, Jeff H. [2 ,3 ,4 ]
机构
[1] Oregon State Univ, Dept Stat, Corvallis, OR 97331 USA
[2] Oregon State Univ, Mol & Cellular Biol Program, Corvallis, OR 97331 USA
[3] Oregon State Univ, Dept Bot & Plant Pathol, Corvallis, OR 97331 USA
[4] Oregon State Univ, Ctr Genome Res & Biocomp, Corvallis, OR 97331 USA
来源
PLOS ONE | 2012年 / 7卷 / 10期
基金
美国国家卫生研究院; 美国食品与农业研究所;
关键词
DIFFERENTIAL EXPRESSION ANALYSIS; RNA-SEQ DATA; TOOLS; GRAPH;
D O I
10.1371/journal.pone.0046128
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
When assessing differential gene expression from RNA sequencing data, commonly used statistical tests tend to have greater power to detect differential expression of genes encoding longer transcripts. This phenomenon, called "length bias", will influence subsequent analyses such as Gene Ontology enrichment analysis. In the presence of length bias, Gene Ontology categories that include longer genes are more likely to be identified as enriched. These categories, however, are not necessarily biologically more relevant. We show that one can effectively adjust for length bias in Gene Ontology analysis by including transcript length as a covariate in a logistic regression model. The logistic regression model makes the statistical issue underlying length bias more transparent: transcript length becomes a confounding factor when it correlates with both the Gene Ontology membership and the significance of the differential expression test. The inclusion of the transcript length as a covariate allows one to investigate the direct correlation between the Gene Ontology membership and the significance of testing differential expression, conditional on the transcript length. We present both real and simulated data examples to show that the logistic regression approach is simple, effective, and flexible.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Bias correction of AIC in logistic regression models
    Yanagihara, H
    Sekiguchi, R
    Fujikoshi, Y
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2003, 115 (02) : 349 - 360
  • [2] Bias correction in logistic regression with missing categorical covariates
    Das, Ujjwal
    Maiti, Tapabrata
    Pradhan, Vivek
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2010, 140 (09) : 2478 - 2485
  • [3] Comparison of Bias Correction Methods for the Rare Event Logistic Regression
    Kim, Hyungwoo
    Ko, Taeseok
    Park, No-Wook
    Lee, Woojoo
    KOREAN JOURNAL OF APPLIED STATISTICS, 2014, 27 (02) : 277 - 290
  • [4] Correction: Gene ontology enrichment analysis of congenital diaphragmatic hernia-associated genes
    Timothy R. A. Dalmer
    Robin D. Clugston
    Pediatric Research, 2019, 86 : 676 - 676
  • [5] ADJUSTING FOR NONRESPONSE BIAS USING LOGISTIC-REGRESSION
    ALHO, JM
    BIOMETRIKA, 1990, 77 (03) : 617 - 624
  • [6] WEIGHTED LOGISTIC REGRESSION FOR MULTIPLE BIAS ANALYSIS.
    Johnson, C. Y.
    Howards, P. P.
    Strickland, M. J.
    Waller, D. K.
    Flanders, W. D.
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2012, 175 : S46 - S46
  • [7] Gene ontology enrichment analysis of parkin interactants
    Zanon, A.
    Pichler, I.
    Rakovic, A.
    Schwienbacher, C.
    Hicks, A. A.
    Alexa, A.
    Domingues, F. S.
    Klein, C.
    Pramstaller, P. P.
    MOVEMENT DISORDERS, 2011, 26 : S349 - S349
  • [8] Bias analysis for misclassification in a multicategorical exposure in a logistic regression model
    Liu, Yaqing
    Liu, Juxin
    Zhang, Fuxi
    STATISTICS & PROBABILITY LETTERS, 2013, 83 (12) : 2621 - 2626
  • [9] A Comparative Study of the Bias Correction Methods for Differential Item Functioning Analysis in Logistic Regression with Rare Events Data
    Faghih, Marjan
    Bagheri, Zahra
    Stevanovic, Dejan
    Ayatollahi, Seyyed Mohhamad Taghi
    Jafari, Peyman
    BIOMED RESEARCH INTERNATIONAL, 2020, 2020
  • [10] Bias in logistic regression due to imperfect diagnostic test results and practical correction approaches
    Valle, Denis
    Lima, Joanna M. Tucker
    Millar, Justin
    Amratia, Punam
    Haque, Ubydul
    MALARIA JOURNAL, 2015, 14