Detecting differentially expressed genes in heterogeneous diseases using half Student's t-test

被引:9
|
作者
Hsu, Chun-Lun
Lee, Wen-Chung
机构
[1] Natl Taiwan Univ, Coll Publ Hlth, Res Ctr Genes Environm & Human Hlth, Taipei 10764, Taiwan
[2] Natl Taiwan Univ, Coll Publ Hlth, Grad Inst Epidemiol, Taipei 10764, Taiwan
关键词
Student's t-test; gene expression; heterogeneous disease; epidemiological methods; MICROARRAY EXPERIMENTS; STATISTICAL-METHODS; CANCER; DISCOVERY; TISSUES;
D O I
10.1093/ije/dyq093
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Background Microarray technology provides information about hundreds and thousands of gene-expression data in a single experiment. To search for disease-related genes, researchers test for those genes that are differentially expressed between the case subjects and the control subjects. Methods The authors propose a new test, the 'half Student's t-test', specifically for detecting differentially expressed genes in heterogeneous diseases. Monte-Carlo simulation shows that the test maintains the nominal alpha level quite well for both normal and non-normal distributions. Power of the half Student's t is higher than that of the conventional 'pooled' Student's t when there is heterogeneity in the disease under study. The power gain by using the half Student's t can reach similar to 10% when the standard deviation of the case group is 50% larger than that of the control group. Results Application to a colon cancer data reveals that when the false discovery rate (FDR) is controlled at 0.05, the half Student's t can detect 344 differentially expressed genes, whereas the pooled Student's t can detect only 65 genes. Or alternatively, if only 50 genes are to be selected, the FDR for the pooled Student's t has to be set at 0.0320 (false positive rate of similar to 3%), but for the half Student's t, it can be at as low as 0.0001 (false positive rate of about one per ten thousands). Conclusions The half Student's t-test is to be recommended for the detection of differentially expressed genes in heterogeneous diseases.
引用
收藏
页码:1597 / 1604
页数:8
相关论文
共 50 条
  • [1] Detecting differentially expressed genes of heterogeneous and positively skewed data using half Johnson's modified t-test
    Tzeng, I-Shiang
    Chen, Li-Shya
    Chang, Shy-Shin
    Lee, Yung-Ling Leo
    COGENT BIOLOGY, 2016, 2 (01):
  • [2] Detecting differentially expressed genes in heterogeneous diseases using control-only analysis of variance
    Tzeng, I-Shiang
    Lee, Wen-Chung
    ANNALS OF EPIDEMIOLOGY, 2012, 22 (08) : 598 - 602
  • [3] Why Psychologists Should by Default Use Welch's t-test Instead of Student's t-test
    Delacre, Marie
    Lakens, Daniel
    Leys, Christophe
    INTERNATIONAL REVIEW OF SOCIAL PSYCHOLOGY, 2017, 30 (01): : 92 - 101
  • [4] Confidence intervals for the power of Student's t-test
    Tarasinska, J
    STATISTICS & PROBABILITY LETTERS, 2005, 73 (02) : 125 - 130
  • [5] Student's t-test for Gaussian scale mixtures
    Bakirov N.K.
    Székely G.J.
    Journal of Mathematical Sciences, 2006, 139 (3) : 6497 - 6505
  • [6] The unequal variance t-test is an underused alternative to Student's t-test and the Mann-Whitney U test
    Ruxton, GD
    BEHAVIORAL ECOLOGY, 2006, 17 (04) : 688 - 690
  • [7] Identification of upstream transcription factor binding sites in orthologous genes using mixed Student's t-test statistics
    Huang, Tinghua
    Xiao, Hong
    Tian, Qi
    He, Zhen
    Yuan, Cheng
    Lin, Zezhao
    Gao, Xuejun
    Yao, Min
    PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (06)
  • [8] Detecting differentially expressed genes using calibrated Bayes factors
    Yu, Fang
    Chen, Ming-Hui
    Ku, Lynn
    STATISTICA SINICA, 2008, 18 (02) : 783 - 802
  • [9] Application of Student's t-test, Analysis of Variance, and Covariance
    Mishra, Prabhaker
    Singh, Uttam
    Pandey, Chandra M.
    Mishra, Priyadarshni
    Pandey, Gaurav
    ANNALS OF CARDIAC ANAESTHESIA, 2019, 22 (04) : 407 - 411
  • [10] Image segmentation using the student's t-test and the divergence of direction on spherical regions
    Stetten, George
    Horvath, Samantha
    Galeotti, John
    Shukla, Gaurav
    Wang, Bo
    Chapman, Brian
    MEDICAL IMAGING 2010: IMAGE PROCESSING, 2010, 7623