Detecting differentially expressed genes in heterogeneous diseases using half Student's t-test

被引:9
|
作者
Hsu, Chun-Lun
Lee, Wen-Chung
机构
[1] Natl Taiwan Univ, Coll Publ Hlth, Res Ctr Genes Environm & Human Hlth, Taipei 10764, Taiwan
[2] Natl Taiwan Univ, Coll Publ Hlth, Grad Inst Epidemiol, Taipei 10764, Taiwan
关键词
Student's t-test; gene expression; heterogeneous disease; epidemiological methods; MICROARRAY EXPERIMENTS; STATISTICAL-METHODS; CANCER; DISCOVERY; TISSUES;
D O I
10.1093/ije/dyq093
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Background Microarray technology provides information about hundreds and thousands of gene-expression data in a single experiment. To search for disease-related genes, researchers test for those genes that are differentially expressed between the case subjects and the control subjects. Methods The authors propose a new test, the 'half Student's t-test', specifically for detecting differentially expressed genes in heterogeneous diseases. Monte-Carlo simulation shows that the test maintains the nominal alpha level quite well for both normal and non-normal distributions. Power of the half Student's t is higher than that of the conventional 'pooled' Student's t when there is heterogeneity in the disease under study. The power gain by using the half Student's t can reach similar to 10% when the standard deviation of the case group is 50% larger than that of the control group. Results Application to a colon cancer data reveals that when the false discovery rate (FDR) is controlled at 0.05, the half Student's t can detect 344 differentially expressed genes, whereas the pooled Student's t can detect only 65 genes. Or alternatively, if only 50 genes are to be selected, the FDR for the pooled Student's t has to be set at 0.0320 (false positive rate of similar to 3%), but for the half Student's t, it can be at as low as 0.0001 (false positive rate of about one per ten thousands). Conclusions The half Student's t-test is to be recommended for the detection of differentially expressed genes in heterogeneous diseases.
引用
收藏
页码:1597 / 1604
页数:8
相关论文
共 50 条
  • [21] Fisher's combined p-value for detecting differentially expressed genes using Affymetrix expression arrays
    Hess, Ann
    Iyer, Hari
    BMC GENOMICS, 2007, 8
  • [22] Fisher's combined p-value for detecting differentially expressed genes using Affymetrix expression arrays
    Ann Hess
    Hari Iyer
    BMC Genomics, 8
  • [23] Detecting differentially expressed genes from RNA-seq data using fuzzy clustering
    Ando, Yuki
    Shimokawa, Asanao
    INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2024,
  • [24] The "Window t test":: a simple and powerful approach to detect differentially expressed genes in microarray datasets
    Berger, Fabrice
    De Hertogh, Benoit
    Pierre, Michael
    Gaigneaux, Anthoula
    Depiereux, Eric
    CENTRAL EUROPEAN JOURNAL OF BIOLOGY, 2008, 3 (03): : 327 - 344
  • [25] The research on screening differentially expressed genes in Hirschsprung's disease by using Microarray
    Qin, Ke-Wang
    Shi, Hao
    Zhang, Lei
    Liu, Peng-Fei
    Cai, Wei-Long
    Wu, Kun-He
    Zhu, Xiao-Chun
    Lan, Fei-Fei
    JOURNAL OF PEDIATRIC SURGERY, 2013, 48 (11) : 2281 - 2288
  • [26] Statistical student's t-test in carbon nanospheres synthesis from cis-1,4-polyisoprene
    Guzman-Fuentes, Jaime A.
    Contreras-Navarrete, Jose J.
    Cadenas-Calderon, Erasmo
    Ambriz-Torres, Jael M.
    Garcia-Ruiz, Diana L.
    Gutierrez-Garcia, Carmen J.
    Granados-Martinez, Francisco G.
    Garcia-Gonzalez, Leandro
    Zamora-Peredo, Luis
    Arredondo-Leon, Yesenia
    Hernandez-Cristobal, Orlando
    Domratcheva-Lvova, Lada
    MRS ADVANCES, 2020, 5 (63) : 3371 - 3377
  • [27] Statistical student’s t-test in carbon nanospheres synthesis from cis-1,4-polyisoprene
    Jaime A. Guzmán-Fuentes
    José J. Contreras-Navarrete
    Erasmo Cadenas-Calderón
    Jael M. Ambriz-Torres
    Diana L. García-Ruíz
    Carmen J. Gutiérrez-García
    Francisco G. Granados-Martínez
    Leandro García-González
    Luis Zamora-Peredo
    Yesenia Arredondo-León
    Orlando Hernandez-Cristobal
    Lada Domratcheva-Lvova
    MRS Advances, 2020, 5 : 3371 - 3377
  • [28] Identification of differentially expressed hub genes in Alzheimer's disease using microarray dataset
    Kumar, Vijayasarathy Sampath
    Shanthi, Veerappapillai
    RESEARCH JOURNAL OF BIOTECHNOLOGY, 2023, 18 (10): : 77 - 86
  • [29] A powerful nonparametric method for detecting differentially co-expressed genes: distance correlation screening and edge-count test
    Zhang, Qingyang
    BMC SYSTEMS BIOLOGY, 2018, 12
  • [30] Comparison of Statistical Data Models for Identifying Differentially Expressed Genes Using a Generalized Likelihood Ratio Test
    Seng, Kok-Yong
    Glenny, Robb W.
    Madtes, David K.
    Spilker, Mary E.
    Vicini, Paolo
    Gharib, Sina A.
    GENE REGULATION AND SYSTEMS BIOLOGY, 2008, 2 : 125 - 139