Detecting differentially expressed genes in heterogeneous diseases using half Student's t-test

被引:9
|
作者
Hsu, Chun-Lun
Lee, Wen-Chung
机构
[1] Natl Taiwan Univ, Coll Publ Hlth, Res Ctr Genes Environm & Human Hlth, Taipei 10764, Taiwan
[2] Natl Taiwan Univ, Coll Publ Hlth, Grad Inst Epidemiol, Taipei 10764, Taiwan
关键词
Student's t-test; gene expression; heterogeneous disease; epidemiological methods; MICROARRAY EXPERIMENTS; STATISTICAL-METHODS; CANCER; DISCOVERY; TISSUES;
D O I
10.1093/ije/dyq093
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Background Microarray technology provides information about hundreds and thousands of gene-expression data in a single experiment. To search for disease-related genes, researchers test for those genes that are differentially expressed between the case subjects and the control subjects. Methods The authors propose a new test, the 'half Student's t-test', specifically for detecting differentially expressed genes in heterogeneous diseases. Monte-Carlo simulation shows that the test maintains the nominal alpha level quite well for both normal and non-normal distributions. Power of the half Student's t is higher than that of the conventional 'pooled' Student's t when there is heterogeneity in the disease under study. The power gain by using the half Student's t can reach similar to 10% when the standard deviation of the case group is 50% larger than that of the control group. Results Application to a colon cancer data reveals that when the false discovery rate (FDR) is controlled at 0.05, the half Student's t can detect 344 differentially expressed genes, whereas the pooled Student's t can detect only 65 genes. Or alternatively, if only 50 genes are to be selected, the FDR for the pooled Student's t has to be set at 0.0320 (false positive rate of similar to 3%), but for the half Student's t, it can be at as low as 0.0001 (false positive rate of about one per ten thousands). Conclusions The half Student's t-test is to be recommended for the detection of differentially expressed genes in heterogeneous diseases.
引用
收藏
页码:1597 / 1604
页数:8
相关论文
共 50 条
  • [41] Identification of differentially expressed genes in a human Burkitt's lymphoma cell line using differential mRNA display.
    Rapoport, AP
    Simons-Evelyn, M
    Bailey-Dell, K
    Fenton, R
    BLOOD, 1999, 94 (10) : 518A - 518A
  • [42] Identification of differentially expressed genes in ileal Peyer's Patch of scrapie-infected sheep using RNA arbitrarily primed PCR
    Austbo, Lars
    Kampmann, Andreas
    Mueller-Ladner, Ulf
    Neumann, Elena
    Olsaker, Ingrid
    Skretting, Grethe
    BMC VETERINARY RESEARCH, 2008, 4 (1)
  • [43] Identification of differentially expressed genes in ileal Peyer's Patch of scrapie-infected sheep using RNA arbitrarily primed PCR
    Lars Austbø
    Andreas Kampmann
    Ulf Müller-Ladner
    Elena Neumann
    Ingrid Olsaker
    Grethe Skretting
    BMC Veterinary Research, 4
  • [44] Enhancing Hotelling's T2 Statistic using Shrinkage Covariance Matrix for Identifying Differentially Expressed Gene Sets
    Karjanto, Suryaefiza
    Aripin, Rasimah
    Ramli, Norazan Mohamed
    Ghani, Nor Azura Md
    PROCEEDINGS IWBBIO 2014: INTERNATIONAL WORK-CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING, VOLS 1 AND 2, 2014, : 1780 - 1791
  • [45] Transcriptome analysis reveals differentially expressed genes involved in somatic embryogenesis and podophyllotoxin biosynthesis of Sinopodophyllum hexandrum (Royle) T. S. Ying
    Guo, Shenghu
    Chen, Yuchao
    Zhu, Yongxing
    Tian, Mei
    PROTOPLASMA, 2023, 260 (04) : 1221 - 1232
  • [46] Transcriptome analysis reveals differentially expressed genes involved in somatic embryogenesis and podophyllotoxin biosynthesis of Sinopodophyllum hexandrum (Royle) T. S. Ying
    Shenghu Guo
    Yuchao Chen
    Yongxing Zhu
    Mei Tian
    Protoplasma, 2023, 260 : 1221 - 1232
  • [47] Attach importance of the bootstrap t test against Student's t test in clinical epidemiology: a demonstrative comparison using COVID-19 as an example
    Zhao, Shi
    Yang, Zuyao
    Musa, Salihu S.
    Ran, Jinjun
    Chong, Marc K. C.
    Javanbakht, Mohammad
    He, Daihai
    Wang, Maggie H.
    EPIDEMIOLOGY & INFECTION, 2021, 149
  • [48] SVM-based computer-aided diagnosis of the Alzheimer's disease using t-test NMSE feature selection with feature correlation weighting
    Chaves, R.
    Ramirez, J.
    Gorriz, J. M.
    Lopez, M.
    Salas-Gonzalez, D.
    Alvarez, I.
    Segovia, F.
    NEUROSCIENCE LETTERS, 2009, 461 (03) : 293 - 297
  • [49] Examination of the relationship between PiB-PET and FDG-PET in Alzheimer's disease using random forest and two-sample t-test
    Tsubaki, Yuma
    Akamatsu, Go
    Shimokawa, Natsumi
    Takashima, Aya
    Katsube, Suguru
    Sato, Hideaki
    Kumamoto, Kodai
    Sasaki, Masayuki
    JOURNAL OF NUCLEAR MEDICINE, 2020, 61
  • [50] Development and Validation of UV-Derivative Spectroscopic and RP-HPLC Methods for the Determination of Amlodipine Besylate and Valsartan in Tablet Dosage form and Comparison of the Developed Methods by Student's T-Test
    Usharani, N.
    Divya, K.
    Ashrtiha, V. V. S.
    INDIAN JOURNAL OF PHARMACEUTICAL EDUCATION AND RESEARCH, 2017, 51 (04) : S776 - S782