Current scientific techniques in genomics and image processing routinely produce hypothesis testing problems with hundreds or thousands of cases to consider simultaneously. This poses new difficulties for the statistician, but also opens new opportunities. In particular, it allows empirical estimation of an appropriate null hypothesis. The empirical null may be considerably more dispersed than the usual theoretical null distribution that would be used for any one case considered separately. An empirical Bayes analysis plan for this situation is developed, using a local version of the false discovery rate to examine the inference issues. Two genomics problems are used as examples to show the importance of correctly choosing the null hypothesis.
机构:
Cornell Univ, Dept Human Dev, G331 Martha Van Rensselaer Hall, Ithaca, NY 14853 USA
Cornell Univ, Human Neurosci Inst, G331 Martha Van Rensselaer Hall, Ithaca, NY 14853 USACornell Univ, Dept Human Dev, G331 Martha Van Rensselaer Hall, Ithaca, NY 14853 USA