Empirical Bayes Mean Estimation With Nonparametric Errors Via Order Statistic Regression on Replicated Data

被引:2
|
作者
Ignatiadis, Nikolaos [1 ]
Saha, Sujayam [2 ]
Sun, Dennis L. [3 ]
Muralidharan, Omkar [2 ]
机构
[1] Stanford Univ, Dept Stat, Sequoia Hall,390 Jane Stanford Way, Stanford, CA 94305 USA
[2] Google Inc, Mountain View, CA USA
[3] Calif Polytech State Univ San Luis Obispo, Dept Stat, San Luis Obispo, CA 93407 USA
关键词
Asymptotic optimality; Empirical Bayes; Linear regression; L-statistics; Nonparametric regression; MAXIMUM-LIKELIHOOD ESTIMATOR; PANEL-DATA; MODELS; INFERENCE;
D O I
10.1080/01621459.2021.1967164
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We study empirical Bayes estimation of the effect sizes of N units from K noisy observations on each unit. We show that it is possible to achieve near-Bayes optimal mean squared error, without any assumptions or knowledge about the effect size distribution or the noise. The noise distribution can be heteroscedastic and vary arbitrarily from unit to unit. Our proposal, which we call Aurora, leverages the replication inherent in the K observations per unit and recasts the effect size estimation problem as a general regression problem. Aurora with linear regression provably matches the performance of a wide array of estimators including the sample mean, the trimmed mean, the sample median, as well as James-Stein shrunk versions thereof. Aurora automates effect size estimation for Internet-scale datasets, as we demonstrate on data from a large technology firm.
引用
收藏
页码:987 / 999
页数:13
相关论文
共 50 条