Efficient estimation of quantiles in missing data models

被引:12
|
作者
Diaz, Ivan [1 ,2 ]
机构
[1] Weill Cornell Med, Div Biostat & Epidemiol, New York, NY 10065 USA
[2] Google Inc, New York, NY USA
关键词
Quantile effects; Information bound; root n-consistency; TMLE; INFERENCE;
D O I
10.1016/j.jspi.2017.05.001
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We propose a novel targeted maximum likelihood estimator (TMLE) for quantiles in semiparametric missing data models. Our proposed estimator is locally efficient, root n-consistent, asymptotically normal, and doubly robust, under regularity conditions. We use Monte Carlo simulation to compare our proposed method to existing estimators. Our proposed estimator has superior performance, with relative efficiency up to three times smaller than the inverse probability weighted estimator (IPW), and up to two times smaller than the augmented IPW. This research is motivated by a causal inference research question with highly variable treatment assignment probabilities, and a heavy tailed, highly variable outcome. Estimation of causal effects on the mean is a hard problem in such scenarios because the information bound is generally small. In our application, the efficiency bound for estimating the effect on the mean is possibly infinite. This rules out root n-consistent inference and reduces the power for testing hypothesis of no treatment effect on the mean. In our simulations, using the effect on the median allows us to test a location-shift hypothesis with 30% more power. This allows us to make claims about the effectiveness of treatment that would have been hard to make for the effect on the mean. We provide R code to implement the proposed estimators. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:39 / 51
页数:13
相关论文
共 50 条
  • [41] Efficient restricted estimators for conditional mean models with missing data
    Tan, Z.
    BIOMETRIKA, 2011, 98 (03) : 663 - 684
  • [42] An efficient estimation for the parameter in additive partially linear models with missing covariates
    Xiuli Wang
    Yunquan Song
    Shuxia Zhang
    Journal of the Korean Statistical Society, 2020, 49 : 779 - 801
  • [43] An efficient estimation for the parameter in additive partially linear models with missing covariates
    Wang, Xiuli
    Song, Yunquan
    Zhang, Shuxia
    JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2020, 49 (03) : 779 - 801
  • [44] Estimation of spatial panel data models with randomly missing data in the dependent variable
    Wang, Wei
    Lee, Lung-fei
    REGIONAL SCIENCE AND URBAN ECONOMICS, 2013, 43 (03) : 521 - 538
  • [45] Estimation of moments and quantiles using censored data
    Kroll, Charles N.
    Stedinger, Jery R.
    Water Resources Research, 1996, 32 (04): : 1005 - 1012
  • [46] Estimation of moments and quantiles using censored data
    Kroll, CN
    Stedinger, JR
    WATER RESOURCES RESEARCH, 1996, 32 (04) : 1005 - 1012
  • [47] ESTIMATION WITH MISSING DATA
    DIGGLE, PJ
    BIOMETRICS, 1994, 50 (02) : 580 - 580
  • [48] Estimation with missing data
    Goodwin, GC
    Feuer, A
    MATHEMATICAL AND COMPUTER MODELLING OF DYNAMICAL SYSTEMS, 1999, 5 (03) : 220 - 244
  • [49] Comparative study of flood quantiles estimation by nonparametric models
    Kim, KD
    Heo, JH
    JOURNAL OF HYDROLOGY, 2002, 260 (1-4) : 176 - 193
  • [50] Communication-efficient distributed M-estimation with missing data
    Shi, Jianwei
    Qin, Guoyou
    Zhu, Huichen
    Zhu, Zhongyi
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2021, 161