Efficient estimation of quantiles in missing data models

Cited by: 12
Author
Diaz, Ivan [1, 2]
Affiliations
[1] Weill Cornell Med, Div Biostat & Epidemiol, New York, NY 10065 USA
[2] Google Inc, New York, NY USA
Keywords
Quantile effects; Information bound; Root-n consistency; TMLE; Inference
DOI
10.1016/j.jspi.2017.05.001
Chinese Library Classification
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Discipline classification codes
020208; 070103; 0714
Abstract
We propose a novel targeted maximum likelihood estimator (TMLE) for quantiles in semiparametric missing data models. Under regularity conditions, the proposed estimator is locally efficient, root-n consistent, asymptotically normal, and doubly robust. We use Monte Carlo simulation to compare the proposed method to existing estimators. The proposed estimator has superior performance, with relative efficiency up to three times smaller than the inverse probability weighted (IPW) estimator and up to two times smaller than the augmented IPW estimator. This research is motivated by a causal inference question with highly variable treatment assignment probabilities and a heavy-tailed, highly variable outcome. Estimating causal effects on the mean is a hard problem in such scenarios because the information bound is generally small. In our application, the efficiency bound for estimating the effect on the mean is possibly infinite, which rules out root-n consistent inference and reduces the power for testing the hypothesis of no treatment effect on the mean. In our simulations, using the effect on the median allows us to test a location-shift hypothesis with 30% more power. This allows us to make claims about the effectiveness of treatment that would have been hard to make for the effect on the mean. We provide R code to implement the proposed estimators. © 2017 Elsevier B.V. All rights reserved.
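For orientation, the IPW comparator named in the abstract can be sketched in a few lines. This is not the TMLE proposed in the paper, nor the authors' R code: it is a minimal Python illustration of a normalized (Hajek-style) inverse probability weighted quantile under missingness at random. The function name `ipw_quantile`, the simulated data, and the assumption that the observation probabilities are known rather than estimated are all hypothetical choices made for this sketch.

```python
import numpy as np


def ipw_quantile(y, observed, propensity, q=0.5):
    """Normalized IPW estimate of the q-th quantile of an outcome with missing values.

    y          : outcomes (entries where observed is False are ignored)
    observed   : boolean indicator that the outcome was observed
    propensity : P(observed | covariates), assumed known here for illustration
    """
    y = np.asarray(y, dtype=float)
    observed = np.asarray(observed, dtype=bool)
    propensity = np.asarray(propensity, dtype=float)

    # Weight each observed outcome by the inverse of its observation probability.
    w = 1.0 / propensity[observed]
    y_obs = y[observed]

    # Weighted empirical CDF evaluated at the sorted observed outcomes.
    order = np.argsort(y_obs)
    y_sorted = y_obs[order]
    cdf = np.cumsum(w[order]) / np.sum(w)

    # Smallest observed value whose weighted CDF reaches q.
    idx = min(int(np.searchsorted(cdf, q, side="left")), len(y_sorted) - 1)
    return y_sorted[idx]


# Illustrative simulation (assumed setup): heavy-tailed outcome, with
# missingness depending on a covariate that also shifts the outcome.
rng = np.random.default_rng(0)
n = 5000
x = rng.normal(size=n)
y = np.exp(x + rng.normal(size=n))            # log-normal, heavy right tail
propensity = 1.0 / (1.0 + np.exp(-(0.5 + x)))  # true observation probability
observed = rng.random(n) < propensity

print("complete-case median:", np.median(y[observed]))
print("IPW median:          ", ipw_quantile(y, observed, propensity, q=0.5))
```

In this setup the complete-case median is biased because units with larger covariate values are both more likely to be observed and more likely to have large outcomes; the weights correct for that selection. Estimated propensities, the augmented IPW estimator, and the TMLE itself are outside the scope of this sketch.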
Pages: 39-51
Number of pages: 13