UNIFORMLY VALID POST-REGULARIZATION CONFIDENCE REGIONS FOR MANY FUNCTIONAL PARAMETERS IN Z-ESTIMATION FRAMEWORK

被引:33
作者
Belloni, Alexandre [1 ]
Chernozhukov, Victor [2 ,3 ]
Chetverikov, Denis [4 ]
Wei, Ying [5 ]
机构
[1] Duke Univ, Fuqua Sch Business, 100 Fuqua Dr, Durham, NC 27708 USA
[2] MIT, Dept Econ, 50 Mem Dr, Cambridge, MA 02142 USA
[3] MIT, Operat Res Ctr, 50 Mem Dr, Cambridge, MA 02142 USA
[4] UCLA, Dept Econ, Bunche Hall,Rm 8283,315 Portola Plaza, Los Angeles, CA 90095 USA
[5] Columbia Univ, Dept Biostat, 722 West 168th St,Rm 633, New York, NY 10032 USA
关键词
Inference after model selection; moment condition models with a continuum of target parameters; Lasso and Post-Lasso with functional response data; HIGH-DIMENSIONAL REGRESSION; SQUARE-ROOT LASSO; MODEL-SELECTION; LINEAR-MODELS; SPARSE MODELS; SEMIPARAMETRIC REGRESSION; ECONOMETRIC-MODELS; EFFICIENCY BOUNDS; INFERENCE; APPROXIMATION;
D O I
10.1214/17-AOS1671
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this paper, we develop procedures to construct simultaneous confidence bands for (p) over tilde potentially infinite-dimensional parameters after model selection for general moment condition models where p is potentially much larger than the sample size of available data, n. This allows us to cover settings with functional response data where each of the p parameters is a function. The procedure is based on the construction of score functions that satisfy Neyman orthogonality condition approximately. The proposed simultaneous confidence bands rely on uniform central limit theorems for high-dimensional vectors (and not on Donsker arguments as we allow for (p) over tilde >> n). To construct the bands, we employ a multiplier bootstrap procedure which is computationally efficient as it only involves resampling the estimated score functions (and does not require resolving the high-dimensional optimization problems). We formally apply the general theory to inference on regression coefficient process in the distribution regression model with a logistic link, where two implementations are analyzed in detail. Simulations and an application to real data are provided to help illustrate the applicability of the results.
引用
收藏
页码:3643 / 3675
页数:33
相关论文
共 47 条
[1]   ASYMPTOTICS FOR SEMIPARAMETRIC ECONOMETRIC-MODELS VIA STOCHASTIC EQUICONTINUITY [J].
ANDREWS, DWK .
ECONOMETRICA, 1994, 62 (01) :43-72
[2]  
[Anonymous], 2010, LASSO METHODS GAUSSI
[3]  
[Anonymous], 1979, Sankhya: The Indian Journal of Statistics, Series
[4]  
[Anonymous], 1999, CAMBRIDGE STUDIES AD
[5]   Uniform post-selection inference for least absolute deviation regression and other Z-estimation problems [J].
Belloni, A. ;
Chernozhukov, V. ;
Kato, K. .
BIOMETRIKA, 2015, 102 (01) :77-94
[6]   Sparse Models and Methods for Optimal Instruments With an Application to Eminent Domain [J].
Belloni, A. ;
Chen, D. ;
Chernozhukov, V. ;
Hansen, C. .
ECONOMETRICA, 2012, 80 (06) :2369-2429
[7]   Square-root lasso: pivotal recovery of sparse signals via conic programming [J].
Belloni, A. ;
Chernozhukov, V. ;
Wang, L. .
BIOMETRIKA, 2011, 98 (04) :791-806
[8]  
BELLONI A, 2013, VALID POST SELECTION
[9]  
BELLONI A, 2018, UNIFORMLY VALID PO S, DOI [10.1214/17-AOS1671SUPP, DOI 10.1214/17-AOS1671SUPP]
[10]  
Belloni A, 2013, ECON SOC MONOGR, P245