STATISTICAL INFERENCE WITH F-STATISTICS WHEN FITTING SIMPLE MODELS TO HIGH-DIMENSIONAL DATA

Authors
Leeb, Hannes [1]
Steinberger, Lukas [1]
Affiliations
[1] Univ Vienna, Vienna, Austria
Keywords
P-REGRESSION PARAMETERS; PRINCIPAL COMPONENTS; ASYMPTOTIC-BEHAVIOR; M-ESTIMATORS; LARGE NUMBER; TESTS; HETEROSKEDASTICITY; EIGENVALUES; PROJECTIONS; P2/N;
DOI
10.1017/S026646662100044X
Chinese Library Classification number
F [Economics];
Subject classification code
02;
Abstract
We study linear subset regression in the context of the high-dimensional overall model y = upsilon + theta'z + epsilon with univariate response y and a d-vector of random regressors z, independent of epsilon. Here, "high-dimensional" means that the number d of available explanatory variables is much larger than the number n of observations. We consider simple linear submodels where y is regressed on a set of p regressors given by x = M'z, for some d x p matrix M of full rank p < n. The corresponding simple model, that is, y = alpha + beta'x + e, is usually justified by imposing appropriate restrictions on the unknown parameter theta in the overall model; otherwise, this simple model can be grossly misspecified in the sense that relevant variables may have been omitted. In this paper, we establish asymptotic validity of the standard F-test on the surrogate parameter beta, in an appropriate sense, even when the simple model is misspecified, that is, without any restrictions on theta whatsoever and without assuming Gaussian data.
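The setup in the abstract can be illustrated with a small simulation. The sketch below is not taken from the paper: the sample sizes, the Gaussian draws for z and epsilon, the random choice of M, and the particular null hypothesis beta = 0 are all assumptions made for illustration, and the helper name `f_statistic` is hypothetical. It only shows how the classical F-statistic is computed in the submodel y = alpha + beta'x + e when the data come from an overall model with d much larger than n.

```python
import numpy as np

def f_statistic(y, X):
    """Classical F-statistic for H0: beta = 0 in y = alpha + beta'x + e."""
    n, p = X.shape
    X1 = np.column_stack([np.ones(n), X])        # prepend an intercept column
    coef, *_ = np.linalg.lstsq(X1, y, rcond=None)
    rss_full = np.sum((y - X1 @ coef) ** 2)      # residual sum of squares, submodel
    rss_null = np.sum((y - y.mean()) ** 2)       # residual sum of squares, intercept only
    return ((rss_null - rss_full) / p) / (rss_full / (n - p - 1))

rng = np.random.default_rng(0)
n, d, p = 50, 500, 3                              # assumed sizes with d >> n > p
Z = rng.standard_normal((n, d))                   # each row is one draw of z
theta = rng.standard_normal(d) / np.sqrt(d)       # no restrictions imposed on theta
y = 1.0 + Z @ theta + rng.standard_normal(n)      # overall model y = upsilon + theta'z + epsilon
M = rng.standard_normal((d, p))                   # d x p, full rank p with probability one
X = Z @ M                                         # each row is x = M'z
F = f_statistic(y, X)
print(f"F-statistic with {p} and {n - p - 1} degrees of freedom: {F:.3f}")
```

Because theta is unrestricted here, the submodel omits relevant variables, which is the misspecified situation the abstract refers to. The sketch computes the statistic only and makes no claim about its distribution.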
Pages: 1249-1272
Page count: 24
Related papers
50 in total
  • [1] PREDICTION WHEN FITTING SIMPLE MODELS TO HIGH-DIMENSIONAL DATA
    Steinberger, Lukas
    Leeb, Hannes
    ANNALS OF STATISTICS, 2019, 47 (03): : 1408 - 1442
  • [2] Robust Statistical Inference for High-Dimensional Data Models with Application to Genomics
    Sen, Pranab Kumar
    AUSTRIAN JOURNAL OF STATISTICS, 2006, 35 (2-3) : 197 - 214
  • [3] On the limits of fitting complex models of population history to f-statistics
    Maier, Robert
    Flegontov, Pavel
    Flegontova, Olga
    Isildak, Ulas
    Changmai, Piya
    Reich, David
    ELIFE, 2023, 12
  • [4] STATISTICAL INFERENCE IN SPARSE HIGH-DIMENSIONAL ADDITIVE MODELS
    Gregory, Karl
    Mammen, Enno
    Wahl, Martin
    ANNALS OF STATISTICS, 2021, 49 (03): : 1514 - 1536
  • [5] Model-Free Statistical Inference on High-Dimensional Data
    Guo, Xu
    Li, Runze
    Zhang, Zhe
    Zou, Changliang
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2024,
  • [6] Optimal statistical inference for individualized treatment effects in high-dimensional models
    Cai, Tianxi
    Cai, T. Tony
    Guo, Zijian
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2021, 83 (04) : 669 - 719
  • [7] Statistical Inference for High-Dimensional Matrix-Variate Factor Models
    Chen, Elynn Y.
    Fan, Jianqing
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (542) : 1038 - 1055
  • [8] Statistical Inference for High-Dimensional Generalized Linear Models With Binary Outcomes
    Cai, T. Tony
    Guo, Zijian
    Ma, Rong
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (542) : 1319 - 1332
  • [9] Inference for mixed models of ANOVA type with high-dimensional data
    Chen, Fei
    Li, Zaixing
    Shi, Lei
    Zhu, Lixing
    JOURNAL OF MULTIVARIATE ANALYSIS, 2015, 133 : 382 - 401
  • [10] On inference in high-dimensional logistic regression models with separated data
    Lewis, R. M.
    Battey, H. S.
    BIOMETRIKA, 2024, 111 (03)