Statistical Inference After Model Selection

被引:48
|
作者
Berk, Richard [1 ,2 ]
Brown, Lawrence [1 ]
Zhao, Linda [1 ]
机构
[1] Univ Penn, Dept Stat, Philadelphia, PA 19104 USA
[2] Univ Penn, Dept Criminol, Philadelphia, PA 19104 USA
基金
美国国家科学基金会;
关键词
Model selection; Statistical inference; Mixtures of distributions; DANTZIG SELECTOR; LARGER;
D O I
10.1007/s10940-009-9077-7
中图分类号
DF [法律]; D9 [法律];
学科分类号
0301 ;
摘要
Conventional statistical inference requires that a model of how the data were generated be known before the data are analyzed. Yet in criminology, and in the social sciences more broadly, a variety of model selection procedures are routinely undertaken followed by statistical tests and confidence intervals computed for a "final" model. In this paper, we examine such practices and show how they are typically misguided. The parameters being estimated are no longer well defined, and post-model-selection sampling distributions are mixtures with properties that are very different from what is conventionally assumed. Confidence intervals and statistical tests do not perform as they should. We examine in some detail the specific mechanisms responsible. We also offer some suggestions for better practice and show though a criminal justice example using real data how proper statistical inference in principle may be obtained.
引用
收藏
页码:217 / 236
页数:20
相关论文
共 50 条
  • [1] Statistical Inference After Model Selection
    Richard Berk
    Lawrence Brown
    Linda Zhao
    Journal of Quantitative Criminology, 2010, 26 : 217 - 236
  • [2] Inference after model selection
    Shen, XT
    Huang, HC
    Ye, J
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2004, 99 (467) : 751 - 762
  • [3] Selection of a model in reflectometry: use of the linear statistical inference
    Samoilenko, I
    Feigin, L
    Shchedrin, B
    Antolini, R
    PHYSICA B, 2000, 283 (1-3): : 262 - 267
  • [4] Statistical inference and model selection for the 1861 Hagelloch measles epidemic
    Neal, PJ
    Roberts, GO
    BIOSTATISTICS, 2004, 5 (02) : 249 - 261
  • [5] STATISTICAL INFERENCE FOR MEASUREMENT EQUATION SELECTION IN THE LOG-REALGARCH MODEL
    Li, Yu-Ning
    Zhang, Yi
    Zhang, Caiya
    ECONOMETRIC THEORY, 2019, 35 (05) : 943 - 977
  • [6] Robust process capability indices and statistical inference based on model selection
    Wang, Shuai
    Chiang, Jyun-You
    Tsai, Tzong-Ru
    Qin, Yan
    COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 156
  • [7] UNIFORM ASYMPTOTIC INFERENCE AND THE BOOTSTRAP AFTER MODEL SELECTION
    Tibshirani, Ryan J.
    Rinaldo, Alessandro
    Tibshirani, Rob
    Wasserman, Larry
    ANNALS OF STATISTICS, 2018, 46 (03): : 1255 - 1287
  • [8] STATISTICAL-INFERENCE, MODEL SELECTION AND RESEARCH EXPERIENCE - A MULTINOMIAL MODEL OF DATA MINING
    MARQUEZ, J
    SHACKMARQUEZ, J
    WASCHER, WL
    ECONOMICS LETTERS, 1985, 18 (01) : 39 - 44
  • [9] Bootstrap for inference after model selection and model averaging for likelihood models
    Garcia-Angulo, Andrea C.
    Claeskens, Gerda
    METRIKA, 2024, 88 (3) : 311 - 340
  • [10] Akaike-type criteria and the reliability of inference: Model selection versus statistical model specification
    Spanos, Aris
    JOURNAL OF ECONOMETRICS, 2010, 158 (02) : 204 - 220