Statistical Inference After Model Selection

被引:48
|
作者
Berk, Richard [1 ,2 ]
Brown, Lawrence [1 ]
Zhao, Linda [1 ]
机构
[1] Univ Penn, Dept Stat, Philadelphia, PA 19104 USA
[2] Univ Penn, Dept Criminol, Philadelphia, PA 19104 USA
基金
美国国家科学基金会;
关键词
Model selection; Statistical inference; Mixtures of distributions; DANTZIG SELECTOR; LARGER;
D O I
10.1007/s10940-009-9077-7
中图分类号
DF [法律]; D9 [法律];
学科分类号
0301 ;
摘要
Conventional statistical inference requires that a model of how the data were generated be known before the data are analyzed. Yet in criminology, and in the social sciences more broadly, a variety of model selection procedures are routinely undertaken followed by statistical tests and confidence intervals computed for a "final" model. In this paper, we examine such practices and show how they are typically misguided. The parameters being estimated are no longer well defined, and post-model-selection sampling distributions are mixtures with properties that are very different from what is conventionally assumed. Confidence intervals and statistical tests do not perform as they should. We examine in some detail the specific mechanisms responsible. We also offer some suggestions for better practice and show though a criminal justice example using real data how proper statistical inference in principle may be obtained.
引用
收藏
页码:217 / 236
页数:20
相关论文
共 50 条
  • [21] Statistical inference for the optimal approximating model
    Rohde, Angelika
    Duembgen, Lutz
    PROBABILITY THEORY AND RELATED FIELDS, 2013, 155 (3-4) : 839 - 865
  • [22] Statistical inference for the optimal approximating model
    Angelika Rohde
    Lutz Dümbgen
    Probability Theory and Related Fields, 2013, 155 : 839 - 865
  • [23] Statistical Inference of Selection and Divergence from a Time-Dependent Poisson Random Field Model
    Amei, Amei
    Sawyer, Stanley
    PLOS ONE, 2012, 7 (04):
  • [24] On model selection and model misspecification in causal inference
    Vansteelandt, Stijn
    Bekaert, Maarten
    Claeskens, Gerda
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2012, 21 (01) : 7 - 30
  • [25] Hybrid confidence intervals for informative uniform asymptotic inference after model selection
    Mccloskey, A.
    BIOMETRIKA, 2024, 111 (01) : 109 - 127
  • [26] Inference after model have selection (vol 99, pg 751, 2004)
    Shen, X.
    Huang, H. -C.
    Ye, J.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2006, 101 (475) : 1321 - 1321
  • [27] A hybrid inference framework for model selection
    Chai Xin
    Yang Bao-an
    Xie Zhi-ming
    PROCEEDINGS OF THE 2006 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE & ENGINEERING (13TH), VOLS 1-3, 2006, : 324 - 329
  • [28] Bayesian inference in a sample selection model
    van Hasselt, Martijn
    JOURNAL OF ECONOMETRICS, 2011, 165 (02) : 221 - 232
  • [29] Model selection and inference:: Facts and fiction
    Leeb, H
    Pötscher, BM
    ECONOMETRIC THEORY, 2005, 21 (01) : 21 - 59
  • [30] Model selection: An integral part of inference
    Buckland, ST
    Burnham, KP
    Augustin, NH
    BIOMETRICS, 1997, 53 (02) : 603 - 618