Statistical Inference After Model Selection

Cited by: 48
Authors
Berk, Richard [1, 2]
Brown, Lawrence [1]
Zhao, Linda [1]
Affiliations
[1] Univ Penn, Dept Stat, Philadelphia, PA 19104 USA
[2] Univ Penn, Dept Criminol, Philadelphia, PA 19104 USA
Funding
National Science Foundation (USA)
Keywords
Model selection; Statistical inference; Mixtures of distributions; DANTZIG SELECTOR; LARGER;
DOI
10.1007/s10940-009-9077-7
Chinese Library Classification
DF [Law]; D9 [Law]
Subject classification code
0301 ;
Abstract
Conventional statistical inference requires that a model of how the data were generated be known before the data are analyzed. Yet in criminology, and in the social sciences more broadly, a variety of model selection procedures are routinely undertaken followed by statistical tests and confidence intervals computed for a "final" model. In this paper, we examine such practices and show how they are typically misguided. The parameters being estimated are no longer well defined, and post-model-selection sampling distributions are mixtures with properties that are very different from what is conventionally assumed. Confidence intervals and statistical tests do not perform as they should. We examine in some detail the specific mechanisms responsible. We also offer some suggestions for better practice and show through a criminal justice example using real data how proper statistical inference in principle may be obtained.
Pages: 217 - 236
Page count: 20
Related papers
50 in total
  • [31] Model selection, simplicity, and scientific inference
    Myrvold, WC
    Harper, WL
    PHILOSOPHY OF SCIENCE, 2002, 69 (03) : S135 - S149
  • [32] ADVANCED STATISTICAL METHODS: INFERENCE, VARIABLE SELECTION, AND EXPERIMENTAL DESIGN
    Ryzhov, Ilya O.
    Zhang, Qiong
    Chen, Ye
    2020 WINTER SIMULATION CONFERENCE (WSC), 2020, : 1 - 15
  • [33] Model selection for the rate problem: A comparison of significance testing, Bayesian, and minimum description length statistical inference
    Lee, MD
    Pope, KJ
    JOURNAL OF MATHEMATICAL PSYCHOLOGY, 2006, 50 (02) : 193 - 202
  • [34] Accounting for spatial autocorrelation from model selection to statistical inference: Application to a national survey of a diurnal raptor
    Le Rest, Kevin
    Pinaud, David
    Bretagnolle, Vincent
    ECOLOGICAL INFORMATICS, 2013, 14 : 17 - 24
  • [35] Statistical estimation with model selection
    Birge, Lucien
    INDAGATIONES MATHEMATICAE-NEW SERIES, 2006, 17 (04) : 497 - 537
  • [36] Statistical inference in high dimensional DEA model
    Yap, G.L.C.
    Ismail, W.R.
    Isa, Z.
    International Journal of Applied Mathematics and Statistics, 2012, 29 (05) : 17 - 33
  • [37] Statistical inference on uncertain nonparametric regression model
    Jianhua Ding
    Zhiqiang Zhang
    Fuzzy Optimization and Decision Making, 2021, 20 : 451 - 469
  • [38] Statistical inference in a random coefficient panel model
    Horvath, Lajos
    Trapani, Lorenzo
    JOURNAL OF ECONOMETRICS, 2016, 193 (01) : 54 - 75
  • [39] Statistical Inference in a Directed Network Model With Covariates
    Yan, Ting
    Jiang, Binyan
    Fienberg, Stephen E.
    Leng, Chenlei
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2019, 114 (526) : 857 - 868
  • [40] Statistical inference for the multidimensional mixed Rasch model
    Feddag, Mohand L.
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2008, 37 (09) : 1732 - 1749