Statistical Inference After Model Selection

被引:48
|
作者
Berk, Richard [1 ,2 ]
Brown, Lawrence [1 ]
Zhao, Linda [1 ]
机构
[1] Univ Penn, Dept Stat, Philadelphia, PA 19104 USA
[2] Univ Penn, Dept Criminol, Philadelphia, PA 19104 USA
基金
美国国家科学基金会;
关键词
Model selection; Statistical inference; Mixtures of distributions; DANTZIG SELECTOR; LARGER;
D O I
10.1007/s10940-009-9077-7
中图分类号
DF [法律]; D9 [法律];
学科分类号
0301 ;
摘要
Conventional statistical inference requires that a model of how the data were generated be known before the data are analyzed. Yet in criminology, and in the social sciences more broadly, a variety of model selection procedures are routinely undertaken followed by statistical tests and confidence intervals computed for a "final" model. In this paper, we examine such practices and show how they are typically misguided. The parameters being estimated are no longer well defined, and post-model-selection sampling distributions are mixtures with properties that are very different from what is conventionally assumed. Confidence intervals and statistical tests do not perform as they should. We examine in some detail the specific mechanisms responsible. We also offer some suggestions for better practice and show though a criminal justice example using real data how proper statistical inference in principle may be obtained.
引用
收藏
页码:217 / 236
页数:20
相关论文
共 50 条
  • [41] STATISTICAL INFERENCE FOR A BIVARIATE POISSON REGRESSION MODEL
    Riggs, Kent
    Young, Dean M.
    Stamey, James D.
    ADVANCES AND APPLICATIONS IN STATISTICS, 2008, 10 (01) : 55 - 73
  • [42] TOWARDS A MODEL THEORY OF STATISTICAL-INFERENCE
    HAVRANEK, T
    JOURNAL OF SYMBOLIC LOGIC, 1977, 42 (03) : 451 - 452
  • [43] Statistical Inference for a kind of Nonlinear Regression Model
    Jiang, Yuying
    Zhang, Yongming
    SUSTAINABLE DEVELOPMENT OF NATURAL RESOURCES, PTS 1-3, 2013, 616-618 : 2149 - 2152
  • [44] Statistical inference of Poisson censored δ-shock model
    Ma, Ming
    Peng, Bo
    Maocuo, La
    Ye, Jianhua
    Liu, Hua
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2024,
  • [45] Detecting model dependence in statistical inference: A response
    King, Gary
    Zeng, Langche
    INTERNATIONAL STUDIES QUARTERLY, 2007, 51 (01) : 231 - 241
  • [46] Statistical inference on uncertain nonparametric regression model
    Ding, Jianhua
    Zhang, Zhiqiang
    FUZZY OPTIMIZATION AND DECISION MAKING, 2021, 20 (04) : 451 - 469
  • [47] SAMPLING INFERENCE, AN ALTERNATE STATISTICAL-MODEL
    ZWIRNER, WW
    ENVIRONMENTAL MONITORING AND ASSESSMENT, 1991, 17 (2-3) : 247 - 252
  • [48] Statistical inference in the multinomial multiperiod probit model
    Geweke, JF
    Keane, MP
    Runkle, DE
    JOURNAL OF ECONOMETRICS, 1997, 80 (01) : 125 - 165
  • [49] Statistical Inference in high dimensional DEA model
    Yap, G. L. C.
    Ismail, W. R.
    Isa, Z.
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS & STATISTICS, 2012, 29 (05): : 17 - 33
  • [50] Statistical inference on kurtosis of independent component model
    Zhou, Bowen
    Chen, Hantao
    Wang, Cheng
    RANDOM MATRICES-THEORY AND APPLICATIONS, 2024, 13 (04)