Can the strengths of AIC and BIC be shared? A conflict between model indentification and regression estimation

被引:372
|
作者
Yang, YH [1 ]
机构
[1] Univ Minnesota, Sch Stat, Minneapolis, MN 55455 USA
基金
美国国家科学基金会;
关键词
AIC; BIC; consistency; minimax-rate optimality; model averaging; model selection;
D O I
10.1093/biomet/92.4.937
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
A traditional approach to statistical inference is to identify the true or best model first with little or no consideration of the specific goal of inference in the model identification stage. Can the pursuit of the true model also lead to optimal regression estimation? In model selection, it is well known that BIC is consistent in selecting the true model, and AIC is minimax-rate optimal for estimating the regression function. A recent promising direction is adaptive model selection, in which, in contrast to AIC and BIC, the penalty term is data-dependent. Some theoretical and empirical results have been obtained in support of adaptive model selection, but it is still not clear if it can really share the strengths of AIC and BIC. Model combining or averaging has attracted increasing attention as a means to overcome the model selection uncertainty. Can Bayesian model averaging be optimal for estimating the regression function in a minimax sense? We show that the answers to these questions are basically in the negative: for any model selection criterion to be consistent, it must behave suboptimally for estimating the regression function in terms of minimax rate of covergence; and Bayesian model averaging cannot be minimax-rate optimal for regression estimation.
引用
收藏
页码:937 / 950
页数:14
相关论文
共 17 条