ON MODEL SELECTION FROM A FINITE FAMILY OF POSSIBLY MISSPECIFIED TIME SERIES MODELS

Cited by: 16
Authors
Hsu, Hsiang-Ling [1]
Ing, Ching-Kang [2]
Tong, Howell [3, 4]
Affiliations
[1] Natl Univ Kaohsiung, Inst Stat, 700 Kaohsiung Rd, Kaohsiung 811, Taiwan
[2] Natl Tsing Hua Univ, Inst Stat, 101,Sect 2,Kuang Fu Rd, Hsinchu 30013, Taiwan
[3] Univ Elect Sci & Technol, 4,Sect 2,North Jianshe Rd, Chengdu 610054, Sichuan, Peoples R China
[4] London Sch Econ, Dept Stat, Houghton St, London WC2A 2AE, England
Source
ANNALS OF STATISTICS | 2019 / Vol. 47 / Issue 2
Keywords
AIC; BIC; misspecification-resistant information criterion; multistep prediction error; high-dimensional misspecified models; orthogonal greedy algorithm; INFORMATION CRITERIA; LINEAR-MODELS; MOMENT BOUNDS; REGRESSION; ORDER; PREDICTION; PRINCIPLES; INDEX;
DOI
10.1214/18-AOS1706
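Chinese Library Classification (CLC) number
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];
Subject classification codes
020208; 070103; 0714;
Abstract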
Consider finite parametric time series models. "I have n observations and k models, which model should I choose on the basis of the data alone" is a frequently asked question in many practical situations. This poses the key problem of selecting a model from a collection of candidate models, none of which is necessarily the true data generating process (DGP). Although existing literature on model selection is vast, there is a serious lacuna in that the above problem does not seem to have received much attention. In fact, existing model selection criteria have avoided addressing the above problem directly, either by assuming that the true DGP is included among the candidate models and aiming at choosing this DGP, or by assuming that the true DGP can be asymptotically approximated by an increasing sequence of candidate models and aiming at choosing the candidate having the best predictive capability in some asymptotic sense. In this article, we propose a misspecification-resistant information criterion (MRIC) to address the key problem directly. We first prove the asymptotic efficiency of MRIC whether the true DGP is among the candidates or not, within the fixed-dimensional framework. We then extend this result to the high-dimensional case in which the number of candidate variables is much larger than the sample size. In particular, we show that MRIC can be used in conjunction with a high-dimensional model selection method to select the (asymptotically) best predictive model across several high-dimensional misspecified time series models.
Pages: 1061-1087
Number of pages: 27