Mallows model averaging with effective model size in fragmentary data prediction

被引:3
|
作者
Yuan, Chaoxia [1 ]
Fang, Fang [1 ,2 ]
Ni, Lyu [3 ]
机构
[1] East China Normal Univ, Sch Stat, Shanghai, Peoples R China
[2] East China Normal Univ, Key Lab Adv Theory & Applicat Stat & Data Sci MOE, Shanghai, Peoples R China
[3] East China Normal Univ, Sch Data Sci & Engn, 3663 North Zhongshan Rd, Shanghai 200062, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Asymptotic optimality; Effective model size; Fragmentary data; Multiple data sources; Mallows model averaging; GENERALIZED LINEAR-MODELS; ASYMPTOTIC OPTIMALITY; SELECTION; REGRESSION;
D O I
10.1016/j.csda.2022.107497
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Most existing model averaging methods consider fully observed data while fragmentary data, in which not all the covariate data are available for many subjects, becomes more and more popular nowadays with the increasing data sources in many areas such as economics, social sciences and medical studies. The main challenge of model averaging in fragmentary data is that the samples to fit candidate models are different to the sample used for weight selection, which introduces bias to the Mallows criterion in the classical Mallows Model Averaging (MMA). A novel Mallows model averaging method that utilizes the "effective model size " taking different samples into consideration is proposed and its asymptotic optimality is established. Empirical evidences from a simulation study and a real data analysis are presented. The proposed Effective Mallows Model Averaging (EMMA) method not only provides a novel solution to the fragmentary data prediction, but also sheds light on model selection when candidate models have different sample sizes, which has rarely been discussed in the literature. (C)& nbsp;2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Semiparametric model averaging prediction: a Bayesian approach
    Wang, Jingli
    Li, Jialiang
    AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2018, 60 (04) : 407 - 422
  • [42] Model averaging for prediction with discrete Bayesian networks
    Dash, Denver
    Cooper, Gregory F.
    Journal of Machine Learning Research, 2004, 5 : 1177 - 1203
  • [43] Hybrid link prediction via model averaging
    Zhang, Qi
    Tong, Tingting
    Wu, Shunyao
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2020, 556
  • [44] Bayesian averaging, prediction and nonnested model selection
    Hong, Han
    Preston, Bruce
    JOURNAL OF ECONOMETRICS, 2012, 167 (02) : 358 - 369
  • [45] Model averaging for prediction with discrete Bayesian networks
    Dash, D
    Cooper, GF
    JOURNAL OF MACHINE LEARNING RESEARCH, 2004, 5 : 1177 - 1203
  • [46] Jackknife model averaging for additive expectile prediction
    Sun, Xianwen
    Zhang, Lixin
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2024, 53 (19) : 6799 - 6831
  • [47] Bayesian model averaging for river flow prediction
    Paul J. Darwen
    Applied Intelligence, 2019, 49 : 103 - 111
  • [48] Bayesian model averaging for river flow prediction
    Darwen, Paul J.
    APPLIED INTELLIGENCE, 2019, 49 (01) : 103 - 111
  • [49] Semiparametric model averaging prediction for dichotomous response
    Fang, Fang
    Li, Jialiang
    Xia, Xiaochao
    JOURNAL OF ECONOMETRICS, 2022, 229 (02) : 219 - 245
  • [50] Investigation on the improvement of prediction by bootstrap model averaging
    Holländer, N
    Augustin, NH
    Sauerbrei, W
    METHODS OF INFORMATION IN MEDICINE, 2006, 45 (01) : 44 - 50