Least Squares Model Averaging for Distributed Data

Cited by: 0
Authors
Zhang, Haili [1 ]
Liu, Zhaobo [2 ]
Zou, Guohua [3 ]
Affiliations
[1] Shenzhen Polytech Univ, Inst Appl Math, Shenzhen 518055, Peoples R China
[2] Shenzhen Univ, Inst Adv Study, Shenzhen 518060, Peoples R China
[3] Capital Normal Univ, Sch Math Sci, Beijing 100048, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
consistency; distributed data; divide and conquer algorithm; Mallows' criterion; model averaging; optimality; focused information criterion; big data; regression; selection; inference
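DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract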
The divide-and-conquer algorithm is a common strategy in big data analysis. Model averaging has a natural divide-and-conquer feature, but its theory has not been developed for big data scenarios. The goal of this paper is to fill this gap. We propose two divide-and-conquer-type model averaging estimators for linear models with distributed data. Under some regularity conditions, we show that the weights from the Mallows model averaging criterion converge in L_2 to the theoretically optimal weights that minimize the risk of the model averaging estimator. We also give bounds on the in-sample and out-of-sample mean squared errors and prove the asymptotic optimality of the proposed model averaging estimators. Our conclusions hold even when the dimensions and the number of candidate models diverge. Simulation results and an analysis of real airline data illustrate that the proposed model averaging methods perform better than commonly used model selection and model averaging methods in distributed data settings. Our approaches contribute to model averaging theory for distributed data and parallel computation, and can be applied in big data analysis to save time and reduce the computational burden.
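
The general idea in the abstract can be illustrated with a minimal sketch. It is not a reproduction of the paper's two estimators, whose exact form the abstract does not give. The sketch assumes the simplest possible setting: two nested candidate models (intercept only, and intercept plus one slope), a Mallows-type criterion minimized in closed form on each data block, and a block-size-weighted average of the per-block estimates as the combining step. The function names `fit_block`, `dc_model_average`, and `simulate` are hypothetical.

```python
import random


def fit_block(xs, ys):
    """Mallows-weighted average of two nested OLS fits on one data block.

    Candidate 1: intercept only (1 parameter).
    Candidate 2: intercept plus slope (2 parameters).
    Returns (intercept, slope, w), where w is the weight on candidate 1.
    Assumes the block has more than 2 points and non-constant xs.
    """
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar

    # Residuals of the larger model and the difference between the two fits.
    resid_full = [y - (intercept + slope * x) for x, y in zip(xs, ys)]
    diff = [(intercept + slope * x) - y_bar for x in xs]

    # Error variance estimated from the largest candidate model.
    sigma2 = sum(e * e for e in resid_full) / (n - 2)
    dd = sum(v * v for v in diff)
    cross = sum(e * v for e, v in zip(resid_full, diff))

    # Criterion: ||resid_full + w * diff||^2 + 2 * sigma2 * (2 - w).
    # It is quadratic in w, so minimize in closed form and clip to [0, 1].
    w = 1.0 if dd == 0.0 else min(1.0, max(0.0, (sigma2 - cross) / dd))
    return w * y_bar + (1.0 - w) * intercept, (1.0 - w) * slope, w


def dc_model_average(xs, ys, n_blocks):
    """Split the data into blocks, fit each one, and combine the estimates.

    The combining rule (weights proportional to block size) is an assumption
    made for this sketch.
    """
    n = len(xs)
    bounds = [k * n // n_blocks for k in range(n_blocks + 1)]
    intercept = 0.0
    slope = 0.0
    weights = []
    for lo, hi in zip(bounds[:-1], bounds[1:]):
        b0, b1, w = fit_block(xs[lo:hi], ys[lo:hi])
        share = (hi - lo) / n
        intercept += share * b0
        slope += share * b1
        weights.append(w)
    return intercept, slope, weights


def simulate(n, seed=0):
    """Toy data: y = 1 + 0.5 * x + standard normal noise."""
    rng = random.Random(seed)
    xs = [rng.gauss(0.0, 1.0) for _ in range(n)]
    ys = [1.0 + 0.5 * x + rng.gauss(0.0, 1.0) for x in xs]
    return xs, ys


xs, ys = simulate(6000)
b0, b1, ws = dc_model_average(xs, ys, n_blocks=6)
print(b0, b1, ws)
```

Each block needs only its own data to compute its weights and coefficients, so the per-block fits could run in parallel on separate machines, with only the small per-block summaries sent to a central node.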
Pages: 59