Robust distributed modal regression for massive data

被引:33
|
作者
Wang, Kangning [1 ]
Li, Shaomin [2 ,3 ]
机构
[1] Shandong Technol & Business Univ, Sch Stat, Yantai, Peoples R China
[2] Beijing Normal Univ, Ctr Stat & Data Sci, Zhuhai, Peoples R China
[3] Peking Univ, Guanghua Sch Management, Beijing, Peoples R China
基金
中国博士后科学基金;
关键词
Massive data; Robustness; Communication-efficient; Modal regression; Variable selection; VARIABLE SELECTION; LIKELIHOOD; LASSO;
D O I
10.1016/j.csda.2021.107225
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Modal regression is a good alternative of the mean regression and likelihood based methods, because of its robustness and high efficiency. A robust communication-efficient distributed modal regression for the distributed massive data is proposed in this paper. Specifically, the global modal regression objective function is approximated by a surrogate one at the first machine, which relates to the local datasets only through gradients. Then the resulting estimator can be obtained at the first machine and other machines only need to calculate the gradients, which can significantly reduce the communication cost. Under mild conditions, the asymptotical properties are established, which show that the proposed estimator is statistically as efficient as the global modal regression estimator. What is more, as a specific application, a penalized robust communication-efficient distributed modal regression variable selection procedure is developed. Simulation results and real data analysis are also included to validate our method. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] Robust distributed multicategory angle-based classification for massive data
    Gaoming Sun
    Xiaozhou Wang
    Yibo Yan
    Riquan Zhang
    Metrika, 2024, 87 : 299 - 323
  • [22] Distributed robust Gaussian Process regression
    Mair, Sebastian
    Brefeld, Ulf
    KNOWLEDGE AND INFORMATION SYSTEMS, 2018, 55 (02) : 415 - 435
  • [23] Distributed robust Gaussian Process regression
    Sebastian Mair
    Ulf Brefeld
    Knowledge and Information Systems, 2018, 55 : 415 - 435
  • [24] Distributed penalizing function criterion for local polynomial estimation in nonparametric regression with massive data
    Sun, Tianqi
    Li, Weiyu
    Lin, Lu
    STATISTICAL PAPERS, 2025, 66 (03)
  • [25] Is Massive MIMO Robust Against Distributed Jammers?
    Gulgun, Ziya
    Bjornson, Emil
    Larsson, Erik G.
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2021, 69 (01) : 457 - 469
  • [26] Modal regression with streaming data sets
    Gao, Wenliang
    Chen, Yujie
    Du, Haiyan
    Sun, Xiaofei
    JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2025,
  • [27] Robust Estimation for Partial Functional Linear Regression Model Based on Modal Regression
    YU Ping
    ZHU Zhongyi
    SHI Jianhong
    AI Xikai
    Journal of Systems Science & Complexity, 2020, 33 (02) : 527 - 544
  • [28] Modal-Regression-Based Broad Learning System for Robust Regression and Classification
    Liu, Licheng
    Liu, Tingyun
    Chen, C. L. Philip
    Wang, Yaonan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (09) : 12344 - 12357
  • [29] Robust Estimation for Partial Functional Linear Regression Model Based on Modal Regression
    Ping Yu
    Zhongyi Zhu
    Jianhong Shi
    Xikai Ai
    Journal of Systems Science and Complexity, 2020, 33 : 527 - 544
  • [30] Robust Estimation for Partial Functional Linear Regression Model Based on Modal Regression
    Yu, Ping
    Zhu, Zhongyi
    Shi, Jianhong
    Ai, Xikai
    JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2020, 33 (02) : 527 - 544