Distributed estimation for large-scale expectile regression

被引:0
|
作者
Pan, Yingli [1 ]
Wang, Haoyu [1 ]
Zhao, Xiaoluo [1 ]
Xu, Kaidong [1 ]
Liu, Zhan [1 ]
机构
[1] Hubei Univ, Fac Math & Stat, Hubei Key Lab Appl Math, Wuhan, Peoples R China
基金
中国国家自然科学基金;
关键词
Distributed algorithm; Expectile regression; GEL function; Large-scale data;
D O I
10.1080/03610918.2023.2245181
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Analysis of large volume of data is very complex due to not only the high level of skewness and heteroscedasticity of variance but also the difficulty of data storage. Expectile regression is a common alternative method to analyze heterogeneous data. Distributed storage can reduce effectively the storage burden of a single machine. In this paper, we consider fitting linear expectile regression model to estimate conditional expectile based on large-scale data. We store the data in a distributed manner and construct a gradient-enhanced loss (GEL) function as a proxy for the global loss function. A distributed algorithm is proposed for the optimization of the GEL function. The asymptotic properties of the proposed estimator are established. Simulation studies are conducted to assess the finite-sample performance of our proposed estimator. Applications to an analysis of the National Health Interview Survey data set demonstrate the practicability of the proposed method.
引用
收藏
页码:104 / 119
页数:16
相关论文
共 50 条
  • [21] Distributed Mean-Field Density Estimation for Large-Scale Systems
    Zheng, Tongjia
    Han, Qing
    Lin, Hai
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2022, 67 (10) : 5218 - 5229
  • [22] Distributed LMMSE Estimation for Large-Scale Systems Based on Local Information
    Wang, Yan
    Xiong, Junlin
    Ho, Daniel W. C.
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (08) : 8528 - 8536
  • [23] Trade-Offs in Large-Scale Distributed Tuplewise Estimation And Learning
    Vogel, Robin
    Bellet, Aurelien
    Clemencon, Stephan
    Jelassi, Ons
    Papa, Guillaume
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT II, 2020, 11907 : 229 - 245
  • [24] Distributed Nonparametric Regression Imputation for Missing Response Problems with Large-scale Data
    Wang, Ruoyu
    Su, Miaomiao
    Wang, Qihua
    JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [25] A Augmented Lagrangian Approach for Distributed Robust Estimation in Large-Scale Systems
    Chan, Shing Chow
    Wu, Ho Chun
    Ho, Cheuk Hei
    Zhang, Li
    IEEE SYSTEMS JOURNAL, 2019, 13 (03): : 2986 - 2997
  • [26] Distributed dynamic state estimation with parameter identification for large-scale systems
    Sun, Yibing
    Fu, Minyue
    Wang, Bingchang
    Zhang, Huanshui
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2017, 354 (14): : 6200 - 6216
  • [27] QUANTILE REGRESSION FOR LARGE-SCALE APPLICATIONS
    Yang, Jiyan
    Meng, Xiangrui
    Mahoney, Michael W.
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2014, 36 (05): : S78 - S110
  • [28] Large-Scale Sparse Logistic Regression
    Liu, Jun
    Chen, Jianhui
    Ye, Jieping
    KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2009, : 547 - 555
  • [29] Bayesian estimation of large-scale simulation models with Gaussian process regression surrogates
    Barde, Sylvain
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2024, 196
  • [30] Large-Scale Supervised Process Monitoring Based on Distributed Modified Principal Component Regression
    Rong, Mengyu
    Shi, Hongbo
    Tan, Shuai
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2019, 58 (39) : 18223 - 18240