Robust distributed modal regression for massive data
被引:33
|
作者:
Wang, Kangning
论文数: 0引用数: 0
h-index: 0
机构:
Shandong Technol & Business Univ, Sch Stat, Yantai, Peoples R ChinaShandong Technol & Business Univ, Sch Stat, Yantai, Peoples R China
Wang, Kangning
[1
]
Li, Shaomin
论文数: 0引用数: 0
h-index: 0
机构:
Beijing Normal Univ, Ctr Stat & Data Sci, Zhuhai, Peoples R China
Peking Univ, Guanghua Sch Management, Beijing, Peoples R ChinaShandong Technol & Business Univ, Sch Stat, Yantai, Peoples R China
Li, Shaomin
[2
,3
]
机构:
[1] Shandong Technol & Business Univ, Sch Stat, Yantai, Peoples R China
[2] Beijing Normal Univ, Ctr Stat & Data Sci, Zhuhai, Peoples R China
[3] Peking Univ, Guanghua Sch Management, Beijing, Peoples R China
Modal regression is a good alternative of the mean regression and likelihood based methods, because of its robustness and high efficiency. A robust communication-efficient distributed modal regression for the distributed massive data is proposed in this paper. Specifically, the global modal regression objective function is approximated by a surrogate one at the first machine, which relates to the local datasets only through gradients. Then the resulting estimator can be obtained at the first machine and other machines only need to calculate the gradients, which can significantly reduce the communication cost. Under mild conditions, the asymptotical properties are established, which show that the proposed estimator is statistically as efficient as the global modal regression estimator. What is more, as a specific application, a penalized robust communication-efficient distributed modal regression variable selection procedure is developed. Simulation results and real data analysis are also included to validate our method. (C) 2021 Elsevier B.V. All rights reserved.
机构:
Stern School of Business, New York UniversityStern School of Business, New York University
Xi Chen
Weidong Liu
论文数: 0引用数: 0
h-index: 0
机构:
School of Mathematical Sciences and MoE Key Lab of Artificial Intelligence,Shanghai Jiao Tong UniversityStern School of Business, New York University
Weidong Liu
Xiaojun Mao
论文数: 0引用数: 0
h-index: 0
机构:
School of Data Science, Fudan UniversityStern School of Business, New York University
机构:
NYU, Stern Sch Business, New York, NY 10012 USANYU, Stern Sch Business, New York, NY 10012 USA
Chen, Xi
Liu, Weidong
论文数: 0引用数: 0
h-index: 0
机构:
Shanghai Jiao Tong Univ, Sch Math Sci, Shanghai 200240, Peoples R China
Shanghai Jiao Tong Univ, MoE Key Lab Artificial Intelligence, Shanghai 200240, Peoples R ChinaNYU, Stern Sch Business, New York, NY 10012 USA
Liu, Weidong
Mao, Xiaojun
论文数: 0引用数: 0
h-index: 0
机构:
Fudan Univ, Sch Data Sci, Shanghai 200433, Peoples R ChinaNYU, Stern Sch Business, New York, NY 10012 USA
机构:
Peking Univ, Guanghua Sch Management, Beijing, Peoples R China
Peking Univ, Ctr Stat Sci, Beijing, Peoples R ChinaPeking Univ, Guanghua Sch Management, Beijing, Peoples R China
Chen, Song Xi
Peng, Liuhua
论文数: 0引用数: 0
h-index: 0
机构:
Univ Melbourne, Sch Math & Stat, Melbourne, Vic, AustraliaPeking Univ, Guanghua Sch Management, Beijing, Peoples R China
Peng, Liuhua
ANNALS OF STATISTICS,
2021,
49
(05):
: 2851
-
2869
机构:
Changchun Univ Technol, Sch Math & Stat, Changchun 130012, Jilin, Peoples R ChinaChangchun Univ Technol, Sch Math & Stat, Changchun 130012, Jilin, Peoples R China
Zou, Yuhao
Yuan, Xiaohui
论文数: 0引用数: 0
h-index: 0
机构:
Changchun Univ Technol, Sch Math & Stat, Changchun 130012, Jilin, Peoples R ChinaChangchun Univ Technol, Sch Math & Stat, Changchun 130012, Jilin, Peoples R China
Yuan, Xiaohui
Liu, Tianqing
论文数: 0引用数: 0
h-index: 0
机构:
Jilin Univ, Ctr Appl Stat Res, Changchun 130012, Jilin, Peoples R China
Jilin Univ, Sch Math, Changchun 130012, Jilin, Peoples R ChinaChangchun Univ Technol, Sch Math & Stat, Changchun 130012, Jilin, Peoples R China