Additive partially linear models for massive heterogeneous data

被引:9
|
作者
Wang, Binhuan [1 ]
Fang, Yixin [2 ]
Lian, Heng [3 ]
Liang, Hua [4 ]
机构
[1] NYU, Dept Populat Hlth, Sch Med, New York, NY 10003 USA
[2] New Jersey Inst Technol, Dept Math Sci, Newark, NJ 07102 USA
[3] City Univ Hong Kong, Dept Math, Hong Kong, Peoples R China
[4] George Washington Univ, Dept Stat, Washington, DC 20052 USA
来源
ELECTRONIC JOURNAL OF STATISTICS | 2019年 / 13卷 / 01期
关键词
Divide-and-conquer; homogeneity; heterogeneity; oracle property; regression splines;
D O I
10.1214/18-EJS1528
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider an additive partially linear framework for modelling massive heterogeneous data. The major goal is to extract multiple common features simultaneously across all sub-populations while exploring heterogeneity of each sub-population. We propose an aggregation type of estimators for the commonality parameters that possess the asymptotic optimal bounds and the asymptotic distributions as if there were no heterogeneity. This oracle result holds when the number of sub-populations does not grow too fast and the tuning parameters are selected carefully. A plugin estimator for the heterogeneity parameter is further constructed, and shown to possess the asymptotic distribution as if the commonality information were available. Furthermore, we develop a heterogeneity test for the linear components and a homogeneity test for the non-linear components accordingly. The performance of the proposed methods is evaluated via simulation studies and an application to the Medicare Provider Utilization and Payment data.
引用
收藏
页码:391 / 431
页数:41
相关论文
共 50 条
  • [41] Variable selection for additive partially linear models with measurement error
    Zhou, Zhangong
    Jiang, Rong
    Qian, Weimin
    METRIKA, 2011, 74 (02) : 185 - 202
  • [42] Variable selection in partially linear additive models for modal regression
    Lv, Jing
    Yang, Hu
    Guo, Chaohui
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2017, 46 (07) : 5646 - 5665
  • [43] Empirical Likelihood for Nonparametric Components in Additive Partially Linear Models
    Zhao, Peixin
    Xue, Liugen
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2013, 42 (09) : 1935 - 1947
  • [44] Variable selection for additive partially linear models with measurement error
    Zhangong Zhou
    Rong Jiang
    Weimin Qian
    Metrika, 2011, 74 : 185 - 202
  • [45] Partially additive categories and fully complete models of linear logic
    Haghverdi, E
    TYPED LAMBDA CALCULI AND APPLICATIONS, PROCEEDINGS, 2001, 2044 : 197 - 216
  • [46] Statistical inference for partially linear additive spatial autoregressive models
    Du, Jiang
    Sun, Xiaoqian
    Cao, Ruiyuan
    Zhang, Zhongzhan
    SPATIAL STATISTICS, 2018, 25 : 52 - 67
  • [47] Variational inferences for partially linear additive models with variable selection
    Zhao, Kaifeng
    Lian, Heng
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 80 : 223 - 239
  • [48] Partially Linear Models under Data Combination
    D'Haultfoeuille, X.
    Gaillac, C.
    Maurel, A.
    REVIEW OF ECONOMIC STUDIES, 2024, 92 (01): : 238 - 267
  • [49] PROJECTED SPLINE ESTIMATION OF THE NONPARAMETRIC FUNCTION IN HIGH-DIMENSIONAL PARTIALLY LINEAR MODELS FOR MASSIVE DATA
    Lian, Heng
    Zhao, Kaifeng
    Lv, Shaogao
    ANNALS OF STATISTICS, 2019, 47 (05): : 2922 - 2949
  • [50] Communication-efficient distributed estimation of partially linear additive models for large-scale data
    Gao, Junzhuo
    Wang, Lei
    INFORMATION SCIENCES, 2023, 631 : 185 - 201