PROJECTED SPLINE ESTIMATION OF THE NONPARAMETRIC FUNCTION IN HIGH-DIMENSIONAL PARTIALLY LINEAR MODELS FOR MASSIVE DATA

被引:21
|
作者
Lian, Heng [1 ]
Zhao, Kaifeng [2 ]
Lv, Shaogao [3 ]
机构
[1] City Univ Hong Kong, Dept Math, Kowloon, 83 Tat Chee Ave, Hong Kong, Peoples R China
[2] Philips Res China, Big Data & AI, 718 Lingshi Rd, Shanghai 200040, Peoples R China
[3] Nanjing Audit Univ, Dept Stat & Math, Nanjing 211815, Jiangsu, Peoples R China
来源
ANNALS OF STATISTICS | 2019年 / 47卷 / 05期
关键词
Asymptotic normality; B-splines; local asymptotics; profiled estimation; EFFICIENT ESTIMATION; VARIABLE SELECTION; LOCAL ASYMPTOTICS; REGRESSION;
D O I
10.1214/18-AOS1769
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this paper, we consider the local asymptotics of the nonparametric function in a partially linear model, within the framework of the divide-and-conquer estimation. Unlike the fixed-dimensional setting in which the parametric part does not affect the nonparametric part, the high-dimensional setting makes the issue more complicated. In particular, when a sparsity-inducing penalty such as lasso is used to make the estimation of the linear part feasible, the bias introduced will propagate to the nonparametric part. We propose a novel approach for estimation of the nonparametric function and establish the local asymptotics of the estimator. The result is useful for massive data with possibly different linear coefficients in each subpopulation but common nonparametric function. Some numerical illustrations are also presented.
引用
收藏
页码:2922 / 2949
页数:28
相关论文
共 50 条
  • [31] Maximum Likelihood for Variance Estimation in High-Dimensional Linear Models
    Dicker, Lee H.
    Erdogdu, Murat A.
    ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 51, 2016, 51 : 159 - 167
  • [32] Estimation of high-dimensional linear factor models with grouped variables
    Heaton, Chris
    Solo, Victor
    JOURNAL OF MULTIVARIATE ANALYSIS, 2012, 105 (01) : 348 - 367
  • [33] Optimal Estimation of Genetic Relatedness in High-Dimensional Linear Models
    Guo, Zijian
    Wang, Wanjie
    Cai, T. Tony
    Li, Hongzhe
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2019, 114 (525) : 358 - 369
  • [34] ESTIMATION IN HIGH-DIMENSIONAL LINEAR MODELS WITH DETERMINISTIC DESIGN MATRICES
    Shao, Jun
    Deng, Xinwei
    ANNALS OF STATISTICS, 2012, 40 (02): : 812 - 831
  • [35] Wavelet nonparametric estimation from strong spatial correlated high-dimensional data
    Frias, M. P.
    Ruiz-Medina, M. D.
    SPATIAL STATISTICS, 2016, 18 : 363 - 385
  • [36] Smooth-Threshold GEE Variable Selection in High-Dimensional Partially Linear Models with Longitudinal Data
    Tian, Ruiqin
    Xue, Liugen
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2015, 44 (07) : 1720 - 1734
  • [37] Semi-profiled distributed estimation for high-dimensional partially linear model
    Bao, Yajie
    Ren, Haojie
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2023, 188
  • [38] Spline-Lasso in High-Dimensional Linear Regression
    Guo, Jianhua
    Hu, Jianchang
    Jing, Bing-Yi
    Zhang, Zhen
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2016, 111 (513) : 288 - 297
  • [39] Estimation of Linear Functionals in High-Dimensional Linear Models: From Sparsity to Nonsparsity
    Zhao, Junlong
    Zhou, Yang
    Liu, Yufeng
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2024, 119 (546) : 1579 - 1591
  • [40] Estimation of Low Rank High-Dimensional Multivariate Linear Models for Multi-Response Data
    Zou, Changliang
    Ke, Yuan
    Zhang, Wenyang
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2022, 117 (538) : 693 - 703