PROJECTED SPLINE ESTIMATION OF THE NONPARAMETRIC FUNCTION IN HIGH-DIMENSIONAL PARTIALLY LINEAR MODELS FOR MASSIVE DATA

Cited by: 21
Authors
Lian, Heng [1]
Zhao, Kaifeng [2]
Lv, Shaogao [3]
Affiliations
[1] City Univ Hong Kong, Dept Math, Kowloon, 83 Tat Chee Ave, Hong Kong, Peoples R China
[2] Philips Res China, Big Data & AI, 718 Lingshi Rd, Shanghai 200040, Peoples R China
[3] Nanjing Audit Univ, Dept Stat & Math, Nanjing 211815, Jiangsu, Peoples R China
Source
ANNALS OF STATISTICS | 2019 / Volume 47 / Issue 05
Keywords
Asymptotic normality; B-splines; local asymptotics; profiled estimation; EFFICIENT ESTIMATION; VARIABLE SELECTION; LOCAL ASYMPTOTICS; REGRESSION;
DOI
10.1214/18-AOS1769
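Chinese Library Classification number
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];
Discipline classification code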
020208 ; 070103 ; 0714 ;
Abstract
In this paper, we consider the local asymptotics of the nonparametric function in a partially linear model, within the framework of the divide-and-conquer estimation. Unlike the fixed-dimensional setting in which the parametric part does not affect the nonparametric part, the high-dimensional setting makes the issue more complicated. In particular, when a sparsity-inducing penalty such as lasso is used to make the estimation of the linear part feasible, the bias introduced will propagate to the nonparametric part. We propose a novel approach for estimation of the nonparametric function and establish the local asymptotics of the estimator. The result is useful for massive data with possibly different linear coefficients in each subpopulation but common nonparametric function. Some numerical illustrations are also presented.
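The setting described in the abstract (subpopulations with different linear coefficients but a common nonparametric function) can be illustrated with a minimal sketch. This is not the projected spline estimator proposed in the paper: it assumes a low-dimensional linear part fitted by ordinary least squares (no lasso), uses a truncated-power basis as a simple stand-in for B-splines, and combines subsets by plain averaging of the spline coefficients. All function names, the simulated data, and the test function sin(2πt) are hypothetical choices made for illustration.

```python
import numpy as np

def spline_basis(t, knots, degree=3):
    """Truncated-power spline basis (a simple stand-in for B-splines)."""
    t = np.asarray(t, dtype=float)
    powers = [t ** d for d in range(degree + 1)]  # d = 0 gives the intercept
    trunc = [np.clip(t - k, 0.0, None) ** degree for k in knots]
    return np.column_stack(powers + trunc)

def fit_subset(X, t, y, knots):
    """Least-squares fit of y = X beta + g(t) + noise on one subset."""
    B = spline_basis(t, knots)
    design = np.hstack([X, B])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    p = X.shape[1]
    return coef[:p], coef[p:]  # (linear part, spline coefficients)

def divide_and_conquer_g(subsets, knots):
    """Average the spline coefficients across subsets to estimate the common g."""
    spline_coefs = [fit_subset(X, t, y, knots)[1] for X, t, y in subsets]
    avg = np.mean(spline_coefs, axis=0)
    return lambda t_new: spline_basis(t_new, knots) @ avg

# Simulated data (assumption): 5 subpopulations, each with its own beta,
# sharing the same nonparametric function g(t) = sin(2 * pi * t).
rng = np.random.default_rng(0)
knots = np.linspace(0.0, 1.0, 7)[1:-1]  # 5 interior knots on [0, 1]
g_true = lambda t: np.sin(2 * np.pi * t)

subsets = []
for _ in range(5):
    X = rng.normal(size=(2000, 3))
    t = rng.uniform(size=2000)
    beta_s = rng.normal(size=3)  # subset-specific linear coefficients
    y = X @ beta_s + g_true(t) + 0.5 * rng.normal(size=2000)
    subsets.append((X, t, y))

g_hat = divide_and_conquer_g(subsets, knots)
grid = np.linspace(0.05, 0.95, 50)
max_err = float(np.max(np.abs(g_hat(grid) - g_true(grid))))
print(f"max abs error on grid: {max_err:.3f}")
```

In the high-dimensional case the abstract is concerned with, a penalized fit of the linear part would replace the least-squares step, and the resulting bias in the nonparametric part is what the paper's approach is designed to address; this sketch does not attempt that correction.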
Pages: 2922 - 2949
Number of pages: 28