Bayesian regression based on principal components for high-dimensional data

被引:8
|
作者
Lee, Jaeyong [1 ]
Oh, Hee-Seok [1 ]
机构
[1] Seoul Natl Univ, Seoul 151, South Korea
基金
新加坡国家研究基金会;
关键词
D O I
10.1016/j.jmva.2013.02.002
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The Gaussian sequence model can be obtained from the high-dimensional regression model through principal component analysis. It is shown that the Gaussian sequence model is equivalent to the original high-dimensional regression model in terms of prediction. Under a sparsity condition, we investigate the posterior consistency and convergence rates of the Gaussian sequence model. In particular, we examine two different modeling strategies: Bayesian inference with and without covariate selection. For Bayesian inferences without covariate selection, we obtain the consistency results of the estimators and posteriors with normal priors with constant and decreasing variances, and the James Stein estimator; for Bayesian inference with covariate selection, we obtain convergence rates of Bayesian model averaging (BMA) and median probability model (MPM) estimators, and the posterior with variable selection prior. Based on these results, we conclude that variable selection is essential in high-dimensional Bayesian regression. A simulation study also confirms the conclusion. The methodologies are applied to a climate prediction problem. (C) 2013 Elsevier Inc. All rights reserved.
引用
收藏
页码:175 / 192
页数:18
相关论文
共 50 条
  • [41] Quantile forward regression for high-dimensional survival data
    Lee, Eun Ryung
    Park, Seyoung
    Lee, Sang Kyu
    Hong, Hyokyoung G.
    LIFETIME DATA ANALYSIS, 2023, 29 (04) : 769 - 806
  • [42] Robust high-dimensional regression for data with anomalous responses
    Ren, Mingyang
    Zhang, Sanguo
    Zhang, Qingzhao
    ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2021, 73 (04) : 703 - 736
  • [43] Robust linear regression for high-dimensional data: An overview
    Filzmoser, Peter
    Nordhausen, Klaus
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2021, 13 (04)
  • [44] Quantile forward regression for high-dimensional survival data
    Eun Ryung Lee
    Seyoung Park
    Sang Kyu Lee
    Hyokyoung G. Hong
    Lifetime Data Analysis, 2023, 29 : 769 - 806
  • [45] Bayesian feature selection in high-dimensional regression in presence of correlated noise
    Feldman, Guy
    Bhadra, Anindya
    Kirshner, Sergey
    STAT, 2014, 3 (01): : 258 - 272
  • [46] High-dimensional covariance forecasting based on principal component analysis of high-frequency data
    Jian, Zhihong
    Deng, Pingjun
    Zhu, Zhican
    ECONOMIC MODELLING, 2018, 75 : 422 - 431
  • [47] Bayesian inference for high-dimensional linear regression under mnet priors
    Tan, Aixin
    Huang, Jian
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2016, 44 (02): : 180 - 197
  • [48] A Bayesian approach with generalized ridge estimation for high-dimensional regression and testing
    Yang, Szu-Peng
    Emura, Takeshi
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2017, 46 (08) : 6083 - 6105
  • [49] Lagged principal trend analysis for longitudinal high-dimensional data
    Zhang, Yuping
    STAT, 2019, 8 (01):
  • [50] Multilevel Functional Principal Component Analysis for High-Dimensional Data
    Zipunnikov, Vadim
    Caffo, Brian
    Yousem, David M.
    Davatzikos, Christos
    Schwartz, Brian S.
    Crainiceanu, Ciprian
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2011, 20 (04) : 852 - 873