Bayesian regression based on principal components for high-dimensional data

被引:8
|
作者
Lee, Jaeyong [1 ]
Oh, Hee-Seok [1 ]
机构
[1] Seoul Natl Univ, Seoul 151, South Korea
基金
新加坡国家研究基金会;
关键词
D O I
10.1016/j.jmva.2013.02.002
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The Gaussian sequence model can be obtained from the high-dimensional regression model through principal component analysis. It is shown that the Gaussian sequence model is equivalent to the original high-dimensional regression model in terms of prediction. Under a sparsity condition, we investigate the posterior consistency and convergence rates of the Gaussian sequence model. In particular, we examine two different modeling strategies: Bayesian inference with and without covariate selection. For Bayesian inferences without covariate selection, we obtain the consistency results of the estimators and posteriors with normal priors with constant and decreasing variances, and the James Stein estimator; for Bayesian inference with covariate selection, we obtain convergence rates of Bayesian model averaging (BMA) and median probability model (MPM) estimators, and the posterior with variable selection prior. Based on these results, we conclude that variable selection is essential in high-dimensional Bayesian regression. A simulation study also confirms the conclusion. The methodologies are applied to a climate prediction problem. (C) 2013 Elsevier Inc. All rights reserved.
引用
收藏
页码:175 / 192
页数:18
相关论文
共 50 条
  • [31] Comparing the performance of linear and nonlinear principal components in the context of high-dimensional genomic data integration
    Islam, Shofiqul
    Anand, Sonia
    Hamid, Jemila
    Thabane, Lehana
    Beyene, Joseph
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2017, 16 (03) : 199 - 216
  • [32] Sparse Bayesian multinomial probit regression model with correlation prior for high-dimensional data classification
    Yang Aijun
    Jiang Xuejun
    Liu Pengfei
    Lin Jinguan
    STATISTICS & PROBABILITY LETTERS, 2016, 119 : 241 - 247
  • [33] MWPCR: Multiscale Weighted Principal Component Regression for High-Dimensional Prediction
    Zhu, Hongtu
    Shen, Dan
    Peng, Xuewei
    Liu, Leo Yufeng
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (519) : 1009 - 1021
  • [34] Bayesian variable selection in clustering high-dimensional data
    Tadesse, MG
    Sha, N
    Vannucci, M
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2005, 100 (470) : 602 - 617
  • [35] Efficient quadratures for high-dimensional Bayesian data assimilation
    Cheng, Ming
    Wang, Peng
    Tartakovsky, Daniel M.
    JOURNAL OF COMPUTATIONAL PHYSICS, 2024, 506
  • [36] Bayesian variable selection for high-dimensional rank data
    Cui, Can
    Singh, Susheela P.
    Staicu, Ana-Maria
    Reich, Brian J.
    ENVIRONMETRICS, 2021, 32 (07)
  • [37] Model-based regression clustering for high-dimensional data: application to functional data
    Devijver, Emilie
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2017, 11 (02) : 243 - 279
  • [38] High-Dimensional Principal Projections
    Mas, Andre
    Ruymgaart, Frits
    COMPLEX ANALYSIS AND OPERATOR THEORY, 2015, 9 (01) : 35 - 63
  • [39] High-Dimensional Principal Projections
    André Mas
    Frits Ruymgaart
    Complex Analysis and Operator Theory, 2015, 9 : 35 - 63
  • [40] Robust high-dimensional regression for data with anomalous responses
    Mingyang Ren
    Sanguo Zhang
    Qingzhao Zhang
    Annals of the Institute of Statistical Mathematics, 2021, 73 : 703 - 736