MWPCR: Multiscale Weighted Principal Component Regression for High-Dimensional Prediction

被引:8
|
作者
Zhu, Hongtu [1 ,2 ]
Shen, Dan [3 ,4 ]
Peng, Xuewei [5 ]
Liu, Leo Yufeng [1 ,6 ]
机构
[1] Univ Texas MD Anderson Canc Ctr, Dept Biostat, Houston, TX 77030 USA
[2] Univ N Carolina, Dept Biostat, Chapel Hill, NC USA
[3] Univ S Florida, Interdisciplinary Data Sci Consortium, Tampa, FL USA
[4] Univ S Florida, Dept Math & Stat, Tampa, FL USA
[5] Texas A&M Univ, College Stn, TX USA
[6] Univ N Carolina, Dept Stat & Operat Res, Chapel Hill, NC USA
关键词
Alzheimer; Feature; Principal component analysis; Regression; Spatial; Supervised; MODELS; CLASSIFICATION; VARIABLES; TUTORIAL;
D O I
10.1080/01621459.2016.1261710
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We propose a multiscale weighted principal component regression (MWPCR) framework for the use of high-dimensional features with strong spatial features (e.g., smoothness and correlation) to predict an outcome variable, such as disease status. This development is motivated by identifying imaging biomarkers that could potentially aid detection, diagnosis, assessment of prognosis, prediction of response to treatment, and monitoring of disease status, among many others. The MWPCR can be regarded as a novel integration of principal components analysis (PCA), kernel methods, and regression models. In MWPCR, we introduce various weight matrices to prewhitten high-dimensional feature vectors, perform matrix decomposition for both dimension reduction and feature extraction, and build a prediction model by using the extracted features. Examples of such weight matrices include an importance score weight matrix for the selection of individual features at each location and a spatial weight matrix for the incorporation of the spatial pattern of feature vectors. We integrate the importance of score weights with the spatial weights to recover the low-dimensional structure of high-dimensional features. We demonstrate the utility of our methods through extensive simulations and real data analyses of the Alzheimer's disease neuroimaging initiative (ADNI) dataset. Supplementary materials for this article are available online.
引用
收藏
页码:1009 / 1021
页数:13
相关论文
共 50 条
  • [21] Multiscale Two-Directional Two-Dimensional Principal Component Analysis and Its Application to High-Dimensional Biomedical Signal Classification
    Xie, Hong-Bo
    Zhou, Ping
    Guo, Tianruo
    Sivakumar, Bellie
    Zhang, Xu
    Dokos, Socrates
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2016, 63 (07) : 1416 - 1425
  • [22] New high-dimensional indexing structure based on principal component sorting
    School of Computer Science and Engineering, Xidian Univ., Xi'an 710071, China
    Xi Tong Cheng Yu Dian Zi Ji Shu/Syst Eng Electron, 2006, 12 (1927-1931):
  • [23] Sparse principal component analysis for high-dimensional stationary time series
    Fujimori, Kou
    Goto, Yuichi
    Liu, Yan
    Taniguchi, Masanobu
    SCANDINAVIAN JOURNAL OF STATISTICS, 2023, 50 (04) : 1953 - 1983
  • [24] High-Dimensional Principal Projections
    Mas, Andre
    Ruymgaart, Frits
    COMPLEX ANALYSIS AND OPERATOR THEORY, 2015, 9 (01) : 35 - 63
  • [25] High-Dimensional Principal Projections
    André Mas
    Frits Ruymgaart
    Complex Analysis and Operator Theory, 2015, 9 : 35 - 63
  • [26] Prediction of AOD data by geographical and temporal weighted regression with nonlinear principal component analysis
    Guangchao Li
    Wei Chen
    Ruren Li
    Yijin Chen
    Hongru Bi
    Haimeng Zhao
    Lihe Li
    Arabian Journal of Geosciences, 2020, 13
  • [27] Prediction of AOD data by geographical and temporal weighted regression with nonlinear principal component analysis
    Li, Guangchao
    Chen, Wei
    Li, Ruren
    Chen, Yijin
    Bi, Hongru
    Zhao, Haimeng
    Li, Lihe
    ARABIAN JOURNAL OF GEOSCIENCES, 2020, 13 (17)
  • [28] Penalized weighted smoothed quantile regression for high-dimensional longitudinal data
    Song, Yanan
    Han, Haohui
    Fu, Liya
    Wang, Ting
    STATISTICS IN MEDICINE, 2024, 43 (10) : 2007 - 2042
  • [29] High-Dimensional Multivariate Linear Regression with Weighted Nuclear Norm Regularization
    Suh, Namjoon
    Lin, Li-Hsiang
    Huo, Xiaoming
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2024, 33 (04) : 1264 - 1275
  • [30] Weak signals in high-dimensional regression: Detection, estimation and prediction
    Li, Yanming
    Hong, Hyokyoung G.
    Ahmed, S. Ejaz
    Li, Yi
    APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2019, 35 (02) : 283 - 298