Multilevel Functional Principal Component Analysis for High-Dimensional Data

被引:56
|
作者
Zipunnikov, Vadim [1 ]
Caffo, Brian [1 ]
Yousem, David M. [2 ]
Davatzikos, Christos [3 ]
Schwartz, Brian S. [4 ]
Crainiceanu, Ciprian [1 ]
机构
[1] Johns Hopkins Univ, Dept Biostat, Baltimore, MD 21205 USA
[2] Johns Hopkins Univ, Dept Radiol, Baltimore, MD 21205 USA
[3] Univ Penn, Sch Med, Dept Radiol, Philadelphia, PA 19104 USA
[4] Johns Hopkins Bloomberg Sch Publ Hlth, Baltimore, MD 21205 USA
关键词
Brain imaging data; MRI; Voxel-based morphology; VOXEL-BASED MORPHOMETRY; COGNITIVE FUNCTION; LEAD-EXPOSURE; BRAIN VOLUMES; ASSOCIATIONS; WORKERS; MODELS;
D O I
10.1198/jcgs.2011.10122
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We propose fast and scalable statistical methods for the analysis of hundreds or thousands of high-dimensional vectors observed at multiple visits. The proposed inferential methods do not require loading the entire dataset at once in the computer memory and instead use only sequential access to data. This allows deployment of our methodology on low-resource computers where computations can be done in minutes on extremely large datasets. Our methods are motivated by and applied to a study where hundreds of subjects were scanned using Magnetic Resonance Imaging (MRI) at two visits roughly five years apart. The original data possess over ten billion measurements. The approach can be applied to any type of study where data can be unfolded into a long vector including densely observed functions and images. Supplemental materials are provided with source code for simulations, some technical details and proofs, and additional imaging results of the brain study.
引用
收藏
页码:852 / 873
页数:22
相关论文
共 50 条
  • [21] Fast Multilevel Functional Principal Component Analysis
    Cui, Erjia
    Li, Ruonan
    Crainiceanu, Ciprian M.
    Xiao, Luo
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2023, 32 (02) : 366 - 377
  • [22] Constrained principal component analysis with stochastically ordered scores for high-dimensional mass spectrometry data
    Hyun, Hyeong Jin
    Kim, Youngrae
    Kim, Sun Jo
    Kim, Joungyeon
    Lim, Johan
    Lim, Dong Kyu
    Kwon, Sung Won
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2021, 216
  • [23] Tensor robust principal component analysis with total generalized variation for high-dimensional data recovery
    Xu, Zhi
    Yang, Jing-Hua
    Wang, Chuan-long
    Wang, Fusheng
    Yan, Xi-hong
    APPLIED MATHEMATICS AND COMPUTATION, 2024, 483
  • [24] Sparse principal component analysis for high-dimensional stationary time series
    Fujimori, Kou
    Goto, Yuichi
    Liu, Yan
    Taniguchi, Masanobu
    SCANDINAVIAN JOURNAL OF STATISTICS, 2023, 50 (04) : 1953 - 1983
  • [25] High Dimensional Principal Component Analysis with Contaminated Data
    Xu, Huan
    Caramanis, Constantine
    Mannor, Shie
    ITW: 2009 IEEE INFORMATION THEORY WORKSHOP ON NETWORKING AND INFORMATION THEORY, 2009, : 246 - +
  • [26] Stringing High-Dimensional Data for Functional Analysis
    Chen, Kun
    Chen, Kehui
    Mueller, Hans-Georg
    Wang, Jane-Ling
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (493) : 275 - 284
  • [27] Principal Component Analysis of Two-Dimensional Functional Data
    Zhou, Lan
    Pan, Huijun
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2014, 23 (03) : 779 - 801
  • [28] A new proposal for a principal component-based test for high-dimensional data applied to the analysis of PhyloChip data
    Ding, Guo-Chun
    Smalla, Kornelia
    Heuer, Holger
    Kropf, Siegfried
    BIOMETRICAL JOURNAL, 2012, 54 (01) : 94 - 107
  • [29] Lagged principal trend analysis for longitudinal high-dimensional data
    Zhang, Yuping
    STAT, 2019, 8 (01):
  • [30] Joint principal trend analysis for longitudinal high-dimensional data
    Zhang, Yuping
    Ouyang, Zhengqing
    BIOMETRICS, 2018, 74 (02) : 430 - 438