Multilevel Functional Principal Component Analysis for High-Dimensional Data

被引:56
|
作者
Zipunnikov, Vadim [1 ]
Caffo, Brian [1 ]
Yousem, David M. [2 ]
Davatzikos, Christos [3 ]
Schwartz, Brian S. [4 ]
Crainiceanu, Ciprian [1 ]
机构
[1] Johns Hopkins Univ, Dept Biostat, Baltimore, MD 21205 USA
[2] Johns Hopkins Univ, Dept Radiol, Baltimore, MD 21205 USA
[3] Univ Penn, Sch Med, Dept Radiol, Philadelphia, PA 19104 USA
[4] Johns Hopkins Bloomberg Sch Publ Hlth, Baltimore, MD 21205 USA
关键词
Brain imaging data; MRI; Voxel-based morphology; VOXEL-BASED MORPHOMETRY; COGNITIVE FUNCTION; LEAD-EXPOSURE; BRAIN VOLUMES; ASSOCIATIONS; WORKERS; MODELS;
D O I
10.1198/jcgs.2011.10122
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We propose fast and scalable statistical methods for the analysis of hundreds or thousands of high-dimensional vectors observed at multiple visits. The proposed inferential methods do not require loading the entire dataset at once in the computer memory and instead use only sequential access to data. This allows deployment of our methodology on low-resource computers where computations can be done in minutes on extremely large datasets. Our methods are motivated by and applied to a study where hundreds of subjects were scanned using Magnetic Resonance Imaging (MRI) at two visits roughly five years apart. The original data possess over ten billion measurements. The approach can be applied to any type of study where data can be unfolded into a long vector including densely observed functions and images. Supplemental materials are provided with source code for simulations, some technical details and proofs, and additional imaging results of the brain study.
引用
收藏
页码:852 / 873
页数:22
相关论文
共 50 条
  • [41] When and Why are Principal Component Scores a Good Tool for Visualizing High-dimensional Data?
    Hellton, Kristoffer H.
    Thoresen, Magne
    SCANDINAVIAN JOURNAL OF STATISTICS, 2017, 44 (03) : 581 - 597
  • [42] Probabilistic predictive principal component analysis for spatially misaligned and high-dimensional air pollution data with missing observations
    Vu, Phuong T.
    Larson, Timothy, V
    Szpiro, Adam A.
    ENVIRONMETRICS, 2020, 31 (04)
  • [43] Reducing high-dimensional data by principal component analysis vs. random projection for nearest neighbor classification
    Deegalla, Sampath
    Bostrom, Henrik
    ICMLA 2006: 5TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2006, : 245 - +
  • [44] Principal Component Analysis (PCA) for high-dimensional data. PCA is dead. Long live PCA
    Yang, Fan
    Doksum, Kjell
    Tsui, Kam-Wah
    PERSPECTIVES ON BIG DATA ANALYSIS: METHODOLOGIES AND APPLICATIONS, 2014, 622 : 1 - 10
  • [45] CONVERGENCE AND PREDICTION OF PRINCIPAL COMPONENT SCORES IN HIGH-DIMENSIONAL SETTINGS
    Lee, Seunggeun
    Zou, Fei
    Wright, Fred A.
    ANNALS OF STATISTICS, 2010, 38 (06): : 3605 - 3629
  • [46] Using principal component analysis for neural network high-dimensional potential energy surface
    Casier, Bastien
    Carniato, Stephane
    Miteva, Tsveta
    Capron, Nathalie
    Sisourat, Nicolas
    JOURNAL OF CHEMICAL PHYSICS, 2020, 152 (23):
  • [47] DIMENSIONALITY REDUCTION OF HIGH-DIMENSIONAL DATA WITH A NONLINEAR PRINCIPAL COMPONENT ALIGNED GENERATIVE TOPOGRAPHIC MAPPING
    Griebel, M.
    Hullmann, A.
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2014, 36 (03): : A1027 - A1047
  • [48] Visualization of high-dimensional data on the probabilistic principal surface
    Chang, KY
    Ghosh, J
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT, VOLS 1 AND 2: INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT IN THE GLOBAL ECONOMY, 2005, : 1315 - 1319
  • [49] Optimal Linear Discriminant Analysis for High-Dimensional Functional Data
    Xue, Kaijie
    Yang, Jin
    Yao, Fang
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2024, 119 (546) : 1055 - 1064
  • [50] Recent advances in functional data analysis and high-dimensional statistics
    Aneiros, German
    Cao, Ricardo
    Fraiman, Ricardo
    Genest, Christian
    Vieu, Philippe
    JOURNAL OF MULTIVARIATE ANALYSIS, 2019, 170 : 3 - 9