smashGP: Large-Scale Spatial Modeling via Matrix-Free Gaussian Processes

被引:0
|
作者
Erlandson, Lucas [1 ]
Gomez, Ana Maria Estrada [2 ]
Chow, Edmond [1 ]
Paynabar, Kamran [3 ]
机构
[1] Georgia Inst Technol, Sch Computat Sci & Engn, Atlanta, GA USA
[2] Purdue Univ, Sch Ind Engn, W Lafayette, IN 47907 USA
[3] Georgia Inst Technol, Sch Ind & Syst Engn, Atlanta, GA USA
关键词
Gaussian processes; Hierarchical matrices; Matrix-free methods; Spatial data analysis; RANDOM-FIELDS; CLUSTERS;
D O I
10.1080/10618600.2024.2353653
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Gaussian processes are essential for spatial data analysis. Not only do they allow the prediction of unknown values, but they also allow for uncertainty quantification. However, in the era of big data, directly using Gaussian processes has become computationally infeasible as cubic run times are required for dense matrix decomposition and inversion. Various alternatives have been proposed to reduce the computational burden of directly fitting Gaussian processes. These alternatives rely on assumptions on the underlying structure of the covariance or precision matrices, such as sparsity or low-rank. In contrast, this article uses hierarchical matrices and matrix-free methods to enable the computation of Gaussian processes for large spatial datasets by exploiting the underlying kernel properties. The proposed framework, smashGP, represents the covariance matrix as an H2 matrix in O(n) time and is able to estimate the unknown parameters of the model and predict the values of spatial observations at unobserved locations in O(n log n) time thanks to fast matrix-vector products. Additionally, it can be parallelized to take full advantage of shared-memory computing environments. With simulations and case studies, we illustrate the advantage of smashGP to model large-scale spatial datasets. Supplementary materials for this article are available online.
引用
收藏
页码:15 / 33
页数:19
相关论文
共 50 条
  • [31] A data parallel approach for large-scale Gaussian process modeling
    Choudhury, A
    Nair, PB
    Keane, AJ
    PROCEEDINGS OF THE SECOND SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2002, : 95 - 111
  • [32] Revisiting the matrix-free solution of Markov regenerative processes
    Amparore, Elvio Gilberto
    Donatelli, Susanna
    NUMERICAL LINEAR ALGEBRA WITH APPLICATIONS, 2011, 18 (06) : 1067 - 1083
  • [33] An Unconditionally Stable Matrix-free Time-domain Method Independent of Element Shape for Multiscale and Large-scale Electromagnetic Analysis
    Yan, Jin
    Jiao, Dan
    2016 PROGRESS IN ELECTROMAGNETICS RESEARCH SYMPOSIUM (PIERS), 2016, : 926 - 926
  • [34] LARGE-SCALE EXPERIMENTAL AND MODELING STUDIES OF HYDROLOGICAL PROCESSES
    SHUTTLEWORTH, WJ
    AMBIO, 1994, 23 (01) : 82 - 86
  • [35] Large-scale Retrieval of Bayesian Machine Learning Models for Time Series Data via Gaussian Processes
    Berns, Fabian
    Beecks, Christian
    PROCEEDINGS OF THE 12TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (KDIR), VOL 1, 2020, : 71 - 80
  • [36] Dirichlet-based Gaussian Processes for Large-scale Calibrated Classification
    Milios, Dimitrios
    Camoriano, Raffaello
    Michiardi, Pietro
    Rosasco, Lorenzo
    Filippone, Maurizio
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [37] Heteroscedastic Gaussian Processes for Data Fusion in Large Scale Terrain Modeling
    Vasudevan, Shrihari
    Ramos, Fabio
    Nettleton, Eric
    Durrant-Whyte, Hugh
    2010 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2010, : 3452 - 3459
  • [38] MODELING LARGE SCALE SPECIES ABUNDANCE WITH LATENT SPATIAL PROCESSES
    Chakraborty, Avishek
    Gelfand, Alan E.
    Wilson, Adam M.
    Latimer, Andrew M.
    Silander, John A., Jr.
    ANNALS OF APPLIED STATISTICS, 2010, 4 (03): : 1403 - 1429
  • [39] MATHEMATICAL-MODELING OF LARGE-SCALE PROCESSES IN EARTH MAGNITOSPHERE
    DENISENKO, VV
    ERKAYEV, NV
    ZAMAI, SS
    KITAYEV, AV
    MEZENTSEV, AV
    MATVEYENKOV, IT
    PIVOVAROV, VG
    USPEKHI FIZICHESKIKH NAUK, 1993, 163 (01): : 101 - 102
  • [40] Visualization for Large-scale Gaussian Updates
    Rougier, Jonathan
    Zammit-Mangion, Andrew
    SCANDINAVIAN JOURNAL OF STATISTICS, 2016, 43 (04) : 1153 - 1161