Computationally efficient nonstationary nearest-neighbor Gaussian process models using data-driven techniques

Cited by: 3
Authors
Konomi, B. A. [1]
Hanandeh, A. A. [2]
Ma, P. [3, 4]
Kang, E. L. [1]
Affiliations
[1] Univ Cincinnati, Dept Math Sci, Div Stat & Data Sci, Cincinnati, OH 45221 USA
[2] Yarmouk Univ, Dept Stat, Irbid, Jordan
[3] Stat & Appl Math Sci Inst, Durham, NC USA
[4] Duke Univ, Dept Stat Sci, Durham, NC USA
Funding
U.S. National Science Foundation;
Keywords
Bayesian hierarchical modeling; binary tree; large data sets; Markov chain Monte Carlo (MCMC); nonstationary covariance function; TOMS ozone data; RANDOM-FIELDS; BAYESIAN-INFERENCE; LIKELIHOOD;
DOI
10.1002/env.2571
Chinese Library Classification (CLC) number
X [Environmental Science, Safety Science];
Discipline classification code
08; 0830;
Abstract
Due to the increased availability of measurements of various geophysical processes, a need has arisen for statistical methods suitable for the analysis of very large nonstationary spatial data sets. Nearest-neighbor Gaussian process (NNGP) models are among the latest and most popular Gaussian process-based models that reduce computational complexity and memory storage. Bayesian inference for these models relies on a parametric covariance function that is often assumed to be stationary or known. Because NNGP models are more sensitive to the stationarity assumption than other reduction methods, there is a need to build nonstationary covariance functions within NNGP models. However, constructing a nonstationary covariance function and/or matrix may itself be computationally expensive in the presence of big data. In this paper, we develop an efficient two-stage approach that addresses both nonstationarity and computational complexity for big spatial data sets. We propose a new low-cost, data-driven, tree-structured partitioning technique to divide the spatial region into distinct subregions. Given the partitions, we construct computationally efficient nonstationary covariance functions for NNGP models. We demonstrate the performance of our approach through simulation experiments and an application to the global Total Ozone Mapping Spectrometer (TOMS) data set, in which the proposed approach performs well in terms of both prediction accuracy and computational complexity.
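To illustrate why nearest-neighbor conditioning reduces cost, the following is a minimal sketch of a generic NNGP (Vecchia-type) log-likelihood, not the authors' implementation. It assumes a zero-mean process, a stationary exponential covariance, and the given ordering of locations; the function names (exp_cov, nngp_loglik) and the parameters (sigma2, phi, nugget, m) are hypothetical choices for illustration. The paper's nonstationary, partition-based covariance would take the place of exp_cov.

```python
import numpy as np

def exp_cov(a, b, sigma2=1.0, phi=1.0):
    """Stationary exponential covariance (an assumption made for this sketch)."""
    d = np.linalg.norm(a[:, None, :] - b[None, :, :], axis=-1)
    return sigma2 * np.exp(-d / phi)

def nngp_loglik(y, locs, m=5, sigma2=1.0, phi=1.0, nugget=1e-8):
    """Approximate zero-mean GP log-likelihood: each y[i] is conditioned only
    on its (at most) m nearest neighbors among the previously ordered points."""
    ll = 0.0
    for i in range(len(y)):
        mean, var = 0.0, sigma2 + nugget
        if i > 0:
            # nearest neighbors among points 0..i-1 (brute-force search)
            d = np.linalg.norm(locs[:i] - locs[i], axis=1)
            nb = np.argsort(d)[:m]
            C_nn = exp_cov(locs[nb], locs[nb], sigma2, phi) + nugget * np.eye(len(nb))
            c_in = exp_cov(locs[i:i + 1], locs[nb], sigma2, phi).ravel()
            w = np.linalg.solve(C_nn, c_in)   # kriging weights, one small m-by-m solve
            mean = w @ y[nb]
            var = sigma2 + nugget - w @ c_in  # conditional variance
        ll += -0.5 * (np.log(2 * np.pi * var) + (y[i] - mean) ** 2 / var)
    return ll
```

Each term needs only an m-by-m linear solve, so the linear algebra scales roughly as n·m³ rather than n³ for the full covariance matrix; the brute-force neighbor search used here is for brevity only. When m is at least n − 1, every point conditions on all earlier points and the sum reproduces the exact Gaussian log-likelihood.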
Pages: 20