Computationally efficient nonstationary nearest-neighbor Gaussian process models using data-driven techniques

被引:3
|
作者
Konomi, B. A. [1 ]
Hanandeh, A. A. [2 ]
Ma, P. [3 ,4 ]
Kang, E. L. [1 ]
机构
[1] Univ Cincinnati, Dept Math Sci, Div Stat & Data Sci, Cincinnati, OH 45221 USA
[2] Yarmouk Univ, Dept Stat, Irbid, Jordan
[3] Stat & Appl Math Sci Inst, Durham, NC USA
[4] Duke Univ, Dept Stat Sci, Durham, NC USA
基金
美国国家科学基金会;
关键词
Bayesian hierarchical modeling; binary tree; large data sets; Markov chain Monte Carlo (MCMC); nonstationary covariance function; TOMS ozone data; RANDOM-FIELDS; BAYESIAN-INFERENCE; LIKELIHOOD;
D O I
10.1002/env.2571
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Due to the increased availability of measurements of various geophysical processes, a need has arisen for statistical methods suitable for the analysis of very large nonstationary spatial data sets. The nearest-neighbor Gaussian process (NNGP) models are one of the latest and most popular Gaussian process-based models, which reduce computational complexity and memory storage. The Bayesian inference is based on the assumption of a parametric covariance function that is often assumed stationary or known. Given that NNGP models are sensitive in the stationary assumption in comparison to other reduction methods, there is a need to build nonstationary covariance functions within the NNGP models. However, the construction of a nonstationary covariance function and/or matrix may be computationally expensive by itself in the presence of big data. In this paper, we develop an efficient two-stage approach that deals with nonstationarity and the computational complexity in the presence of a big spatial data set. We propose a new low-cost data-driven tree-structured partitioning technique to divide the spatial region into distinct subregions. Given the partitions, we construct computationally efficient nonstationary covariance functions for NNGP models. We demonstrate the performance of our approach through simulation experiments and an application to the global Total Ozone Matrix Spectrometer (TOMS) data set, in which the proposed approach performs well in terms of both prediction accuracy and computational complexity.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Data-driven Soft Sensors using Factor Graphs and Gaussian Mixture Models
    Gienger, Andreas
    Sawodny, Oliver
    2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 4466 - 4471
  • [32] 3DNN: 3D Nearest Neighbor Data-Driven Geometric Scene Understanding Using 3D Models
    Satkin, Scott
    Rashid, Maheen
    Lin, Jason
    Hebert, Martial
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (01) : 69 - 97
  • [33] Data-driven stochastic AC-OPF using Gaussian process regression
    Mitrovic, Mile
    Lukashevich, Aleksandr
    Vorobev, Petr
    Terzija, Vladimir
    Budennyy, Semen
    Maximov, Yury
    Deka, Deepjyoti
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2023, 152
  • [34] GAUSSIAN PROCESS EMULATION FOR BIG DATA IN DATA-DRIVEN METAMATERIALS DESIGN
    Bostanabad, Ramin
    Chan, Yu-Chin
    Wang, Liwei
    Zhu, Ping
    Chen, Wei
    PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, 2019, VOL 2A, 2020,
  • [35] Data-Driven Calibration of Multifidelity Multiscale Fracture Models Via Latent Map Gaussian Process
    Deng, Shiguang
    Mora, Carlos
    Apelian, Diran
    Bostanabad, Ramin
    JOURNAL OF MECHANICAL DESIGN, 2023, 145 (01)
  • [36] DATA-DRIVEN COMBUSTION MODELING FOR A TURBULENT FLAME SIMULATED WITH A COMPUTATIONALLY EFFICIENT SOLVER
    Talei, Mohsen
    Ma, Man-Ching
    Sandberg, Richard
    PROCEEDINGS OF THE ASME TURBO EXPO 2020: TURBOMACHINERY TECHNICAL CONFERENCE AND EXHIBITION, VOL 4A, 2020,
  • [37] Computationally efficient data-driven model predictive control for modular multilevel converters
    Raja, Muneeb Masood
    Wang, Haoran
    Arshad, Muhammad Haseeb
    Kish, Gregory J.
    Zhao, Qing
    IET ELECTRIC POWER APPLICATIONS, 2024, 18 (12) : 1844 - 1859
  • [38] Computationally Efficient Data-Driven Joint Chance Constraints for Power Systems Scheduling
    Wu, Chutian
    Kargarian, Amin
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2023, 38 (03) : 2858 - 2867
  • [39] Computationally efficient data-driven surge map modeling for centrifugal air compressors
    Wu, Xin
    Li, Yaoyu
    2007 AMERICAN CONTROL CONFERENCE, VOLS 1-13, 2007, : 5321 - 5326
  • [40] Single Image Super Resolution Using Nearest Neighbor Local Gaussian Process Regression
    Lu Ziwei
    Wu Chengdong
    Yu Xiaosheng
    PROCEEDINGS OF 2018 10TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING (ICMLC 2018), 2018, : 236 - 241