NONSEPARABLE DYNAMIC NEAREST NEIGHBOR GAUSSIAN PROCESS MODELS FOR LARGE SPATIO-TEMPORAL DATA WITH AN APPLICATION TO PARTICULATE MATTER ANALYSIS

被引:81
|
作者
Datta, Abhirup [1 ]
Banerjee, Sudipto [2 ]
Finley, Andrew O. [3 ,4 ]
Hamm, Nicholas A. S. [5 ]
Schaap, Martijn [6 ]
机构
[1] Johns Hopkins Univ, Dept Biostat, Baltimore, MD USA
[2] Univ Calif Los Angeles, Dept Biostat, Los Angeles, CA 90095 USA
[3] Michigan State Univ, Dept Forestry, E Lansing, MI 48824 USA
[4] Michigan State Univ, Dept Geog, E Lansing, MI 48824 USA
[5] Univ Twente, Fac Geoinformat Sci & Earth Observat ITC, NL-7500 AE Enschede, Netherlands
[6] TNO, Dept Climate Air & Sustainabil, NL-3508 TA Utrecht, Netherlands
来源
ANNALS OF APPLIED STATISTICS | 2016年 / 10卷 / 03期
基金
美国国家科学基金会;
关键词
Nonseparable spatio-temporal models; scalable Gaussian process; nearest neighbors; Bayesian inference; Markov chain Monte Carlo; environmental pollutants; AIR-POLLUTION; COVARIANCE FUNCTIONS; SPATIAL MODELS; LOTOS-EUROS; SPACE; QUALITY; IDENTIFICATION; INTERPOLATION; PREDICTION;
D O I
10.1214/16-AOAS931
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Particulate matter (PM) is a class of malicious environmental pollutants known to be detrimental to human health. Regulatory efforts aimed at curbing PM levels in different countries often require high resolution space-time maps that can identify red-flag regions exceeding statutory concentration limits. Continuous spatio-temporal Gaussian Process (GP) models can deliver maps depicting predicted PM levels and quantify predictive uncertainty. However, GP-based approaches are usually thwarted by computational challenges posed by large datasets. We construct a novel class of scalable Dynamic Nearest Neighbor Gaussian Process (DNNGP) models that can provide a sparse approximation to any spatio-temporal GP (e.g., with nonseparable covariance structures). The DNNGP we develop here can be used as a sparsity-inducing prior for spatio-temporal random effects in any Bayesian hierarchical model to deliver full posterior inference. Storage and memory requirements for a DNNGP model are linear in the size of the dataset, thereby delivering massive scalability without sacrificing inferential richness. Extensive numerical studies reveal that the DNNGP provides substantially superior approximations to the underlying process than low-rank approximations. Finally, we use the DNNGP to analyze a massive air quality dataset to substantially improve predictions of PM levels across Europe in conjunction with the LOTOS-EUROS chemistry transport models (CTMs).
引用
收藏
页码:1286 / 1316
页数:31
相关论文
共 50 条
  • [21] A spatio-temporal model for the analysis and prediction of fine particulate matter concentration in Beijing
    Wan, Yating
    Xu, Minya
    Huang, Hui
    Chen, Song Xi
    ENVIRONMETRICS, 2021, 32 (01)
  • [22] SPATIO-TEMPORAL LARGE MARGIN NEAREST NEIGHBOR (ST-LMNN) BASED ON RIEMANNIAN FEATURES FOR INDIVIDUAL IDENTIFICATION
    Su, Yong
    Feng, Zhiyong
    Xing, Meng
    2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2018,
  • [23] starve: An R package for spatio-temporal analysis of research survey data using nearest-neighbour Gaussian processes
    Lawler, Ethan
    Field, Chris
    Mills Flemming, Joanna
    METHODS IN ECOLOGY AND EVOLUTION, 2023, 14 (03): : 817 - 830
  • [24] Gaussian processes on the support of cylindrical surfaces, with application to periodic spatio-temporal data
    Majumdar, Anandamayee
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2014, 153 : 27 - 41
  • [25] Dynamic lattice-Markov spatio-temporal models for environmental data
    Garside, LM
    Wilkinson, DJ
    BAYESIAN STATISTICS 7, 2003, : 535 - 542
  • [26] Scalable Semiparametric Spatio-temporal Regression for Large Data Analysis
    Ma, Ting Fung
    Wang, Fangfang
    Zhu, Jun
    Ives, Anthony R.
    Lewinska, Katarzyna E.
    JOURNAL OF AGRICULTURAL BIOLOGICAL AND ENVIRONMENTAL STATISTICS, 2023, 28 (02) : 279 - 298
  • [27] Scalable Semiparametric Spatio-temporal Regression for Large Data Analysis
    Ting Fung Ma
    Fangfang Wang
    Jun Zhu
    Anthony R. Ives
    Katarzyna E. Lewińska
    Journal of Agricultural, Biological and Environmental Statistics, 2023, 28 : 279 - 298
  • [28] Spatio-temporal Data Analysis for Development of Microclimate Prediction Models
    Hong, Tageui
    Heo, Yeonsook
    PROCEEDINGS OF BUILDING SIMULATION 2021: 17TH CONFERENCE OF IBPSA, 2022, 17 : 878 - 885
  • [29] Spatio-Temporal Gaussian Process Models for Extended and Group Object Tracking With Irregular Shapes
    Aftab, Waqas
    Hostettler, Roland
    De Freitas, Allan
    Arvaneh, Mahnaz
    Mihaylova, Lyudmila
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (03) : 2137 - 2151
  • [30] SPATIO-TEMPORAL ANALYSIS OF DEPENDENT RISK WITH AN APPLICATION TO CYBERATTACKS DATA
    Kim, Shonghyun
    Lim, Chae Young
    Rho, Yeonwoo
    ANNALS OF APPLIED STATISTICS, 2024, 18 (04): : 3549 - 3569