Locally varying geostatistical machine learning for spatial prediction

被引:0
|
作者
Fouedjio, Francky [1 ,2 ,3 ]
Arya, Emet [1 ,3 ]
机构
[1] Rio Tinto, Data & Analyt, 152-158 St Georges Terrace, Perth, WA 6000, Australia
[2] Kaplan Business Sch Pty Ltd, Perth Campus,1325 Hay St, Perth, WA 6005, Australia
[3] Edith Cowan Univ, Sch Sci, 270 Joondalup Dr, Joondalup, WA 6027, Australia
关键词
Data augmentation; Geostatistics; Local stationarity; Machine learning; Conditional simulation; Spatial auto-correlation; Spatial non-stationarity; Spatial uncertainty;
D O I
10.1016/j.aiig.2024.100081
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
Machine learning methods dealing with the spatial auto-correlation of the response variable have garnered significant attention in the context of spatial prediction. Nonetheless, under these methods, the relationship between the response variable and explanatory variables is assumed to be homogeneous throughout the entire study area. This assumption, known as spatial stationarity, is very questionable in real-world situations due to the influence of contextual factors. Therefore, allowing the relationship between the target variable and predictor variables to vary spatially within the study region is more reasonable. However, existing machine learning techniques accounting for the spatially varying relationship between the dependent variable and the predictor variables do not capture the spatial auto-correlation of the dependent variable itself. Moreover, under these techniques, local machine learning models are effectively built using only fewer observations, which can lead to well-known issues such as over-fitting and the curse of dimensionality. This paper introduces a novel geostatistical machine learning approach where both the spatial auto-correlation of the response variable and the spatial non-stationarity of the regression relationship between the response and predictor variables are explicitly considered. The basic idea consists of relying on the local stationarity assumption to build a collection of local machine learning models while leveraging on the local spatial auto-correlation of the response variable to locally augment the training dataset. The proposed method's effectiveness is showcased via experiments conducted on synthetic spatial data with known characteristics as well as real-world spatial data. In the synthetic (resp. real) case study, the proposed method's predictive accuracy, as indicated by the Root Mean Square Error (RMSE) on the test set, is 17% (resp. 7%) better than that of popular machine learning methods dealing with the response variable's spatial auto-correlation. Additionally, this method is not only valuable for spatial prediction but also offers a deeper understanding of how the relationship between the target and predictor variables varies across space, and it can even be used to investigate the local significance of predictor variables.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Advanced geostatistical and machine-learning models for spatial data analysis of radioactively contaminated regions
    Kanevski, M
    Demyanov, V
    Pozdnukhov, A
    Parkin, R
    Savelieva, E
    Timonin, V
    Maignan, M
    ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2003, : 137 - 149
  • [22] Predicting spatial distribution of stable isotopes in precipitation by classical geostatistical- and machine learning methods
    Erdelyi, Daniel
    Hatvani, Istvan Gabor
    Jeon, Hyeongseon
    Jones, Matthew
    Tyler, Jonathan
    Kern, Zoltan
    JOURNAL OF HYDROLOGY, 2023, 617
  • [23] A stabilized and versatile spatial prediction method for geostatistical models
    Chen, Chun-Shu
    Yang, Hong-Ding
    Li, Yang
    ENVIRONMETRICS, 2014, 25 (02) : 127 - 141
  • [24] Spatial temperature prediction-a machine learning and GIS perspective
    Sailaja, B.
    Gayatri, S.
    Rathod, Santosha
    Padmavathi, Ch.
    Kumar, R. Nagarjuna
    Kumar, R. Mahender
    Sundaram, R. M.
    THEORETICAL AND APPLIED CLIMATOLOGY, 2024, 155 (11) : 9619 - 9642
  • [25] Machine Learning Modeling for Spatial-Temporal Prediction of Geohazard
    Ma, Junwei
    Dou, Jie
    SENSORS, 2023, 23 (22)
  • [26] Spatial prediction of soil contamination based on machine learning: a review
    Zhang Yang
    Lei Mei
    Li Kai
    Ju Tienan
    Frontiers of Environmental Science & Engineering, 2023, 17 (08)
  • [27] Geographically weighted regression with the integration of machine learning for spatial prediction
    Yang, Wentao
    Deng, Min
    Tang, Jianbo
    Luo, Liang
    JOURNAL OF GEOGRAPHICAL SYSTEMS, 2023, 25 (02) : 213 - 236
  • [28] Geographically weighted regression with the integration of machine learning for spatial prediction
    Wentao Yang
    Min Deng
    Jianbo Tang
    Liang Luo
    Journal of Geographical Systems, 2023, 25 : 213 - 236
  • [29] Spatial prediction of soil contamination based on machine learning: a review
    Zhang, Yang
    Lei, Mei
    Li, Kai
    Ju, Tienan
    FRONTIERS OF ENVIRONMENTAL SCIENCE & ENGINEERING, 2023, 17 (08)
  • [30] A coupled geostatistical and machine learning approach to address spatial prediction of trace metals and pollution indices in sediments of the abandoned gold mining site of Bekao, Adamawa, Cameroon
    Sayom, Reynolds Yvan Abende
    Mfenjou, Martin Luther
    Ngounouno, Mouhamed Ayiwouo
    Etoundi, Michele Maguy Cathya
    Boroh, William Andre
    Ngueyep, Luc Leroy Mambou
    Meying, Arsene
    HELIYON, 2023, 9 (08)