Ecodatacube.eu: analysis-ready open environmental data cube for Europe

被引:7
|
作者
Witjes, Martijn [1 ]
Parente, Leandro [1 ]
Krizan, Josip [2 ]
Hengl, Tomislav [1 ]
Antonic, Luka [2 ]
机构
[1] OpenGeoHub, Wageningen, Netherlands
[2] MultiOne, Zagreb, Croatia
来源
PEERJ | 2023年 / 11卷
关键词
Sentinel-2; Landsat; Data cube; Digital terrain model; Elevation; Gap filling; Analysis-ready data; Land cover; Lucas; EcoDataCube; LAND-COVER; SENTINEL-2; MAP; ELEVATION;
D O I
10.7717/peerj.15478
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The article describes the production steps and accuracy assessment of an analysisready, open-access European data cube consisting of 2000-2020+ Landsat data, 2017-2021+ Sentinel-2 data and a 30 m resolution digital terrain model (DTM). The main purpose of the data cube is to make annual continental-scale spatiotemporal machine learning tasks accessible to a wider user base by providing a spatially and temporally consistent multidimensional feature space. This has required systematic spatiotemporal harmonization, efficient compression, and imputation of missing values. Sentinel-2 and Landsat reflectance values were aggregated into four quarterly averages approximating the four seasons common in Europe (winter, spring, summer and autumn), as well as the 25th and 75th percentile, in order to retain intra-seasonal variance. Remaining missing data in the Landsat time-series was imputed with a temporal moving window median (TMWM) approach. An accuracy assessment shows TMWM performs relatively better in Southern Europe and lower in mountainous regions such as the Scandinavian Mountains, the Alps, and the Pyrenees. We quantify the usability of the different component data sets for spatiotemporal machine learning tasks with a series of land cover classification experiments, which show that models utilizing the full feature space (30 m DTM, 30 m Landsat, 30 m and 10 m Sentinel-2) yield the highest land cover classification accuracy, with different data sets improving the results for different land cover classes. The data sets presented in the article are part of the EcoDataCube platform, which also hosts open vegetation, soil, and land use/land cover (LULC) maps created. All data sets are available under CC-BY license as Cloud-Optimized GeoTIFFs (ca. 12 TB in size) through SpatioTemporal Asset Catalog (STAC) and the EcoDataCube data portal.
引用
收藏
页数:30
相关论文
共 50 条
  • [1] SAR ANALYSIS READY DATA AND TOOLS FOR THE OPEN DATA CUBE
    Rosenqvist, Ake
    Killough, Brian D.
    Lubawy, Andrew M.
    Rattz, John C.
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 3391 - 3394
  • [2] DATACUBE STANDARDS AND THEIR CONTRIBUTION TO ANALYSIS-READY DATA
    Baumann, Peter
    IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 2051 - 2053
  • [3] Towards Sentinel-1 SAR Analysis-Ready Data: A Best Practices Assessment on Preparing Backscatter Data for the Cube
    Truckenbrodt, John
    Freemantle, Terri
    Williams, Chris
    Jones, Tom
    Small, David
    Dubois, Clemence
    Thiel, Christian
    Rossi, Cristian
    Syriou, Asimina
    Giuliani, Gregory
    DATA, 2019, 4 (03)
  • [4] ADVANCEMENTS IN THE OPEN DATA CUBE AND ANALYSIS READY DATA - PAST, PRESENT AND FUTURE
    Killough, Brian
    Siqueira, Andreia
    Dyke, George
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 3373 - 3375
  • [5] Generation of Raw RPPA Data and Their Conversion to Analysis-Ready Data
    Akbani, Rehan
    Ling, Shiyun
    Lu, Yiling
    REVERSE PHASE PROTEIN ARRAYS: FROM TECHNICAL AND ANALYTICAL FUNDAMENTALS TO APPLICATIONS, 2019, 1188 : 165 - 180
  • [6] An improved process for the creation, maintenance, and documentation of analysis-ready data
    Glaser, Allan
    DRUG INFORMATION JOURNAL, 2006, 40 (03): : 331 - 335
  • [7] An Improved Process for the Creation, Maintenance, and Documentation of Analysis-ready Data
    Allan Glaser
    Drug information journal : DIJ / Drug Information Association, 2006, 40 (3): : 331 - 335
  • [8] Good practices for sharing analysis-ready data in mammalogy and biodiversity research
    Verde Arregoitia, Luis D.
    Cooper, Natalie
    D'Elia, Guillermo
    HYSTRIX-ITALIAN JOURNAL OF MAMMALOGY, 2018, 29 (02): : 155 - 161
  • [9] TOWARDS A FRAMEWORK FOR OFFERING REMOTE SENSING DATA IN AN ANALYSIS-READY FORMAT
    Zhao, Jianghua
    Wang, Xuezhi
    Zhou, Yuanchun
    Qin, Qiming
    IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 5258 - 5261
  • [10] Pangeo Forge: Crowdsourcing Analysis-Ready, Cloud Optimized Data Production
    Stern, Charles
    Abernathey, Ryan
    Hamman, Joseph
    Wegener, Rachel
    Lepore, Chiara
    Harkins, Sean
    Merose, Alexander
    FRONTIERS IN CLIMATE, 2022, 3