Application of training data affects success in broad-scale local climate zone mapping

被引:13
|
作者
Xu, Chunxue [1 ]
Hystad, Perry [2 ]
Chen, Rui [3 ]
Van Den Hoek, Jamon [1 ]
Hutchinson, Rebecca A. [4 ,5 ]
Hankey, Steve [6 ]
Kennedy, Robert [1 ]
机构
[1] Oregon State Univ, Coll Earth Ocean & Atmospher Sci, Corvallis, OR 97331 USA
[2] Oregon State Univ, Coll Publ Hlth & Human Sci, Corvallis, OR 97331 USA
[3] Tufts Univ, Dept Comp Sci, Medford, MA 02155 USA
[4] Oregon State Univ, Sch Elect Engn & Comp Sci, Corvallis, OR 97331 USA
[5] Oregon State Univ, Dept Fisheries Wildlife & Conservat Sci, Corvallis, OR 97331 USA
[6] VA Tech, Sch Publ & Int Affairs, Blacksburg, VA USA
关键词
Local climate zone; Machine learning; Training areas; Crowdsourced data; Spatial autocorrelation; DIFFERENCE WATER INDEX; SENTINEL-2; IMAGES; CROSS-VALIDATION; CLASSIFICATION; FOREST; NDWI;
D O I
10.1016/j.jag.2021.102482
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Satellite imagery has been widely used to map urbanization processes. To address the urgent need for urban landscape mapping that goes beyond urban footprint analysis, the local climate zone (LCZ) scheme has been increasingly used to reveal the urban forms and functions important to urban heat islands and micro-climates across the globe. As with most supervised classification strategies, proper application of training data is critical for the success of LCZ classification models. However, the collection and application of LCZ training areas brings with it two challenges that may affect mapping success. First, because digitizing training areas is a timeconsuming task, there is a broad effort in the LCZ mapping community to create a crowdsourced data collection among different experts. However, this strategy likely leads to inconsistencies in labels that could weaken models. Second, the LCZ labeling process typically involves the delineation of large zones from which multiple training samples are drawn, but those samples are likely spatially autocorrelated and lead to overly optimistic estimates of model accuracy. Although both effects - inconsistent labeling and spatial autocorrelation - are theoretically possible, it is unknown whether they substantially affect accuracy. We investigated both issues, specifically asking: (i) how do the discrepancies of LCZ labeling by different experts impact broad-scale LCZ mapping? (ii) to what extent does spatial correlation affect model prediction power? We used two classifiers (Random Forests and ResNets) to map eight metropolitan areas in the US into LCZs, comparing training areas drawn by different or consistent interpreters, and data splitting strategy using rules that allow or reduce spatial autocorrelation. We found large discrepancies among results built from crowdsourced training areas digitized by different experts; improving the consistency of labels can lead to substantial improvements in LCZ classification accuracy. Second, we found that spatial autocorrelation can boost the apparent accuracy of the classifier by 16% to 21%, leading to erroneous interpretation of mapping results. The two effects interplay as well: spatial auto correlation in the raw data can lead to an underestimation of the model's predictive error when modeling with crowdsourced training areas of high inconsistency. Due to the uncertainty in the labeling process and spatial autocorrelation in derived training data, broad-scale LCZ mapping results should be interpreted with caution.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Does climate determine broad-scale patterns of species richness? A test of the causal link by natural experiment
    H-Acevedo, D
    Currie, DJ
    GLOBAL ECOLOGY AND BIOGEOGRAPHY, 2003, 12 (06): : 461 - 473
  • [42] Examining spectral reflectance features related to foliar nitrogen in forests: Implications for broad-scale nitrogen mapping
    Lepine, Lucie C.
    Ollinger, Scott V.
    Ouimette, Andrew P.
    Martin, Mary E.
    REMOTE SENSING OF ENVIRONMENT, 2016, 173 : 174 - 186
  • [43] Identification of persistent benthic assemblages in areas with different temperature variability patterns through broad-scale mapping
    Bethoney, N. David
    Zhao, Liuzhi
    Chen, Changsheng
    Stokesbury, Kevin D. E.
    PLOS ONE, 2017, 12 (05):
  • [44] Broad-scale climate variation drives the dynamics of animal populations: a global multi-taxa analysis
    Wan, Xinru
    Holyoak, Marcel
    Yan, Chuan
    Le Maho, Yvon
    Dirzo, Rodolfo
    Krebs, Charles J.
    Stenseth, Nils Chr
    Zhang, Zhibin
    BIOLOGICAL REVIEWS, 2022, 97 (06) : 2174 - 2194
  • [45] Spectral and Spatial-Based Classification for Broad-Scale Land Cover Mapping Based on Logistic Regression
    Mallinis, Georgios
    Koutsias, Nikos
    SENSORS, 2008, 8 (12) : 8067 - 8085
  • [46] Tree growth and climate in the Pacific Northwest, North America: a broad-scale analysis of changing growth environments
    Albright, Whitney L.
    Peterson, David L.
    JOURNAL OF BIOGEOGRAPHY, 2013, 40 (11) : 2119 - 2133
  • [47] Climate warming and land-use changes drive broad-scale floristic changes in Southern Sweden
    Tyler, Torbjorn
    Herbertsson, Lina
    Olsson, Pal Axel
    Froberg, Lars
    Olsson, Kjell-Arne
    Svensson, Ake
    Olsson, Ola
    GLOBAL CHANGE BIOLOGY, 2018, 24 (06) : 2607 - 2621
  • [48] Temporal variations in scale cortisol indicate consistent local-and broad-scale constraints in a wild marine teleost fish
    Lebigre, Christophe
    Woillez, Mathieu
    Barone, Herve
    Mourot, Jennyfer
    Drogou, Mickael
    Le Goff, Ronan
    Servili, Arianna
    Hennebert, Jana
    Vanhomwegen, Marine
    Aerts, Johan
    MARINE ENVIRONMENTAL RESEARCH, 2022, 182
  • [49] Data diving with cross-validation: an investigation of broad-scale gradients in Swedish weed communities
    Hallgren, E
    Palmer, MW
    Milberg, P
    JOURNAL OF ECOLOGY, 1999, 87 (06) : 1037 - 1051
  • [50] Integrating broad-scale data to assess demographic and climatic contributions to population change in a declining songbird
    Saracco, James F.
    Rubenstein, Madeleine
    ECOLOGY AND EVOLUTION, 2020, 10 (04): : 1804 - 1816