Creating a spatially continuous air temperature dataset for Taiwan using thermal remote-sensing data and machine learning algorithms

Cited by: 8
Authors
Tran, Duy-Phien [1, 2]
Liou, Yuei-An [1]
Affiliations
[1] Natl Cent Univ, Ctr Space & Remote Sensing Res, 300 Jhongda Rd, Taoyuan City 320317, Taiwan
[2] Vietnam Acad Sci & Technol, Inst Geog, 18 Hoang Quoc Viet Rd, Hanoi City, Vietnam
Keywords
Air temperature; Land surface temperature; Machine learning; XGB; MODIS; LAND-SURFACE-TEMPERATURE; URBAN HEAT ISLANDS; ESTIMATING DAILY MAXIMUM; SATELLITE DATA; MODIS DATA; MINIMUM; REFINEMENTS; VALIDATION; RESOLUTION; RETRIEVAL
DOI
10.1016/j.ecolind.2023.111469
Chinese Library Classification (CLC) number
X176 [Biodiversity conservation]
Discipline classification code
090705
Abstract
Weather stations provide accurate air temperature (Ta) measurements at high temporal resolution, but their sparse distribution limits spatial coverage. Satellite data, in contrast, offer land surface temperature (LST) observations with high spatial coverage, and LST is strongly related to Ta, which makes it well suited to improving Ta estimation. This study uses satellite-derived and auxiliary data to create a monthly mean Ta dataset at 1 km resolution over Taiwan from 2003 to 2020. We tested three machine learning (ML) algorithms and seven datasets comprising 12 explanatory variables, with LST obtained from MODIS, to find the optimal combination of algorithm and dataset for Ta estimation in Taiwan. We applied recursive feature elimination (RFE) to reduce model complexity and overfitting. The ML models were evaluated with five-fold cross-validation using the coefficient of determination (R²), mean absolute error (MAE), and root mean square error (RMSE). The XGB regressor achieved the highest accuracy of the three models. RFE with the XGB model selected eight variables: nighttime LST, daytime LST, elevation, longitude, latitude, distance to the sea, month, and year. Based on the variable importance analysis, nighttime LST was the most important variable, followed by daytime LST and month. The final monthly Ta dataset produced with the XGB model had excellent five-fold cross-validated performance (R² = 0.986, MAE = 0.477 °C, and RMSE = 0.639 °C). Furthermore, the XGB model performed well in all four seasons and had high, consistent accuracy across months, years, and subsets, indicating its potential for accurately estimating Ta over Taiwan's complex topography and varying climate conditions. The resulting monthly Ta dataset can be an essential input for environmental studies.
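The evaluation protocol named in the abstract (five-fold cross-validation scored with the coefficient of determination, MAE, and RMSE) can be illustrated with a minimal, standard-library-only sketch. Everything specific here is an assumption made for illustration, not the authors' pipeline: the data are synthetic, the model is a one-variable least-squares line standing in for the XGB regressor, and pooling out-of-fold predictions before scoring is one common convention (the abstract does not say whether scores were pooled or averaged per fold).

```python
import math
import random


def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean square error."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


def r2(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_t = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_t) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def k_fold_indices(n, k=5, seed=0):
    """Shuffle indices 0..n-1 and split them into k nearly equal folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def fit_line(x, y):
    """Ordinary least squares for y = a + b*x (stand-in model, NOT XGB)."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    b = sxy / sxx
    return my - b * mx, b


def cross_validate(x, y, k=5, seed=0):
    """Predict each fold from a model trained on the other folds, then score."""
    preds = [None] * len(y)
    for fold in k_fold_indices(len(y), k, seed):
        held_out = set(fold)
        train = [i for i in range(len(y)) if i not in held_out]
        a, b = fit_line([x[i] for i in train], [y[i] for i in train])
        for i in fold:
            preds[i] = a + b * x[i]
    return {"R2": r2(y, preds), "MAE": mae(y, preds), "RMSE": rmse(y, preds)}


# Synthetic toy data (made up for illustration only):
# Ta modeled as a noisy linear function of nighttime LST.
rng = random.Random(42)
lst_night = [rng.uniform(5.0, 28.0) for _ in range(200)]
ta = [1.5 + 0.95 * v + rng.gauss(0.0, 0.6) for v in lst_night]

scores = cross_validate(lst_night, ta)
print(scores)
```

Because every sample is predicted only by a model that never saw it during fitting, the resulting scores estimate out-of-sample accuracy rather than goodness of fit on the training data.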
Pages: 23