Creating a spatially continuous air temperature dataset for Taiwan using thermal remote-sensing data and machine learning algorithms

被引:8
|
作者
Tran, Duy-Phien [1 ,2 ]
Liou, Yuei-An [1 ]
机构
[1] Natl Cent Univ, Ctr Space & Remote Sensing Res, 300 Jhongda Rd, Taoyuan City 320317, Taiwan
[2] Vietnam Acad Sci & Technol, Inst Geog, 18 Hoang Quoc Viet Rd, Hanoi City, Vietnam
关键词
Air temperature; Land surface temperature; Machine learning; XGB; MODIS; LAND-SURFACE-TEMPERATURE; URBAN HEAT ISLANDS; ESTIMATING DAILY MAXIMUM; SATELLITE DATA; MODIS DATA; MINIMUM; REFINEMENTS; VALIDATION; RESOLUTION; RETRIEVAL;
D O I
10.1016/j.ecolind.2023.111469
中图分类号
X176 [生物多样性保护];
学科分类号
090705 ;
摘要
Weather stations can provide accurate and high temporal resolution air temperature (Ta) measurements, but their limited spatial coverage due to sparse distribution poses an issue and challenge. However, satellite data can offer land surface temperature (LST) observations with high spatial coverage, which have a strong relationship with Ta, making them ideal for enhancing Ta estimation. This study uses satellite-derived and auxiliary data to create a monthly mean Ta dataset with a 1 km resolution over Taiwan from 2003 to 2020. We employed three machine learning (ML) algorithms and seven different datasets comprising 12 explanatory variables with LST obtained from the MODIS to find the optimal combination of algorithm and dataset for Ta estimation in Taiwan. We applied recursive feature elimination (RFE) to reduce the model complexity and overfitting issues. For model assessment, we used five-fold cross-validation to evaluate the ML models, and indicators such as the coefficient of determination (R2), mean absolute error (MAE), and root mean square of error (RMSE) were employed. The results show that the XGB regressor performed the best among the three models with the highest accuracy. The RFE using the XGB model suggested eight selected variables, including nighttime LST, daytime LST, elevation, longitude, latitude, distance to the sea, month, and year. Based on the variance importance analysis, nighttime LST was the most crucial variable, followed by daytime LST and month. We found that the final monthly Ta dataset using the XGB model had an excellent five-fold cross-validated performance (R2 = 0.986, MAE = 0.477 degrees C, and RMSE = 0.639 degrees C). Furthermore, the XGB model not only performed well throughout all four seasons but also had high and consistent accuracy across months, years, and subsets, indicating its potential for accurately estimating Ta in Taiwan's complex topographic features with varying climate conditions. The resulting monthly Ta dataset created by our model can be an essential input for environmental studies.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Three-Dimensional Mapping of Habitats Using Remote-Sensing Data and Machine-Learning Algorithms
    Amani, Meisam
    Foroughnia, Fatemeh
    Moghimi, Armin
    Mahdavi, Sahel
    Jin, Shuanggen
    REMOTE SENSING, 2023, 15 (17)
  • [2] Summer Nights in Berlin, Germany: Modeling Air Temperature Spatially With Remote Sensing, Crowdsourced Weather Data, and Machine Learning
    Vulova, Stenka
    Meier, Fred
    Fenner, Daniel
    Nouri, Hamideh
    Kleinschmit, Birgit
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 : 5074 - 5087
  • [3] Next Day Wildfire Spread: A Machine Learning Dataset to Predict Wildfire Spreading From Remote-Sensing Data
    Huot, Fantine
    Hu, R. Lily
    Goyal, Nita
    Sankar, Tharun
    Ihme, Matthias
    Chen, Yi-Fan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [4] Spatiotemporal prediction of daily air temperature using remote sensing and machine learning in Morocco
    Modeste Meliho
    Abdellatif Khattabi
    Driss Zejli
    Collins Ashianga Orlando
    Theoretical and Applied Climatology, 2024, 155 : 2105 - 2117
  • [5] Spatiotemporal prediction of daily air temperature using remote sensing and machine learning in Morocco
    Meliho, Modeste
    Khattabi, Abdellatif
    Zejli, Driss
    Orlando, Collins Ashianga
    THEORETICAL AND APPLIED CLIMATOLOGY, 2024, 155 (03) : 2105 - 2117
  • [6] Soil Organic Carbon Assessment Using Remote-Sensing Data and Machine Learning: A Systematic Literature Review
    Lima, Arthur A. J.
    Lopes, Julio Castro
    Lopes, Rui Pedro
    de Figueiredo, Tomas
    Vidal-Vazquez, Eva
    Hernandez, Zulimar
    REMOTE SENSING, 2025, 17 (05)
  • [7] Yield prediction in a peanut breeding program using remote sensing data and machine learning algorithms
    Pugh, N. Ace
    Young, Andrew
    Ojha, Manisha
    Emendack, Yves
    Sanchez, Jacobo
    Xin, Zhanguo
    Puppala, Naveen
    FRONTIERS IN PLANT SCIENCE, 2024, 15
  • [8] Landslide Susceptibility Mapping Using Machine Learning Algorithms and Remote Sensing Data in a Tropical Environment
    Viet-Ha Nhu
    Mohammadi, Ayub
    Shahabi, Himan
    Bin Ahmad, Baharin
    Al-Ansari, Nadhir
    Shirzadi, Ataollah
    Clague, John J.
    Jaafari, Abolfazl
    Chen, Wei
    Nguyen, Hoang
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2020, 17 (14) : 1 - 23
  • [9] Creating Sustainable Flood Maps Using Machine Learning and Free Remote Sensing Data in Unmapped Areas
    Venegas-Quinones, Hector Leopoldo
    Garcia-Chevesich, Pablo
    Valdes-Pineda, Rodrigo
    Ferre, Ty P. A.
    Gupta, Hoshin
    Groenendyk, Derek
    Valdes, Juan B.
    McCray, John E.
    Bakkensen, Laura
    SUSTAINABILITY, 2024, 16 (20)
  • [10] Machine learning of Antarctic firn density by combining radiometer and scatterometer remote-sensing data
    Li, Weiran
    Veldhuijsen, Sanne B. M.
    Lhermitte, Stef
    CRYOSPHERE, 2025, 19 (01): : 37 - 61