Creating a spatially continuous air temperature dataset for Taiwan using thermal remote-sensing data and machine learning algorithms

Cited by: 8
Authors
Tran, Duy-Phien [1, 2]
Liou, Yuei-An [1]
Affiliations
[1] Natl Cent Univ, Ctr Space & Remote Sensing Res, 300 Jhongda Rd, Taoyuan City 320317, Taiwan
[2] Vietnam Acad Sci & Technol, Inst Geog, 18 Hoang Quoc Viet Rd, Hanoi City, Vietnam
Keywords
Air temperature; Land surface temperature; Machine learning; XGB; MODIS; LAND-SURFACE-TEMPERATURE; URBAN HEAT ISLANDS; ESTIMATING DAILY MAXIMUM; SATELLITE DATA; MODIS DATA; MINIMUM; REFINEMENTS; VALIDATION; RESOLUTION; RETRIEVAL;
DOI
10.1016/j.ecolind.2023.111469
Chinese Library Classification number
X176 [Biodiversity conservation];
Discipline classification code
090705;
Abstract
Weather stations provide accurate air temperature (Ta) measurements at high temporal resolution, but their sparse distribution limits spatial coverage. Satellite data, by contrast, offer land surface temperature (LST) observations with high spatial coverage; because LST is strongly related to Ta, these observations are well suited to improving Ta estimation. This study uses satellite-derived and auxiliary data to create a monthly mean Ta dataset at 1 km resolution over Taiwan from 2003 to 2020. We tested three machine learning (ML) algorithms and seven datasets comprising 12 explanatory variables, with LST obtained from MODIS, to find the optimal combination of algorithm and dataset for Ta estimation in Taiwan. We applied recursive feature elimination (RFE) to reduce model complexity and overfitting. The ML models were evaluated with five-fold cross-validation using the coefficient of determination (R²), mean absolute error (MAE), and root mean square error (RMSE). The XGB regressor achieved the highest accuracy of the three models. RFE with the XGB model selected eight variables: nighttime LST, daytime LST, elevation, longitude, latitude, distance to the sea, month, and year. Based on the variable importance analysis, nighttime LST was the most important variable, followed by daytime LST and month. The final monthly Ta dataset produced with the XGB model had excellent five-fold cross-validated performance (R² = 0.986, MAE = 0.477 °C, and RMSE = 0.639 °C). Furthermore, the XGB model performed well in all four seasons and had high, consistent accuracy across months, years, and subsets, indicating its potential for accurately estimating Ta across Taiwan's complex topography and varying climate conditions. The resulting monthly Ta dataset can be an essential input for environmental studies.
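
The evaluation procedure named in the abstract (five-fold cross-validation scored with R², MAE, and RMSE) can be illustrated with a minimal sketch. This is not the authors' code: the data are synthetic, a one-variable least-squares line stands in for the XGB regressor, and all function names (`regression_metrics`, `k_fold_indices`, `fit_line`, `cross_validate`) are hypothetical.

```python
import math
import random


def regression_metrics(observed, predicted):
    """Return (R2, MAE, RMSE) for paired observed and predicted values."""
    n = len(observed)
    mean_obs = sum(observed) / n
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    r2 = 1.0 - ss_res / ss_tot
    mae = sum(abs(o - p) for o, p in zip(observed, predicted)) / n
    rmse = math.sqrt(ss_res / n)
    return r2, mae, rmse


def k_fold_indices(n_samples, k=5, seed=0):
    """Shuffle the sample indices and split them into k nearly equal folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]


def fit_line(xs, ys):
    """Least-squares fit of y = a * x + b (assumed stand-in for XGB)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    a = sxy / sxx
    return a, mean_y - a * mean_x


def cross_validate(xs, ys, k=5, seed=0):
    """Predict each fold from a model trained on the other folds, then score."""
    predicted = [0.0] * len(xs)
    for held_out in k_fold_indices(len(xs), k, seed):
        held = set(held_out)
        train = [i for i in range(len(xs)) if i not in held]
        a, b = fit_line([xs[i] for i in train], [ys[i] for i in train])
        for i in held_out:
            predicted[i] = a * xs[i] + b
    return regression_metrics(ys, predicted)


# Synthetic example: Ta loosely tracks nighttime LST (values are made up).
rng = random.Random(42)
lst_night = [rng.uniform(5.0, 28.0) for _ in range(200)]
ta = [0.9 * x + 2.0 + rng.gauss(0.0, 0.5) for x in lst_night]

r2, mae, rmse = cross_validate(lst_night, ta)
print(f"R2 = {r2:.3f}, MAE = {mae:.3f} degC, RMSE = {rmse:.3f} degC")
```

Here the out-of-fold predictions are pooled and scored once; averaging the per-fold scores is an equally common convention, and the abstract does not say which one the authors used.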
Pages: 23
Related papers
50 in total
  • [31] Predicting Forest Fire Using Remote Sensing Data And Machine Learning
    Yang, Suwei
    Lupascu, Massimo
    Meel, Kuldeep S.
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 14983 - 14990
  • [32] Humanitarian applications of machine learning with remote-sensing data: review and case study in refugee settlement mapping
    Quinn, John A.
    Nyhan, Marguerite M.
    Navarro, Celia
    Coluccia, Davide
    Bromley, Lars
    Luengo-Oroz, Miguel
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2018, 376 (2128):
  • [33] Comparative Analysis of Machine-Learning Models for Soil Moisture Estimation Using High-Resolution Remote-Sensing Data
    Li, Ming
    Yan, Yueguan
    LAND, 2024, 13 (08)
  • [34] Air Quality Forecasting Using Big Data and Machine Learning Algorithms
    Youn-Seo Koo
    Yunsoo Choi
    Chang‐Hoi Ho
    Asia-Pacific Journal of Atmospheric Sciences, 2023, 59 : 529 - 530
  • [35] Air Quality Forecasting Using Big Data and Machine Learning Algorithms
    Koo, Youn-Seo
    Choi, Yunsoo
    Ho, Chang-Hoi
    ASIA-PACIFIC JOURNAL OF ATMOSPHERIC SCIENCES, 2023, 59 (05) : 529 - 530
  • [36] Predictive modelling of mineral prospectivity using satellite remote sensing and machine learning algorithms
    Mahboob, Muhammad Ahsan
    Celik, Turgay
    Genc, Bekir
    REMOTE SENSING APPLICATIONS-SOCIETY AND ENVIRONMENT, 2024, 36
  • [37] Performance Analysis of Machine Learning Algorithms on Diabetes Dataset using Big Data Analytics
    Kumar, P. Suresh
    Pranavi, S.
    2017 INTERNATIONAL CONFERENCE ON INFOCOM TECHNOLOGIES AND UNMANNED SYSTEMS (TRENDS AND FUTURE DIRECTIONS) (ICTUS), 2017, : 508 - 513
  • [38] Identification of shallow groundwater in arid lands using multi-sensor remote sensing data and machine learning algorithms
    Sahour H.
    Sultan M.
    Abdellatif B.
    Emil M.
    Abotalib A.Z.
    Abdelmohsen K.
    Vazifedan M.
    Mohammad A.T.
    Hassan S.M.
    Metwalli M.R.
    El Bastawesy M.
    Journal of Hydrology, 2022, 614
  • [39] Enhancing soil moisture retrieval in semi-arid regions using machine learning algorithms and remote sensing data
    Duan, Xulong
    Maqsoom, Ahsen
    Khalil, Umer
    Aslam, Bilal
    Amjad, Talal
    Tufail, Rana Faisal
    Alarifi, Saad S.
    Tariq, Aqil
    APPLIED SOIL ECOLOGY, 2024, 204
  • [40] Fusion of Geochemical and Remote-Sensing Data for Lithological Mapping Using Random Forest Metric Learning
    Ziye Wang
    Renguang Zuo
    Linhai Jing
    Mathematical Geosciences, 2021, 53 : 1125 - 1145