Unmasking the sky: high-resolution PM2.5 prediction in Texas using machine learning techniques

被引:0
|
作者
Zhang, Kai [1 ]
Lin, Jeffrey [2 ]
Li, Yuanfei [3 ]
Sun, Yue [4 ]
Tong, Weitian [5 ]
Li, Fangyu [6 ]
Chien, Lung-Chang [7 ]
Yang, Yiping [2 ]
Su, Wei-Chung [6 ]
Tian, Hezhong [8 ,9 ]
Fu, Peng [10 ,11 ]
Qiao, Fengxiang [12 ]
Romeiko, Xiaobo Xue [1 ]
Lin, Shao [1 ]
Luo, Sheng [13 ]
Craft, Elena [14 ]
机构
[1] SUNY Albany, Sch Publ Hlth, Dept Environm Hlth Sci, Rensselaer, NY 12144 USA
[2] Univ Texas Hlth Sci Ctr Houston, Sch Publ Hlth, Dept Biostat & Data Sci, Houston, TX USA
[3] Shanghai Univ, Asian Demog Res Inst, Shanghai, Peoples R China
[4] Clark Univ, Dept Int Dev Community & Environm, Worcester, MA USA
[5] Georgia Southern Univ, Dept Comp Sci, Statesboro, GA USA
[6] Univ Texas Hlth Sci Ctr, Dept Epidemiol Human Genet & Environm Sci, Sch Publ Hlth, Houston, TX USA
[7] Univ Nevada, Sch Publ Hlth, Dept Epidemiol & Biostat, Las Vegas, NV USA
[8] Beijing Normal Univ, Sch Environm, State Key Joint Lab Environm Simulat & Pollut Cont, Beijing, Peoples R China
[9] Beijing Normal Univ, Ctr Atmospher Environm Studies, Beijing, Peoples R China
[10] Univ Illinois, Dept Plant Biol, Urbana, IL USA
[11] Harrisburg Univ, Ctr Econ Environm & Energy, Harrisburg, PA USA
[12] Texas Southern Univ, Innovat Transportat Res Inst, Houston, TX USA
[13] Duke Univ, Dept Biostat & Bioinformat, Durham, NC USA
[14] Hlth Effects Inst, Boston, MA USA
关键词
AOD; Gradient boosting; Machine learning; PM2.5; Random forest; FINE PARTICULATE MATTER; PRIVATELY INSURED POPULATION; BEIJING-TIANJIN-HEBEI; RANDOM FOREST; COMPONENTS; MODEL; AOD;
D O I
10.1038/s41370-024-00659-w
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Background Although PM2.5 (fine particulate matter with an aerodynamic diameter less than 2.5 mu m) is an air pollutant of great concern in Texas, limited regulatory monitors pose a significant challenge for decision-making and environmental studies. Objective This study aimed to predict PM2.5 concentrations at a fine spatial scale on a daily basis by using novel machine learning approaches and incorporating satellite-derived Aerosol Optical Depth (AOD) and a variety of weather and land use variables. MethodsWe compiled a comprehensive dataset in Texas from 2013 to 2017, including ground-level PM2.5 concentrations from regulatory monitors; AOD values at 1-km resolution based on images retrieved from the MODIS satellite; and weather, land-use, population density, among others. We built predictive models for each year separately to estimate PM2.5 concentrations using two machine learning approaches called gradient boosted trees and random forest. We evaluated the model prediction performance using in-sample and out-of-sample validations. Results Our predictive models demonstrate excellent in-sample model performance, as indicated by high R-2 values generated from the gradient boosting models (0.94-0.97) and random forest models (0.81-0.90). However, the out-of-sample R-2 values fall within a range of 0.52-0.75 for gradient boosting models and 0.44-0.69 for random forest models. Model performance varies slightly across years. A generally decreasing trend in predicted PM2.5 concentrations over time is observed in Eastern Texas.
引用
收藏
页码:814 / 820
页数:7
相关论文
共 50 条
  • [41] A model for particulate matter (PM2.5) prediction for Delhi based on machine learning approaches
    Masood, Adil
    Ahmad, Kafeel
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA SCIENCE, 2020, 167 : 2101 - 2110
  • [42] Performing indoor PM2.5 prediction with low-cost data and machine learning
    Lagesse, Brent
    Wang, Shuoqi
    Larson, Timothy, V
    Kim, Amy Ahim
    FACILITIES, 2022, 40 (7/8) : 495 - 514
  • [43] Prediction of PM2.5 Concentration Based on Ensemble Learning
    Peng Y.
    Zhao Z.-R.
    Wu T.-X.
    Wang J.
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2019, 42 (06): : 162 - 169
  • [44] A deep learning model for PM2.5 concentration prediction
    Zhang, Zhendong
    Ma, Xiang
    Yan, Ke
    2021 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS DASC/PICOM/CBDCOM/CYBERSCITECH 2021, 2021, : 428 - 433
  • [45] Air quality analysis and PM2.5 modelling using machine learning techniques: A study of Hyderabad city in India
    Mathew, Aneesh
    Gokul, P. R.
    Raja Shekar, Padala
    Arunab, K. S.
    Ghassan Abdo, Hazem
    Almohamad, Hussein
    Abdullah Al Dughairi, Ahmed
    COGENT ENGINEERING, 2023, 10 (01):
  • [46] Estimation of PM10 and PM2.5 Using Backscatter Coefficient of Ceilometer and Machine Learning
    Kim, Bu-Yo
    Cha, Joo Wan
    Lee, Yong Hee
    AEROSOL AND AIR QUALITY RESEARCH, 2023, 23 (12)
  • [47] Predicting particulate matter (PM2.5) air pollution levels in Almaty city using machine learning techniques
    Alibek Issakhov
    Nurtugan Rysmambetov
    Aizhan Abylkassymova
    Modeling Earth Systems and Environment, 2025, 11 (4)
  • [48] PM 2.5 Prediction & Air Quality Classification Using Machine Learning
    Soontornpipit, Pichitpong
    Lekawat, Lertsak
    Tritham, Chatchai
    Tritham, Chattabhorn
    Pongpaibool, Pornanong
    Prasertsuk, Narachata
    Jirakitpuwapat, Wachirapong
    THAI JOURNAL OF MATHEMATICS, 2024, 22 (02): : 441 - 452
  • [49] Characterization of temporal PM2.5, nitrate, and sulfate using deep learning techniques
    Lin, Guan-Yu
    Chen, Ho-Wen
    Chen, Bin-Jiun
    Yang, Yi-Cong
    ATMOSPHERIC POLLUTION RESEARCH, 2022, 13 (01)
  • [50] High-Resolution PM2.5 Emissions and Associated Health Impact Inequalities in an Indian District
    Tomar, Gaurav
    Nagpure, Ajay Singh
    Jain, Yash
    Kumar, Vivek
    ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2023, 57 (06) : 2310 - 2321