Forecasting PM2.5 concentration levels using shallow machine learning models on the Monterrey Metropolitan Area in Mexico

被引:3
|
作者
Pozo-Luyo, Cesar Alejandro [1 ]
Cruz-Duarte, Jorge M. [1 ]
Amaya, Ivan [1 ]
Ortiz-Bayliss, Jose Carlos [1 ]
机构
[1] Tecnol Monterrey, Sch Engn & Sci, Ave Eugenio Garza Sada 2501, Monterrey 64700, Nuevo Leon, Mexico
关键词
Air quality forecasting; PM2.5; forecasting; Machine learning; Regression; METEOROLOGICAL CONDITIONS; AIR-QUALITY; EXPOSURE;
D O I
10.1016/j.apr.2023.101898
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The Monterrey Metropolitan Area is one of the most densely populated and polluted regions in Latin America. Hence, providing early warnings to the population when pollutant concentrations reach high levels is critical. This allows people at higher health risk to make informed decisions about when to go out, mitigating future health complications. Using forecasting models, we can produce timely warnings for future concentration levels. In this work, we implement a set of short-term shallow machine learning models that would serve as a baseline for future forecasting analyses of PM2.5 concentration levels in the Monterrey Metropolitan Area. The proposed approach starts with multiple imputation through chained equations for missing value imputation, the incorporation of time metadata, and target winsorization. Then, we rely on the well-known random search for parameter optimization of the machine learning models and k-fold cross-validation, obtaining favorable results. We devise these models for a single-step and single-station analysis on an hourly multivariate air quality dataset (containing 77203 rows and 16 columns from the first hour of January 1, 2015 00:00:00 to April 17, 2022 23:00:00) and compare them using standard regression metrics. Therefore, we identify the forecasting model with the best performance, which was an Extra Trees Regressor with a Root Mean Squared Error of 0.013, a Mean Absolute Error of 0.006 (equivalent to a Mean Absolute Percentage Error of 0.294% and a Symmetric Mean Absolute Percentage Error of 0.078%), and a Maximum Error of 0.187 mu g/m(3).
引用
收藏
页数:11
相关论文
共 50 条
  • [21] PM2.5-bounded PAHS from two zones of the Metropolitan Area of Monterrey, Nuevo Leon, Mexico
    Garza-Ocanas, L.
    Garza-Ulloa, H.
    Gonzalez-Santiago, O.
    Lujan-Rangel, R.
    Badillo-Castaneda, C. T.
    TOXICOLOGY LETTERS, 2010, 196 : S287 - S287
  • [22] Prediction of PM2.5 concentration in Ulaanbaatar with deep learning models
    Suriya
    Natsagdorj, Narantsogt
    Aorigele
    Zhou, Haijun
    Sachurila
    URBAN CLIMATE, 2023, 47
  • [23] Statistical Analysis of PM10 Concentration in the Monterrey Metropolitan Area, Mexico (2010-2018)
    Aguirre-Lopez, Mario A.
    Rodriguez-Gonzalez, Miguel Angel
    Soto-Villalobos, Roberto
    Gomez-Sanchez, Laura Elena
    Benavides-Rios, Angela Gabriela
    Benavides-Bravo, Francisco Gerardo
    Walle-Garcia, Otoniel
    Pamanes-Aguilar, Maria Gricelda
    ATMOSPHERE, 2022, 13 (02)
  • [24] Application of CNN-LSTM Algorithm for PM2.5 Concentration Forecasting in the Beijing-Tianjin-Hebei Metropolitan Area
    Su, Yuxuan
    Li, Junyu
    Liu, Lilong
    Guo, Xi
    Huang, Liangke
    Hu, Mingyun
    ATMOSPHERE, 2023, 14 (09)
  • [25] Time series-based PM2.5 concentration prediction in Jing-Jin-Ji area using machine learning algorithm models
    Ma, Xin
    Chen, Tengfei
    Ge, Rubing
    Cui, Caocao
    Xu, Fan
    Lv, Qi
    HELIYON, 2022, 8 (09)
  • [26] Applying PCA to Deep Learning Forecasting Models for Predicting PM2.5
    Choi, Sang Won
    Kim, Brian H. S.
    SUSTAINABILITY, 2021, 13 (07)
  • [27] A secondary-decomposition-ensemble learning paradigm for forecasting PM2.5 concentration
    Gan, Kai
    Sun, Shaolong
    Wang, Shouyang
    Wei, Yunjie
    ATMOSPHERIC POLLUTION RESEARCH, 2018, 9 (06) : 989 - 999
  • [28] PM2.5 Estimation using Machine Learning Models and Satellite Data: A Literature Review
    Unik, Mitra
    Sitanggang, Imas Sukaesih
    Syaufina, Lailan
    Jaya, I. Nengah Surati
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (05) : 359 - 370
  • [29] An intercomparison of weather normalization of PM2.5 concentration using traditional statistical methods, machine learning, and chemistry transport models
    Zheng, Huang
    Kong, Shaofei
    Zhai, Shixian
    Sun, Xiaoyun
    Cheng, Yi
    Yao, Liquan
    Song, Congbo
    Zheng, Zhonghua
    Shi, Zongbo
    Harrison, Roy M.
    NPJ CLIMATE AND ATMOSPHERIC SCIENCE, 2023, 6 (01)
  • [30] An intercomparison of weather normalization of PM2.5 concentration using traditional statistical methods, machine learning, and chemistry transport models
    Huang Zheng
    Shaofei Kong
    Shixian Zhai
    Xiaoyun Sun
    Yi Cheng
    Liquan Yao
    Congbo Song
    Zhonghua Zheng
    Zongbo Shi
    Roy M. Harrison
    npj Climate and Atmospheric Science, 6