Forecasting PM2.5 concentration levels using shallow machine learning models on the Monterrey Metropolitan Area in Mexico

被引:3
|
作者
Pozo-Luyo, Cesar Alejandro [1 ]
Cruz-Duarte, Jorge M. [1 ]
Amaya, Ivan [1 ]
Ortiz-Bayliss, Jose Carlos [1 ]
机构
[1] Tecnol Monterrey, Sch Engn & Sci, Ave Eugenio Garza Sada 2501, Monterrey 64700, Nuevo Leon, Mexico
关键词
Air quality forecasting; PM2.5; forecasting; Machine learning; Regression; METEOROLOGICAL CONDITIONS; AIR-QUALITY; EXPOSURE;
D O I
10.1016/j.apr.2023.101898
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The Monterrey Metropolitan Area is one of the most densely populated and polluted regions in Latin America. Hence, providing early warnings to the population when pollutant concentrations reach high levels is critical. This allows people at higher health risk to make informed decisions about when to go out, mitigating future health complications. Using forecasting models, we can produce timely warnings for future concentration levels. In this work, we implement a set of short-term shallow machine learning models that would serve as a baseline for future forecasting analyses of PM2.5 concentration levels in the Monterrey Metropolitan Area. The proposed approach starts with multiple imputation through chained equations for missing value imputation, the incorporation of time metadata, and target winsorization. Then, we rely on the well-known random search for parameter optimization of the machine learning models and k-fold cross-validation, obtaining favorable results. We devise these models for a single-step and single-station analysis on an hourly multivariate air quality dataset (containing 77203 rows and 16 columns from the first hour of January 1, 2015 00:00:00 to April 17, 2022 23:00:00) and compare them using standard regression metrics. Therefore, we identify the forecasting model with the best performance, which was an Extra Trees Regressor with a Root Mean Squared Error of 0.013, a Mean Absolute Error of 0.006 (equivalent to a Mean Absolute Percentage Error of 0.294% and a Symmetric Mean Absolute Percentage Error of 0.078%), and a Maximum Error of 0.187 mu g/m(3).
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Source apportionment of PM2.5 for supporting control strategies in the Monterrey Metropolitan Area, Mexico
    Martinez-Cinco, Marco
    Santos-Guzman, Jesus
    Mejia-Velazquez, Gerardo
    JOURNAL OF THE AIR & WASTE MANAGEMENT ASSOCIATION, 2016, 66 (06) : 631 - 642
  • [2] Heavy Metal Content in PM2.5 Air Samples Collected in the Metropolitan Area of Monterrey, Mexico
    Tadeo Badillo-Castaneda, Christian
    Garza-Ocanas, Lourdes
    Humberto Garza-Ulloa, M. C.
    Teresa Zanatta-Calderon, Maria
    Caballero-Quintero, Adolfo
    HUMAN AND ECOLOGICAL RISK ASSESSMENT, 2015, 21 (08): : 2022 - 2035
  • [3] Platinum in PM2.5 of the metropolitan area of Mexico City
    Morton-Bermea, Ofelia
    Amador-Munoz, Omar
    Martinez-Trejo, Lida
    Hernandez-Alvarez, Elizabeth
    Beramendi-Orosco, Laura
    Elena Garcia-Arreola, Maria
    ENVIRONMENTAL GEOCHEMISTRY AND HEALTH, 2014, 36 (05) : 987 - 994
  • [4] Platinum in PM2.5 of the metropolitan area of Mexico City
    Ofelia Morton-Bermea
    Omar Amador-Muñoz
    Lida Martínez-Trejo
    Elizabeth Hernández-Álvarez
    Laura Beramendi-Orosco
    María Elena García-Arreola
    Environmental Geochemistry and Health, 2014, 36 : 987 - 994
  • [5] Forecasting PM2.5 Concentration using Spatio-Temporal Extreme Learning Machine
    Liu, Bo
    Yan, Shuo
    Li, Jianqiang
    Li, Yong
    2016 15TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2016), 2016, : 950 - 953
  • [6] Modelling and Forecasting Temporal PM2.5 Concentration Using Ensemble Machine Learning Methods
    Ejohwomu, Obuks Augustine
    Shamsideen Oshodi, Olakekan
    Oladokun, Majeed
    Bukoye, Oyegoke Teslim
    Emekwuru, Nwabueze
    Sotunbo, Adegboyega
    Adenuga, Olumide
    BUILDINGS, 2022, 12 (01)
  • [7] Prediction of PM2.5 Concentration Using Spatiotemporal Data with Machine Learning Models
    Ma, Xin
    Chen, Tengfei
    Ge, Rubing
    Xv, Fan
    Cui, Caocao
    Li, Junpeng
    ATMOSPHERE, 2023, 14 (10)
  • [8] An Improved Weight Optimization of Hybrid Machine Learning Models for Forecasting Daily PM2.5 Concentration
    Ratchagit, Manlika
    CONTEMPORARY MATHEMATICS, 2024, 5 (03): : 3953 - 3970
  • [9] Time series forecasting of ozone levels in the Metropolitan Area of Monterrey, Mexico
    Iglesias-Gonzalez, S.
    Huertas, M.
    Hernandez-Paniagua, I
    Mendoza, A.
    15TH INTERNATIONAL CONFERENCE ON ATMOSPHERIC SCIENCES AND APPLICATIONS TO AIR QUALITY, 2020, 489
  • [10] Platinum concentration in PM2.5 in the Mexico City Metropolitan Area: relationship to meteorological conditions
    Garza-Galindo, Rodrigo
    Morton-Bermea, Ofelia
    Hernandez-Alvarez, Elizabeth
    Ordonez-Godinez, Sara L.
    Amador-Munoz, Omar
    Beramendi-Orosco, Laura
    Retama-Hernandez, Armando
    Miranda, Javier
    Rosas-Perez, Irma
    HUMAN AND ECOLOGICAL RISK ASSESSMENT, 2020, 26 (05): : 1164 - 1174