Forecasting PM2.5 concentration levels using shallow machine learning models on the Monterrey Metropolitan Area in Mexico

被引:3
|
作者
Pozo-Luyo, Cesar Alejandro [1 ]
Cruz-Duarte, Jorge M. [1 ]
Amaya, Ivan [1 ]
Ortiz-Bayliss, Jose Carlos [1 ]
机构
[1] Tecnol Monterrey, Sch Engn & Sci, Ave Eugenio Garza Sada 2501, Monterrey 64700, Nuevo Leon, Mexico
关键词
Air quality forecasting; PM2.5; forecasting; Machine learning; Regression; METEOROLOGICAL CONDITIONS; AIR-QUALITY; EXPOSURE;
D O I
10.1016/j.apr.2023.101898
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The Monterrey Metropolitan Area is one of the most densely populated and polluted regions in Latin America. Hence, providing early warnings to the population when pollutant concentrations reach high levels is critical. This allows people at higher health risk to make informed decisions about when to go out, mitigating future health complications. Using forecasting models, we can produce timely warnings for future concentration levels. In this work, we implement a set of short-term shallow machine learning models that would serve as a baseline for future forecasting analyses of PM2.5 concentration levels in the Monterrey Metropolitan Area. The proposed approach starts with multiple imputation through chained equations for missing value imputation, the incorporation of time metadata, and target winsorization. Then, we rely on the well-known random search for parameter optimization of the machine learning models and k-fold cross-validation, obtaining favorable results. We devise these models for a single-step and single-station analysis on an hourly multivariate air quality dataset (containing 77203 rows and 16 columns from the first hour of January 1, 2015 00:00:00 to April 17, 2022 23:00:00) and compare them using standard regression metrics. Therefore, we identify the forecasting model with the best performance, which was an Extra Trees Regressor with a Root Mean Squared Error of 0.013, a Mean Absolute Error of 0.006 (equivalent to a Mean Absolute Percentage Error of 0.294% and a Symmetric Mean Absolute Percentage Error of 0.078%), and a Maximum Error of 0.187 mu g/m(3).
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Forecasting PM2.5 Concentration Using Gradient-Boosted Regression Tree with CNN Learning Model
    A. Usha Ruby
    J. George Chellin Chandran
    Prasannavenkatesan Theerthagiri
    Renuka Patil
    B. N. Chaithanya
    T. J. Swasthika Jain
    Optical Memory and Neural Networks, 2024, 33 : 86 - 96
  • [42] Forecasting of PM2.5 Concentration in Beijing Using Hybrid Deep Learning Framework Based on Attention Mechanism
    Li, Dong
    Liu, Jiping
    Zhao, Yangyang
    APPLIED SCIENCES-BASEL, 2022, 12 (21):
  • [43] Forecasting PM2.5 Concentration Using Gradient-Boosted Regression Tree with CNN Learning Model
    Usha Ruby, A.
    Chandran, J. George Chellin
    Theerthagiri, Prasannavenkatesan
    Patil, Renuka
    Chaithanya, B. N.
    Jain, T. J. Swasthika
    OPTICAL MEMORY AND NEURAL NETWORKS, 2024, 33 (01) : 86 - 96
  • [44] A Development of PM2.5 Forecasting System in South Korea Using Chemical Transport Modeling and Machine Learning
    Koo, Youn-Seo
    Kwon, Hee-Yong
    Bae, Hyosik
    Yun, Hui-Young
    Choi, Dae-Ryun
    Yu, SukHyun
    Wang, Kyung-Hui
    Koo, Ji-Seok
    Lee, Jae-Bum
    Choi, Min-Hyeok
    Lee, Jeong-Beom
    ASIA-PACIFIC JOURNAL OF ATMOSPHERIC SCIENCES, 2023, 59 (05) : 577 - 595
  • [45] Forecasting Ozone and PM2.5 Pollution Potentials Using Machine Learning Algorithms: A Case Study in Chengdu
    Wang, Xinlu
    Huang, Ran
    Zhang, Wenxian
    Lü, Baolei
    Du, Yunsong
    Zhang, Wei
    Li, Bolan
    Hu, Yongtao
    Beijing Daxue Xuebao (Ziran Kexue Ban)/Acta Scientiarum Naturalium Universitatis Pekinensis, 2021, 57 (05): : 938 - 950
  • [46] A Development of PM2.5 Forecasting System in South Korea Using Chemical Transport Modeling and Machine Learning
    Youn-Seo Koo
    Hee-Yong Kwon
    Hyosik Bae
    Hui-Young Yun
    Dae-Ryun Choi
    SukHyun Yu
    Kyung-Hui Wang
    Ji-Seok Koo
    Jae-Bum Lee
    Min-Hyeok Choi
    Jeong-Beom Lee
    Asia-Pacific Journal of Atmospheric Sciences, 2023, 59 : 577 - 595
  • [47] Forecasting Atmospheric PM2.5 Concentration in Thiruvananthapuram City using LSTM Model
    Mohan, Anju S.
    Abraham, Lizy
    2019 FIFTH INTERNATIONAL CONFERENCE ON IMAGE INFORMATION PROCESSING (ICIIP 2019), 2019, : 343 - 346
  • [48] PM2.5 concentration simulation by hybrid machine learning based on image features
    Ma, Minjin
    Zhao, Zhenzhu
    Ma, Yuzhan
    Cao, Yidan
    Kang, Guoqiang
    FRONTIERS IN EARTH SCIENCE, 2025, 13
  • [49] Comparison of Statistical and Deep Learning Methods for Forecasting PM2.5 Concentration in Northern Thailand
    Wongrin, Weerinrada
    Chaisee, Kuntalee
    Suphawan, Kamonrat
    POLISH JOURNAL OF ENVIRONMENTAL STUDIES, 2023, 32 (02): : 1419 - 1431
  • [50] Forecasting PM10 Concentrations in the Caribbean Area Using Machine Learning Models
    Plocoste, Thomas
    Laventure, Sylvio
    ATMOSPHERE, 2023, 14 (01)