Evaluation of data preprocessing and feature selection process for prediction of hourly PM10 concentration using long short-term memory models

被引:14
|
作者
Aksangur, Ipek [1 ]
Eren, Beytullah [1 ,2 ]
Erden, Caner [3 ,4 ]
机构
[1] Sakarya Univ, Fac Engn, Dept Environ Engn, Esentepe, Sakarya, Turkey
[2] Harran Univ, Halfeti Vocat Sch, Halfeti, Sanliurfa, Turkey
[3] Sakarya Univ Appl Sci, Fac Appl Sci, Dept Int Trade & Finance, Sakarya, Turkey
[4] Sakarya Univ Appl Sci, AI Res & Applicat Ctr, Sakarya, Turkey
关键词
Air quality; Data preprocessing; Feature selection; Particulate matter (PM 10 ); Long -short term memory (LSTM); AIR-POLLUTION; NEURAL-NETWORK; PM2.5; ARCHITECTURE; EXPOSURE; IMPACT; SO2;
D O I
10.1016/j.envpol.2022.119973
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Studies have confirmed that PM10, defined as respirable particles with diameters of 10 mu m and smaller, has adverse effects on human health and the environment. Various estimation methods are employed to determine the PM10 concentration using historical data on controlling PM10 air pollution, early warning, and protecting public health and the environment. The present study analyses different Long Short-Term Memory (LSTM) models that can predict hourly PM10 concentration. In parallel, the study also investigates the effectiveness of the data preprocessing and feature selection (DPFS) process on the prediction accuracy of the LSTM models. For this purpose, three different LSTM models, namely Vanilla, Bi-Directional, and Stacked, were developed. Then, a comprehensive data preprocessing stage is used to eliminate missing and erroneous data and outliers from real -world raw data, and a feature selection process is applied to extract unnecessary features. The LSTM models consider three air quality parameters, including SO2, O-3, and CO, and three meteorological factors, including relative humidity, wind direction, and wind speed. The prediction performances of the LSTM models are compared using the RMSE, MAE and R-2 performance index according to whether DPFS is used in the models or not. As a result, when the DPFS process was applied, the proposed LSTM models achieved high prediction performance and can be used to predict hourly PM10 concentrations. Overall, the DPFS process significantly enhanced the developed LSTM models' prediction performance. Furthermore, the proposed model might be a useful tool for city administrators to make decisions and improve air quality management efforts.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Short-term estimations of PM10 concentration in the Middle Black Sea region based on grey prediction models
    Ozen, Hulya Aykac
    Obekcan, Hamdi
    CLEAN-SOIL AIR WATER, 2023, 51 (10)
  • [2] PM10 Density Forecast Model Using Long Short Term Memory
    Park, Jung-Hwan
    Yoo, Seong-Joon
    Kim, Kyung-Joong
    Gu, Yeong-Hyeon
    Lee, Keon-Hoon
    Son, U-Hyon
    2017 NINTH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS (ICUFN 2017), 2017, : 576 - 581
  • [3] Development of a daily PM10 and PM2.5 prediction system using a deep long short-term memory neural network model
    Kim, Hyun S.
    Park, Inyoung
    Song, Chul H.
    Lee, Kyunghwa
    Yun, Jae W.
    Kim, Hong K.
    Jeon, Moongu
    Lee, Jiwon
    Han, Kyung M.
    ATMOSPHERIC CHEMISTRY AND PHYSICS, 2019, 19 (20) : 12935 - 12951
  • [4] PM10 density forecast model using long short term memory
    Park, Jung-Hwan
    Yoo, Seong-Joon
    Kim, Kyung-Joong
    Gu, Yeong-Hyeon
    Lee, Keon-Hoon
    Son, U-Hyon
    International Conference on Ubiquitous and Future Networks, ICUFN, 2017, 0 : 576 - 581
  • [5] PM2.5/PM10 ratio prediction based on a long short-term memory neural network in Wuhan, China
    Wu, Xueling
    Wang, Ying
    He, Siyuan
    Wu, Zhongfang
    GEOSCIENTIFIC MODEL DEVELOPMENT, 2020, 13 (03) : 1499 - 1511
  • [6] Optimizing air quality predictions: A discrete wavelet transform and long short-term memory approach with wavelet-type selection for hourly PM10 concentrations
    Arslan, Gokce Nur Tasagil
    Depren, Serpil Kilic
    JOURNAL OF CHEMOMETRICS, 2024, 38 (04)
  • [7] Short-term and long-term effects of exposure to PM10
    Seihei, Narges
    Farhadi, Majid
    Takdastan, Afshin
    Asban, Parisa
    Kiani, Fatemeh
    Mohammadi, Mohammad Javad
    CLINICAL EPIDEMIOLOGY AND GLOBAL HEALTH, 2024, 27
  • [8] Evaluation of Sulphur Dioxide Hourly Prediction Using Long Short-term Memory for Summer and Winter Season
    Bennis, Mohammed
    Mohamed, Youssfi
    El Morabet, Rachida
    Alsubih, Majed
    Prayanagat, Muneer
    Khan, Roohul Abad
    ROCZNIK OCHRONA SRODOWISKA, 2024, 26 : 313 - 321
  • [9] An ensemble long short-term memory neural network for hourly PM2.5 concentration forecasting
    Bai, Yun
    Zeng, Bo
    Li, Chuan
    Zhang, Jin
    CHEMOSPHERE, 2019, 222 : 286 - 294
  • [10] Prediction of short and medium term PM10 concentration using artificial neural networks
    Schornobay-Lui, Elaine
    Alexandrina, Eduardo Carlos
    Aguiar, Monica Lopes
    Hanisch, Werner Siegfried
    Correa, Edinalda Moreira
    Correa, Nivaldo Aparecido
    MANAGEMENT OF ENVIRONMENTAL QUALITY, 2019, 30 (02) : 414 - 436