Novel MIA-LSTM Deep Learning Hybrid Model with Data Preprocessing for Forecasting of PM2.5

被引:6
|
作者
Narkhede, Gaurav [1 ]
Hiwale, Anil [1 ]
Tidke, Bharat [2 ]
Khadse, Chetan [3 ]
机构
[1] MIT World Peace Univ, Sch Elect & Commun Engn, Pune 411038, India
[2] MIT World Peace Univ, Sch Comp Engn & Technol, Pune 411038, India
[3] MIT World Peace Univ, Sch Elect Engn, Pune 411038, India
关键词
MIA-LSTM; data preprocessing; iterative imputation; autoencoder; LSTM; MISSING VALUES; NEURAL-NETWORK; IMPUTATION; AIR; PREDICTION;
D O I
10.3390/a16010052
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Day by day pollution in cities is increasing due to urbanization. One of the biggest challenges posed by the rapid migration of inhabitants into cities is increased air pollution. Sustainable Development Goal 11 indicates that 99 percent of the world's urban population breathes polluted air. In such a trend of urbanization, predicting the concentrations of pollutants in advance is very important. Predictions of pollutants would help city administrations to take timely measures for ensuring Sustainable Development Goal 11. In data engineering, imputation and the removal of outliers are very important steps prior to forecasting the concentration of air pollutants. For pollution and meteorological data, missing values and outliers are critical problems that need to be addressed. This paper proposes a novel method called multiple iterative imputation using autoencoder-based long short-term memory (MIA-LSTM) which uses iterative imputation using an extra tree regressor as an estimator for the missing values in multivariate data followed by an LSTM autoencoder for the detection and removal of outliers present in the dataset. The preprocessed data were given to a multivariate LSTM for forecasting PM2.5 concentration. This paper also presents the effect of removing outliers and missing values from the dataset as well as the effect of imputing missing values in the process of forecasting the concentrations of air pollutants. The proposed method provides better results for forecasting with a root mean square error (RMSE) value of 9.8883. The obtained results were compared with the traditional gated recurrent unit (GRU), 1D convolutional neural network (CNN), and long short-term memory (LSTM) approaches for a dataset of the Aotizhonhxin area of Beijing in China. Similar results were observed for another two locations in China and one location in India. The results obtained show that imputation and outlier/anomaly removal improve the accuracy of air pollution forecasting.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] A novel ensemble reinforcement learning gated unit model for daily PM2.5 forecasting
    Yanfei Li
    Zheyu Liu
    Hui Liu
    Air Quality, Atmosphere & Health, 2021, 14 : 443 - 453
  • [32] A Novel Hybrid Method to Predict PM2.5 Concentration Based on the SWT-QPSO-LSTM Hybrid Model
    Du, Meng
    Chen, Yixin
    Liu, Yang
    Yin, Hang
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [33] A new hybrid PM2.5 volatility forecasting model based on EMD and machine learning algorithms
    Wang, Ping
    Bi, Xu
    Zhang, Guisheng
    Yu, Mengjiao
    ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2023, 30 (34) : 82878 - 82894
  • [34] Hybrid Spatio-temporal Deep Learning Framework for Particulate Matter(PM2.5) Concentration Forecasting
    Abirami, S.
    Chitra, P.
    Madhumitha, R.
    Kesavan, Ragul S.
    2020 INTERNATIONAL CONFERENCE ON INNOVATIVE TRENDS IN INFORMATION TECHNOLOGY (ICITIIT), 2020,
  • [35] Forecasting of PM2.5 Concentration in Beijing Using Hybrid Deep Learning Framework Based on Attention Mechanism
    Li, Dong
    Liu, Jiping
    Zhao, Yangyang
    APPLIED SCIENCES-BASEL, 2022, 12 (21):
  • [36] A novel hybrid-Garch model based on ARIMA and SVM for PM2.5 concentrations forecasting
    Wang, Ping
    Zhang, Hong
    Qin, Zuodong
    Zhang, Guisheng
    ATMOSPHERIC POLLUTION RESEARCH, 2017, 8 (05) : 850 - 860
  • [37] High-spatiotemporal-resolution PM2.5 forecasting by hybrid deep learning models with ensembled massive heterogeneous monitoring data
    Wu, Kuan-Yen
    Hsia, I. -Wen
    Kow, Pu-Yun
    Chang, Li-Chiu
    Chang, Fi-John
    JOURNAL OF CLEANER PRODUCTION, 2023, 433
  • [38] PM2.5 forecasting with hybrid LSE model-based approach
    Chen, Yunliang
    Li, Fangyuan
    Deng, Ze
    Chen, Xiaodao
    He, Jijun
    SOFTWARE-PRACTICE & EXPERIENCE, 2017, 47 (03): : 379 - 390
  • [39] DESA: a novel hybrid decomposing-ensemble and spatiotemporal attention model for PM2.5 forecasting
    Shuwei Fang
    Qi Li
    Hamed Karimian
    Hui Liu
    Yuqin Mo
    Environmental Science and Pollution Research, 2022, 29 : 54150 - 54166
  • [40] A new hybrid deep neural network for multiple sites PM2.5 forecasting
    Teng, Mengfan
    Li, Siwei
    Yang, Jie
    Chen, Jiarui
    Fan, Chunying
    Ding, Yu
    JOURNAL OF CLEANER PRODUCTION, 2024, 473