Evaluation of machine learning and deep learning models for daily air quality index prediction in Delhi city, India

Cited by: 1
Authors
Pande, Chaitanya Baliram [1]
Radhadevi, Latha [1]
Satyanarayana, Murthy Bandaru [1]
Affiliations
[1] Indian Inst Trop Meteorol, Dr Homi Bhabha Rd, Pune 411008, India
Keywords
Air pollution; Extreme gradient boosting; Cross-validation; SHAP method; ANN model; NEURAL-NETWORKS; PM10;
DOI
10.1007/s10661-024-13351-1
Chinese Library Classification number
X [Environmental Science, Safety Science];
Discipline classification code
08 ; 0830 ;
Abstract
The air quality index (AQI), based on criteria air pollutants, is defined to provide a shared measure of air quality. As air pollution continues to rise in cities worldwide due to urbanization and climate change, monitoring and forecasting models that gather and predict information about pollutant concentrations are essential in every city. Air quality predictions have become increasingly useful for management. Recently, machine learning (ML) and artificial intelligence (AI) have improved the performance of air quality forecasting in Indian cities. This paper focuses on air pollution as a significant ecological problem that directly affects human health and environmental systems in urban areas. We therefore developed advanced models for daily AQI forecasting to anticipate air pollution levels in the coming days. Six data-driven models were developed and implemented for daily AQI forecasting in the study area; such forecasts are crucial for understanding future air pollution levels and for planning and controlling air pollution across the city. The developed models were applied to air quality datasets. A comparison of the ML models tested indicates that, in the testing phase, the extreme gradient boosting (XGBoost) algorithm achieves the highest coefficient of determination (R2) of 0.99 and the lowest root-mean-square error (RMSE) of 4.65. The artificial neural network (ANN) model performs slightly worse than XGBoost, with reported R2, mean squared error (MSE), and RMSE values of 0.99, 13.99, and 198.88, respectively. All models were subjected to ten-fold cross-validation. Under cross-validation, however, the random forest (RF) model outperforms the other models, with reported R2, RMSE, and MSE values of 0.99, 3.64, and 4.12, respectively. The study also employed two interpretability methods, feature importance analysis and Shapley additive explanations (SHAP), to provide both global and local explanations in a model-agnostic manner. The feature importance analysis shows that particulate matter (PM2.5 and PM10), carbon monoxide (CO), and nitrogen oxides (NOx) were the most influential variables. The results indicate that such deep learning (DL) and ML models may improve the accuracy of AQI forecasts and the understanding of air pollution, particularly in metropolitan cities.
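The evaluation metrics named in the abstract (R2, MSE, RMSE) and the ten-fold split can be sketched as follows. This is a minimal illustration using only the Python standard library, not the authors' code: the function names, the contiguous (unshuffled) fold layout, and the sample values are all assumptions made for the example, and the numbers are not data from the paper.

```python
import math

def mse(y_true, y_pred):
    """Mean squared error: average of squared residuals."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

def rmse(y_true, y_pred):
    """Root-mean-square error: square root of the MSE, in the units of the target."""
    return math.sqrt(mse(y_true, y_pred))

def r2(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_t = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_t) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot

def k_fold_indices(n_samples, k=10):
    """Yield (train_idx, test_idx) pairs for k contiguous, non-overlapping folds.

    Assumption: folds are taken in order without shuffling; the paper's
    splitting strategy is not stated in the abstract.
    """
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test_idx = list(range(start, start + size))
        train_idx = list(range(0, start)) + list(range(start + size, n_samples))
        yield train_idx, test_idx
        start += size

# Hypothetical daily AQI values, for illustration only.
observed = [100.0, 150.0, 200.0, 250.0]
predicted = [110.0, 140.0, 210.0, 240.0]

print("MSE :", mse(observed, predicted))
print("RMSE:", rmse(observed, predicted))
print("R2  :", r2(observed, predicted))
```

In a cross-validated evaluation, each model would be fitted on `train_idx` and scored on `test_idx` for every fold, and the per-fold metrics averaged. By definition RMSE is the square root of MSE, so the two always move together.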
Pages: 19