PM2.5 concentration simulation by hybrid machine learning based on image features

被引:0
|
作者
Ma, Minjin [1 ]
Zhao, Zhenzhu [1 ,2 ]
Ma, Yuzhan [3 ]
Cao, Yidan [1 ]
Kang, Guoqiang [1 ]
机构
[1] Lanzhou Univ, Coll Atmospher Sci, Gansu Key Lab Arid Climate Change & Reducing Disas, Lanzhou, Peoples R China
[2] Dalian Ecol & Environm Affairs Serv Ctr, Water & Atmospher Dept, Dalian, Peoples R China
[3] Univ Malaya, Fac Comp Sci & Informat Technol, Kuala Lumpur, Malaysia
关键词
machine learning; image features; complete ensemble empirical mode decomposition with adaptive noise; signal decomposition; PM2.5; MEMORY NEURAL-NETWORK; AIR-POLLUTION; ENSEMBLE MODEL; PM10;
D O I
10.3389/feart.2025.1509489
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
Air pollution significantly impacts human health, making the development of effective pollutant concentration assessment methods crucial. This study introduces a hybrid machine learning approach to simulate PM2.5 mass concentration using outdoor images, offering an alternative to traditional observation techniques. The proposed method utilizes a convolutional neural network (CNN) to extract image features through transfer learning. The importance of these features is then evaluated using a random forest (RF) model. In addition, the extracted image features are combined with meteorological data (e.g., temperature (TEM), relative humidity (RHU), and sea level pressure (PRS_Sea)) and pollutant concentration data (hourly PM2.5 concentrations from four monitoring stations) for complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) signal decomposition. This results in multiscale signals that are subsequently used in the hybrid machine learning model to simulate PM2.5 concentrations. The results demonstrate that the ResNet50 training method, which extracts 64 image features, yields the best performance. An RF model is applied to the low-frequency signal, superimposed with the trend signal, while a Lasso regression model is used for the high-frequency signal. The combined approach produces superior simulation results than the RF model alone. Notably, image feature 23, PM2.5 concentration from the Institute of Biological Products (IBP), and TEM are most influential for the high-frequency signal, with characteristic coefficients of 1.409, 0.380, and 0.318, respectively. For the low-frequency signals, image features 5 and 23, along with the PM2.5 concentration from the Lanlian Hotel (LH), are the most significant, with importance values of 0.170, 0.137, and 0.125, respectively. The Lasso regression model (random forest model) has the function of high (low) value correction for high (low) frequency signal simulation, leading to more accurate simulation results. This study proposes a cost-effective method for accurately estimating PM2.5 concentrations.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Machine-learning-based model and simulation analysis of PM2.5 concentration prediction in Beijing
    Qu Y.
    Qian X.
    Song H.-Q.
    He J.
    Li J.-H.
    Xiu H.
    Gongcheng Kexue Xuebao/Chinese Journal of Engineering, 2019, 41 (03): : 401 - 407
  • [2] PM2.5 Concentration Measurement Based on Image Perception
    Wang, Guangcheng
    Shi, Quan
    Jiang, Kui
    ELECTRONICS, 2022, 11 (09)
  • [3] PM2.5 Prediction Based on the CEEMDAN Algorithm and a Machine Learning Hybrid Model
    Ban, Wenchao
    Shen, Liangduo
    SUSTAINABILITY, 2022, 14 (23)
  • [4] A new hybrid prediction model of PM2.5 concentration based on secondary decomposition and optimized extreme learning machine
    Yang, Hong
    Zhao, Junlin
    Li, Guohui
    ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2022, 29 (44) : 67214 - 67241
  • [5] A new hybrid prediction model of PM2.5 concentration based on secondary decomposition and optimized extreme learning machine
    Hong Yang
    Junlin Zhao
    Guohui Li
    Environmental Science and Pollution Research, 2022, 29 : 67214 - 67241
  • [6] Prediction of PM2.5 Concentration Based on Ensemble Learning
    Peng Y.
    Zhao Z.-R.
    Wu T.-X.
    Wang J.
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2019, 42 (06): : 162 - 169
  • [7] An Improved Weight Optimization of Hybrid Machine Learning Models for Forecasting Daily PM2.5 Concentration
    Ratchagit, Manlika
    CONTEMPORARY MATHEMATICS, 2024, 5 (03): : 3953 - 3970
  • [8] Combining Machine Learning and Numerical Simulation for High-Resolution PM2.5 Concentration Forecast
    Bi, Jianzhao
    Knowland, K. Emma
    Keller, Christoph A.
    Liu, Yang
    ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2022, 56 (03) : 1544 - 1556
  • [9] PM2.5 and O3 concentration estimation based on interpretable machine learning
    Wang, Siyuan
    Ren, Ying
    Xia, Bisheng
    ATMOSPHERIC POLLUTION RESEARCH, 2023, 14 (09)
  • [10] PM2.5 Concentration Estimation Based on Image Quality Assessment
    Yang, Benqian
    Chen, Qiang
    PROCEEDINGS 2017 4TH IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2017, : 676 - 681