PM2.5 concentration simulation by hybrid machine learning based on image features

被引:0
|
作者
Ma, Minjin [1 ]
Zhao, Zhenzhu [1 ,2 ]
Ma, Yuzhan [3 ]
Cao, Yidan [1 ]
Kang, Guoqiang [1 ]
机构
[1] Lanzhou Univ, Coll Atmospher Sci, Gansu Key Lab Arid Climate Change & Reducing Disas, Lanzhou, Peoples R China
[2] Dalian Ecol & Environm Affairs Serv Ctr, Water & Atmospher Dept, Dalian, Peoples R China
[3] Univ Malaya, Fac Comp Sci & Informat Technol, Kuala Lumpur, Malaysia
关键词
machine learning; image features; complete ensemble empirical mode decomposition with adaptive noise; signal decomposition; PM2.5; MEMORY NEURAL-NETWORK; AIR-POLLUTION; ENSEMBLE MODEL; PM10;
D O I
10.3389/feart.2025.1509489
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
Air pollution significantly impacts human health, making the development of effective pollutant concentration assessment methods crucial. This study introduces a hybrid machine learning approach to simulate PM2.5 mass concentration using outdoor images, offering an alternative to traditional observation techniques. The proposed method utilizes a convolutional neural network (CNN) to extract image features through transfer learning. The importance of these features is then evaluated using a random forest (RF) model. In addition, the extracted image features are combined with meteorological data (e.g., temperature (TEM), relative humidity (RHU), and sea level pressure (PRS_Sea)) and pollutant concentration data (hourly PM2.5 concentrations from four monitoring stations) for complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) signal decomposition. This results in multiscale signals that are subsequently used in the hybrid machine learning model to simulate PM2.5 concentrations. The results demonstrate that the ResNet50 training method, which extracts 64 image features, yields the best performance. An RF model is applied to the low-frequency signal, superimposed with the trend signal, while a Lasso regression model is used for the high-frequency signal. The combined approach produces superior simulation results than the RF model alone. Notably, image feature 23, PM2.5 concentration from the Institute of Biological Products (IBP), and TEM are most influential for the high-frequency signal, with characteristic coefficients of 1.409, 0.380, and 0.318, respectively. For the low-frequency signals, image features 5 and 23, along with the PM2.5 concentration from the Lanlian Hotel (LH), are the most significant, with importance values of 0.170, 0.137, and 0.125, respectively. The Lasso regression model (random forest model) has the function of high (low) value correction for high (low) frequency signal simulation, leading to more accurate simulation results. This study proposes a cost-effective method for accurately estimating PM2.5 concentrations.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Modelling and Forecasting Temporal PM2.5 Concentration Using Ensemble Machine Learning Methods
    Ejohwomu, Obuks Augustine
    Shamsideen Oshodi, Olakekan
    Oladokun, Majeed
    Bukoye, Oyegoke Teslim
    Emekwuru, Nwabueze
    Sotunbo, Adegboyega
    Adenuga, Olumide
    BUILDINGS, 2022, 12 (01)
  • [32] A Hybrid Deep Learning Approach for PM2.5 Concentration Prediction in Smart Environmental Monitoring
    Vo, Minh Thanh
    Vo, Anh H.
    Bui, Huong
    Le, Tuong
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 36 (03): : 3029 - 3041
  • [33] PM2.5 Estimation Based on Image Analysis
    Li, Xiaoli
    Zhang, Shan
    Wang, Kang
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2020, 14 (02): : 907 - 923
  • [34] Analysis of the Gridded Influencing Factors of the PM2.5 Concentration in Sichuan Province Based on a Stacked Machine Learning Model
    Wu, Yuhong
    Du, Ning
    Wang, Li
    Cai, Hong
    Zhou, Bin
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH, 2023, 17 (01)
  • [35] Spatiotemporal change of PM2.5 concentration in Beijing-Tianjin-Hebei and its prediction based on machine learning
    Liu, Nanjian
    Hao, Zhixin
    Zhao, Peng
    URBAN CLIMATE, 2024, 58
  • [36] MERRA-2 PM2.5 mass concentration reconstruction in China mainland based on LightGBM machine learning
    Ma, Jinghui
    Zhang, Renhe
    Xu, Jianming
    Yu, Zhongqi
    SCIENCE OF THE TOTAL ENVIRONMENT, 2022, 827
  • [37] Analysis of the Gridded Influencing Factors of the PM2.5 Concentration in Sichuan Province Based on a Stacked Machine Learning Model
    Yuhong Wu
    Ning Du
    Li Wang
    Hong Cai
    Bin Zhou
    International Journal of Environmental Research, 2023, 17
  • [38] A novel hybrid strategy for PM2.5 concentration analysis and prediction
    Jiang, Ping
    Dong, Qingli
    Li, Peizhi
    JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2017, 196 : 443 - 457
  • [39] Effects of Big Data on PM2.5: A Study Based on Double Machine Learning
    Wei, Xinyu
    Cheng, Mingwang
    Duan, Kaifeng
    Kong, Xiangxing
    LAND, 2024, 13 (03)
  • [40] PM2.5 concentration prediction using machine learning algorithms: an approach to virtual monitoring stations
    Makhdoomi, Ahmad
    Sarkhosh, Maryam
    Ziaei, Somayyeh
    SCIENTIFIC REPORTS, 2025, 15 (01):