Fine Particulate Matter Concentration Level Prediction by using Tree-based Ensemble Classification Algorithms

Cited by: 0
Authors
Zhao, Yin [1 ]
Abu Hasan, Yahya [1 ]
Affiliation
[1] Univ Sains Malaysia, Sch Math Sci, George Town, Penang, Malaysia
Keywords
Random Forest; C5.0; PM2.5; prediction; data mining;
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Pollutant forecasting is an important problem in the environmental sciences. Data mining is an approach to discovering knowledge from large data sets. This paper uses data mining methods to forecast the concentration level of PM2.5, an important air pollutant. Several tree-based classification algorithms are available in data mining, such as CART, C4.5, Random Forest (RF) and C5.0. RF and C5.0 are popular ensemble methods: RF builds on CART with bagging, and C5.0 builds on C4.5 with boosting. This paper builds predictive models of the PM2.5 concentration level based on RF and C5.0, using R packages. The data set covers the period 2000-2011 in a new town of Hong Kong. The PM2.5 concentration is divided into two levels, with the cut-off point at 25 µg/m³ (24-hour mean). According to 100 repetitions of 10-fold cross-validation, the best testing accuracy comes from the RF model, at approximately 0.845 to 0.854.
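The evaluation protocol described in the abstract (binary labelling at the 25 µg/m³ cut-off, then 100 repetitions of 10-fold cross-validation) can be sketched as follows. This is a minimal illustration in Python, not the authors' R code: the names `to_level`, `kfold_indices`, `MajorityClassifier` and `repeated_cv_accuracy` are hypothetical, the rule that values strictly above the cut-off count as "high" is an assumption, and the majority-class model is only a placeholder where an RF or C5.0 implementation would be plugged in.

```python
import random
from collections import Counter

THRESHOLD = 25.0  # µg/m³, 24-hour mean (cut-off stated in the abstract)


def to_level(pm25):
    """Map a 24-hour mean PM2.5 value to a binary level.

    Assumption: values strictly above the cut-off are labelled 'high'.
    """
    return "high" if pm25 > THRESHOLD else "low"


def kfold_indices(n, k, rng):
    """Shuffle the indices 0..n-1 and split them into k nearly equal folds."""
    idx = list(range(n))
    rng.shuffle(idx)
    return [idx[i::k] for i in range(k)]


class MajorityClassifier:
    """Placeholder model: always predicts the most common training label."""

    def fit(self, X, y):
        self.label_ = Counter(y).most_common(1)[0][0]
        return self

    def predict(self, X):
        return [self.label_ for _ in X]


def repeated_cv_accuracy(X, y, make_model, k=10, repeats=100, seed=0):
    """Return one mean test accuracy per repetition of k-fold cross-validation."""
    rng = random.Random(seed)
    results = []
    for _ in range(repeats):
        fold_acc = []
        for test_idx in kfold_indices(len(X), k, rng):
            held_out = set(test_idx)
            train_idx = [i for i in range(len(X)) if i not in held_out]
            model = make_model().fit([X[i] for i in train_idx],
                                     [y[i] for i in train_idx])
            pred = model.predict([X[i] for i in test_idx])
            correct = sum(p == y[i] for p, i in zip(pred, test_idx))
            fold_acc.append(correct / len(test_idx))
        results.append(sum(fold_acc) / k)
    return results
```

Any object with `fit` and `predict` methods can be passed through `make_model`, so the placeholder could be swapped for a real ensemble classifier; reporting the minimum and maximum of the returned list gives an accuracy range of the kind quoted in the abstract.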
Pages: 21-27
Number of pages: 7