Fine Particulate Matter Concentration Level Prediction by using Tree-based Ensemble Classification Algorithms

被引:0
|
作者
Zhao, Yin [1 ]
Abu Hasan, Yahya [1 ]
机构
[1] Univ Sains Malaysia, Sch Math Sci, George Town, Penang, Malaysia
关键词
Random Forest; C5.0; PM2.5; prediction; data mining;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Pollutant forecasting is an important problem in the environmental sciences. Data mining is an approach to discover knowledge from large data. This paper tries to use data mining methods to forecast PM2.5 concentration level, which is an important air pollutant. There are several tree-based classification algorithms available in data mining, such as CART, C4.5, Random Forest (RF) and C5.0. RF and C5.0 are popular ensemble methods, which are, RF builds on CART with Bagging and C5.0 builds on C4.5 with Boosting, respectively. This paper builds PM2.5 concentration level predictive models based on RF and C5.0 by using R packages. The data set includes 2000-2011 period data in a new town of Hong Kong. The PM2.5 concentration is divided into 2 levels, the critical points is 25 mu g/m(3)(24 hours mean). According to 100 times 10-fold cross validation, the best testing accuracy is from RF model, which is around 0.845 similar to 0.854.
引用
收藏
页码:21 / 27
页数:7
相关论文
共 50 条
  • [31] COVID-19 Cases Prediction in Saudi Arabia Using Tree-based Ensemble Models
    Almazroi, Abdulwahab Ali
    Usmani, Raja Sher Afgun
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2022, 32 (01): : 389 - 400
  • [32] Prediction of asphalt binder elastic recovery using tree-based ensemble bagging and boosting models
    Asadi, Babak
    Hajj, Ramez
    CONSTRUCTION AND BUILDING MATERIALS, 2024, 410
  • [33] ATM Allocation Using Decision Tree-Based Algorithms
    Yurdakul, Hazal Hasret
    Kasikci, Kerem
    Cagatay, Ilhan
    Guven, Melih
    Koras, Murat
    Akgun, Baris
    Gonen, Mehmet
    29TH IEEE CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS (SIU 2021), 2021,
  • [34] Enhancing cell delay accuracy in post-placed netlists using ensemble tree-based algorithms
    Attaoui, Yassine
    Chentouf, Mohamed
    Ismaili, Zine El Abidine Alaoui
    El Mourabit, Aimad
    INTEGRATION-THE VLSI JOURNAL, 2024, 97
  • [35] Improved prediction of awakening or nonawakening in severe anoxic coma using tree-based classification
    Tirschwell, D
    CRITICAL CARE MEDICINE, 2006, 34 (05) : 1573 - 1575
  • [36] COMPARISON OF TREE-BASED CLASSIFICATION ALGORITHMS IN MAPPING BURNED FOREST AREAS
    Matci, Dilek Kucuk
    Comert, Resul
    Avdan, Ugur
    GEODETSKI VESTNIK, 2020, 64 (03) : 348 - 360
  • [37] Performance evaluation of feature selection and tree-based algorithms for traffic classification
    Aouedi, Ons
    Piamrat, Kandaraj
    Parrein, Benoit
    2021 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2021,
  • [38] Ensemble learning-based classification of microarray cancer data on tree-based features
    Dagnew, Guesh
    Shekar, B. H.
    COGNITIVE COMPUTATION AND SYSTEMS, 2021, 3 (01) : 48 - 60
  • [39] Tree-Based Ensemble Multi-Task Learning Method for Classification and Regression
    Simm, Jaak
    Magrans De Abril, Ildefons
    Sugiyama, Masashi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (06) : 1677 - 1681
  • [40] Video classification using a tree-based RBF network
    Gillespie, WJ
    Nguyen, DT
    2005 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), VOLS 1-5, 2005, : 3753 - 3756