Improving Rotation Forest Performance for Imbalanced Data Classification through Fuzzy Clustering

被引:0
|
作者
Hosseinzadeh, Mehrdad [1 ,2 ]
Eftekhari, Mahdi [1 ]
机构
[1] Shahid Bahonar Univ Kerman, Dept Comp Engn, Kerman, Iran
[2] Shahid Bahonar Univ Kerman, Young Researchers Assoc, Kerman, Iran
关键词
Imbalanced Data Classification; Ensemble Learning; Fuzzy Clustering;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, fuzzy C-means clustering and Rotation Forest (RF) are combined to construct a high performance classifier for imbalanced data classification. Data samples are clustered via fuzzy clustering and then fuzzy membership function matrix is added into data samples. Therefore, clusters memberships of samples are utilized as new features that are added into the original features. After that, RF is utilized for classification where the new set of features as well as the original ones are taken into account in the feature subspacing phase. The proposed algorithm utilizes SMOTE oversampling algorithm for balancing data samples. The obtained results confirm that our proposed method outperforms the other well-known bagging algorithms.
引用
收藏
页码:35 / 40
页数:6
相关论文
共 50 条
  • [21] Data mining based fuzzy classification algorithm for imbalanced data
    Xu, Le
    Chow, Mo-Yuen
    Taylor, Leroy S.
    2006 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-5, 2006, : 825 - +
  • [22] Improving classification performance on real data through imputation
    Bratu, C. Vidrighin
    Muresan, T.
    Potolea, R.
    2008 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION, QUALITY AND TESTING, ROBOTICS (AQTR 2008), THETA 16TH EDITION, VOL III, PROCEEDINGS, 2008, : 464 - 469
  • [23] A Fuzzy Decision Tree Approach for Imbalanced Data Classification
    Sardari, Sahar
    Eftekhari, Mahdi
    2016 6TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2016, : 292 - 297
  • [24] Improving the classification performance on imbalanced data sets via new hybrid parameterisation model
    Mohamad, Masurah
    Selamat, Ali
    Subroto, Imam Much
    Krejcar, Ondrej
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2021, 33 (07) : 787 - 797
  • [25] Improving activated sludge classification based on imbalanced data
    Qian, Y.
    Liang, Y. C.
    Guan, R. C.
    JOURNAL OF HYDROINFORMATICS, 2014, 16 (06) : 1331 - 1342
  • [26] Improving performance of classification on incomplete data using feature selection and clustering
    Cao Truong Tran
    Zhang, Mengjie
    Andreae, Peter
    Xue, Bing
    Lam Thu Bui
    APPLIED SOFT COMPUTING, 2018, 73 : 848 - 861
  • [27] Clustering-based incremental learning for imbalanced data classification
    Liu, Yuxin
    Du, Guangyu
    Yin, Chenke
    Zhang, Haichao
    Wang, Jia
    KNOWLEDGE-BASED SYSTEMS, 2024, 292
  • [28] UNSUPERVISED CLASSIFICATION OF FOREST FROM POLARIMETRIC INTERFEROMETRIC SAR DATA USING FUZZY CLUSTERING
    Luo, Huan-Min
    Chen, Er-Xue
    Li, Xiao-Wen
    Cheng, Jian
    Li, Min
    PROCEEDINGS OF THE 2010 INTERNATIONAL CONFERENCE ON WAVELET ANALYSIS AND PATTERN RECOGNITION, 2010, : 201 - 206
  • [29] Clustering-based incremental learning for imbalanced data classification
    Liu, Yuxin
    Du, Guangyu
    Yin, Chenke
    Zhang, Hachao
    Wang, Jia
    Knowledge-Based Systems, 2024, 292
  • [30] Classification performance assessment for imbalanced multiclass data
    Aguilar-Ruiz, Jesus S.
    Michalak, Marcin
    SCIENTIFIC REPORTS, 2024, 14 (01):