Improving Rotation Forest Performance for Imbalanced Data Classification through Fuzzy Clustering

Authors
Hosseinzadeh, Mehrdad [1, 2]
Eftekhari, Mahdi [1 ]
Affiliations
[1] Shahid Bahonar Univ Kerman, Dept Comp Engn, Kerman, Iran
[2] Shahid Bahonar Univ Kerman, Young Researchers Assoc, Kerman, Iran
Keywords
Imbalanced Data Classification; Ensemble Learning; Fuzzy Clustering
DOI
Not available
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, fuzzy C-means clustering and Rotation Forest (RF) are combined to construct a high-performance classifier for imbalanced data classification. The data samples are first clustered via fuzzy C-means, and the resulting fuzzy membership matrix is appended to them, so that each sample's cluster memberships serve as new features alongside the original ones. RF is then used for classification, with both the new and the original features taken into account in the feature-subspacing phase. The proposed algorithm uses the SMOTE oversampling algorithm to balance the data samples. The obtained results confirm that the proposed method outperforms other well-known bagging algorithms.
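The feature-augmentation step described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function names `fuzzy_c_means` and `add_membership_features`, the number of clusters, the fuzzifier `m = 2`, the stopping rule, and the synthetic data are all assumptions, since the abstract does not specify them. The SMOTE balancing and Rotation Forest stages are not shown.

```python
import numpy as np


def fuzzy_c_means(X, n_clusters=3, m=2.0, n_iter=100, tol=1e-5, seed=0):
    """Plain fuzzy C-means (hypothetical helper).

    Returns (centers, U), where U has shape (n_samples, n_clusters)
    and each row of U sums to 1.
    """
    rng = np.random.default_rng(seed)
    # Random initial memberships, each row normalized to sum to 1.
    U = rng.random((X.shape[0], n_clusters))
    U /= U.sum(axis=1, keepdims=True)
    for _ in range(n_iter):
        W = U ** m
        # Centers are membership-weighted means of the samples.
        centers = (W.T @ X) / W.sum(axis=0)[:, None]
        # Distance from every sample to every center, floored to avoid 1/0.
        dist = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        dist = np.fmax(dist, 1e-12)
        # Standard update: u_ik is proportional to d_ik^(-2 / (m - 1)).
        inv = dist ** (-2.0 / (m - 1.0))
        U_new = inv / inv.sum(axis=1, keepdims=True)
        converged = np.abs(U_new - U).max() < tol
        U = U_new
        if converged:
            break
    return centers, U


def add_membership_features(X, U):
    """Append the membership matrix to the original features, column-wise."""
    return np.hstack([X, U])


# Synthetic imbalanced data (assumed for illustration): 90 majority, 10 minority.
rng = np.random.default_rng(42)
X_major = rng.normal(0.0, 1.0, size=(90, 4))
X_minor = rng.normal(3.0, 1.0, size=(10, 4))
X = np.vstack([X_major, X_minor])

centers, U = fuzzy_c_means(X, n_clusters=3)
X_aug = add_membership_features(X, U)
print(X_aug.shape)  # (100, 7): 4 original features + 3 membership features
```

In a full pipeline, `X_aug` would then go to the oversampling and ensemble stages. The abstract does not say whether SMOTE is applied before or after clustering, so that ordering is left open here.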
Pages: 35-40
Number of pages: 6