Improving Rotation Forest Performance for Imbalanced Data Classification through Fuzzy Clustering

被引:0
|
作者
Hosseinzadeh, Mehrdad [1 ,2 ]
Eftekhari, Mahdi [1 ]
机构
[1] Shahid Bahonar Univ Kerman, Dept Comp Engn, Kerman, Iran
[2] Shahid Bahonar Univ Kerman, Young Researchers Assoc, Kerman, Iran
关键词
Imbalanced Data Classification; Ensemble Learning; Fuzzy Clustering;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, fuzzy C-means clustering and Rotation Forest (RF) are combined to construct a high performance classifier for imbalanced data classification. Data samples are clustered via fuzzy clustering and then fuzzy membership function matrix is added into data samples. Therefore, clusters memberships of samples are utilized as new features that are added into the original features. After that, RF is utilized for classification where the new set of features as well as the original ones are taken into account in the feature subspacing phase. The proposed algorithm utilizes SMOTE oversampling algorithm for balancing data samples. The obtained results confirm that our proposed method outperforms the other well-known bagging algorithms.
引用
收藏
页码:35 / 40
页数:6
相关论文
共 50 条
  • [11] Imbalanced web spam classification based on nested rotation forest
    Department of Information Science and Engineering, Shandong Normal University, No. 88, Wenhua East Road, Jinan, China
    不详
    ICIC Express Lett., 3 (937-944):
  • [12] Improving the performance of Fuzzy Clustering algorithms through Outlier Identification
    Kaur, Prabhjot
    Gosain, Anjana
    2009 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, 2009, : 373 - +
  • [13] Improving SMOTE with Fuzzy Rough Prototype Selection to Detect Noise in Imbalanced Classification Data
    Verbiest, Nele
    Ramentol, Enislay
    Cornelis, Chris
    Herrera, Francisco
    ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA 2012, 2012, 7637 : 169 - 178
  • [14] Rough fuzzy classification for class imbalanced data
    Mazumder, Riaj Uddin
    Begum, Shahin Ara
    Biswas, Devajyoti
    Advances in Intelligent Systems and Computing, 2015, 335 : 159 - 171
  • [15] ESTIMATION OF FOREST PARAMETERS THROUGH FUZZY CLASSIFICATION OF TM DATA
    MASELLI, F
    CONESE, C
    DEFILIPPIS, T
    NORCINI, S
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 1995, 33 (01): : 77 - 84
  • [16] Imbalanced Data Classification Algorithm Based on Clustering and SVM
    Huang, Bo
    Zhu, Yimin
    Wang, Zhongzhen
    Fang, Zhijun
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2021, 30 (02)
  • [17] Local Clustering Conformal Predictor for Imbalanced Data Classification
    Wang, Huazhen
    Chen, Yewang
    Chen, Zhigang
    Yang, Fan
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2013, 2013, 412 : 421 - 431
  • [18] Improving undersampling-based ensemble with rotation forest for imbalanced problem
    Guo, Huaping
    Diao, Xiaoyu
    Liu, Hongbing
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2019, 27 (02) : 1371 - 1386
  • [19] Clustering and classification for dry bean feature imbalanced data
    Lee, Chou-Yuan
    Wang, Wei
    Huang, Jian-Qiong
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [20] Evaluate Clustering Performance and Computational Efficiency for PSO based Fuzzy Clustering Methods in Processing Big Imbalanced Data
    Wang, Jin
    Fang, Hua
    Li, Bo
    Wang, Honggang
    2017 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2017,