Improving Rotation Forest Performance for Imbalanced Data Classification through Fuzzy Clustering

被引:0
|
作者
Hosseinzadeh, Mehrdad [1 ,2 ]
Eftekhari, Mahdi [1 ]
机构
[1] Shahid Bahonar Univ Kerman, Dept Comp Engn, Kerman, Iran
[2] Shahid Bahonar Univ Kerman, Young Researchers Assoc, Kerman, Iran
关键词
Imbalanced Data Classification; Ensemble Learning; Fuzzy Clustering;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, fuzzy C-means clustering and Rotation Forest (RF) are combined to construct a high performance classifier for imbalanced data classification. Data samples are clustered via fuzzy clustering and then fuzzy membership function matrix is added into data samples. Therefore, clusters memberships of samples are utilized as new features that are added into the original features. After that, RF is utilized for classification where the new set of features as well as the original ones are taken into account in the feature subspacing phase. The proposed algorithm utilizes SMOTE oversampling algorithm for balancing data samples. The obtained results confirm that our proposed method outperforms the other well-known bagging algorithms.
引用
收藏
页码:35 / 40
页数:6
相关论文
共 50 条
  • [1] Using Fuzzy Undersampling and Fuzzy PCA to Improve Imbalanced Classification through Rotation Forest Algorithm
    Hosseinzadeh, Mehrdad
    Eftekhari, Mahdi
    CSSE 2015 20TH INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING, 2015,
  • [2] Adaptive Fuzzy Clustering for Improving Classification Performance in Yeast data set
    Kim, Man Sun
    Yang, Hyung Jeong
    Cheah, Wooi Ping
    2008 4TH INTERNATIONAL IEEE CONFERENCE INTELLIGENT SYSTEMS, VOLS 1 AND 2, 2008, : 752 - +
  • [3] Performance enrichment through parameter tuning of random forest classification for imbalanced data applications
    More, Anjali S.
    Rana, Dipti P.
    MATERIALS TODAY-PROCEEDINGS, 2022, 56 : 3585 - 3593
  • [4] Improving Random Forest and Rotation Forest for highly imbalanced datasets
    Su, Chong
    Ju, Shenggen
    Liu, Yiguang
    Yu, Zhonghua
    INTELLIGENT DATA ANALYSIS, 2015, 19 (06) : 1409 - 1432
  • [5] Imbalanced Data Classification Based on Clustering
    Li, Hu
    Zou, Peng
    Han, Weihong
    Xia, Rongze
    COMPUTER-AIDED DESIGN, MANUFACTURING, MODELING AND SIMULATION III, 2014, 443 : 741 - 745
  • [6] SMOTE-Based Weighted Deep Rotation Forest for the Imbalanced Hyperspectral Data Classification
    Quan, Yinghui
    Zhong, Xian
    Feng, Wei
    Chan, Jonathan Cheung-Wai
    Li, Qiang
    Xing, Mengdao
    REMOTE SENSING, 2021, 13 (03) : 1 - 25
  • [7] Selective pre-processing of imbalanced data for improving classification performance
    Stefanowski, Jerzy
    Wilk, Szymon
    DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2008, 5182 : 283 - 292
  • [8] SPY: a novel resampling method for improving classification performance in imbalanced data
    Xuan Tho Dang
    Dang Hung Tran
    Hirose, Osamu
    Satou, Kenji
    2015 SEVENTH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE), 2015, : 280 - 285
  • [9] Classification with local clustering in imbalanced data sets
    Ji, Hua
    Zhang, Huaxiang
    ADVANCED RESEARCH ON INFORMATION SCIENCE, AUTOMATION AND MATERIAL SYSTEM, PTS 1-6, 2011, 219-220 : 151 - 155
  • [10] Classwise Clustering for Classification of Imbalanced Text Data
    Swarnalatha, K.
    Guru, D. S.
    Anami, Basavaraj S.
    Suhil, Mahamad
    EMERGING RESEARCH IN ELECTRONICS, COMPUTER SCIENCE AND TECHNOLOGY, ICERECT 2018, 2019, 545 : 83 - 94