Improving Rotation Forest Performance for Imbalanced Data Classification through Fuzzy Clustering

被引:0
|
作者
Hosseinzadeh, Mehrdad [1 ,2 ]
Eftekhari, Mahdi [1 ]
机构
[1] Shahid Bahonar Univ Kerman, Dept Comp Engn, Kerman, Iran
[2] Shahid Bahonar Univ Kerman, Young Researchers Assoc, Kerman, Iran
关键词
Imbalanced Data Classification; Ensemble Learning; Fuzzy Clustering;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, fuzzy C-means clustering and Rotation Forest (RF) are combined to construct a high performance classifier for imbalanced data classification. Data samples are clustered via fuzzy clustering and then fuzzy membership function matrix is added into data samples. Therefore, clusters memberships of samples are utilized as new features that are added into the original features. After that, RF is utilized for classification where the new set of features as well as the original ones are taken into account in the feature subspacing phase. The proposed algorithm utilizes SMOTE oversampling algorithm for balancing data samples. The obtained results confirm that our proposed method outperforms the other well-known bagging algorithms.
引用
收藏
页码:35 / 40
页数:6
相关论文
共 50 条
  • [41] Overlap-Based Undersampling for Improving Imbalanced Data Classification
    Vuttipittayamongkol, Pattaramon
    Elyan, Eyad
    Petrovski, Andrei
    Jayne, Chrisina
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2018, PT I, 2018, 11314 : 689 - 697
  • [42] RSMOTE: improving classification performance over imbalanced medical datasets
    Naseriparsa, Mehdi
    Al-Shammari, Ahmed
    Sheng, Ming
    Zhang, Yong
    Zhou, Rui
    HEALTH INFORMATION SCIENCE AND SYSTEMS, 2020, 8 (01)
  • [43] Improving SVM Classification on Imbalanced Data Sets in Distance Spaces
    Koeknar-Tezel, Suzan
    Latecki, Longin Jan
    2009 9TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2009, : 259 - +
  • [44] Improving the Performance of Sentiment Classification on Imbalanced Datasets With Transfer Learning
    Xiao, Z.
    Wang, L.
    Du, J. Y.
    IEEE ACCESS, 2019, 7 : 28281 - 28290
  • [45] RSMOTE: improving classification performance over imbalanced medical datasets
    Mehdi Naseriparsa
    Ahmed Al-Shammari
    Ming Sheng
    Yong Zhang
    Rui Zhou
    Health Information Science and Systems, 8
  • [46] Dynamic Synthetic Minority Over-Sampling Technique-Based Rotation Forest for the Classification of Imbalanced Hyperspectral Data
    Feng, Wei
    Dauphin, Gabriel
    Huang, Wenjiang
    Quan, Yinghui
    Bao, Wenxing
    Wu, Mingquan
    Li, Qiang
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2019, 12 (07) : 2159 - 2169
  • [47] MCBC-SMOTE: A Majority Clustering Model for Classification of Imbalanced Data
    Arora, Jyoti
    Tushir, Meena
    Sharma, Keshav
    Mohan, Lalit
    Singh, Aman
    Alharbi, Abdullah
    Alosaimi, Wael
    Computers, Materials and Continua, 2022, 73 (03): : 4801 - 4817
  • [48] MCBC-SMOTE: A Majority Clustering Model for Classification of Imbalanced Data
    Arora, Jyoti
    Tushir, Meena
    Sharma, Keshav
    Mohan, Lalit
    Singh, Aman
    Alharbi, Abdullah
    Alosaimi, Wael
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (03): : 4801 - 4817
  • [49] Clustering and classification of fuzzy data using the fuzzy EM algorithm
    Quost, Benjamin
    Denoeux, Thierry
    FUZZY SETS AND SYSTEMS, 2016, 286 : 134 - 156
  • [50] Impact of Clustering on a Synthetic Instance Generation in Imbalanced Data Streams Classification
    Czarnowski, Ireneusz
    Martins, Denis Mayr Lima
    COMPUTATIONAL SCIENCE, ICCS 2022, PT II, 2022, : 586 - 597