Improved SMOTE algorithm for imbalanced dataset

被引:0
|
作者
Zheng Hengyu [1 ]
机构
[1] South China Univ Technol, Sch Automat Sci & Engn, Guangzhou, Peoples R China
关键词
SMOTE; Unbalanced dataset; SVM; Confusion Matrix;
D O I
10.1109/CAC51589.2020.9326603
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
When applying traditional classifiers to imbalanced dataset, the result might be bias towards the majority class, which leads to poor performance of classifiers. Synthetic Minority Oversampling Technique(SMOTE) is a popular algorithm to improve the classifier's performance through generating new minority samples and making dataset balanced. Based on SMOTE, two new over-sampling algorithms DSMOTE and ESMOTE are proposed in this paper. Being different with SMOTE which treats all minority samples equally, the two new over-sampling algorithms mainly synthesize new samples near the easily misclassified samples to improve the classification accuracy of minority class. Experiments show that DSMOTE and ESMOTE could both get better performance than SMOTE.
引用
收藏
页码:693 / 697
页数:5
相关论文
共 50 条
  • [41] A Classification Method for Imbalanced Data Based on SMOTE and Fuzzy Rough Nearest Neighbor Algorithm
    Zhao, Weibin
    Xu, Mengting
    Jia, Xiuyi
    Shang, Lin
    ROUGH SETS, FUZZY SETS, DATA MINING, AND GRANULAR COMPUTING, RSFDGRC 2015, 2015, 9437 : 340 - 351
  • [42] Dynamic ensemble algorithm of SMOTE and rotation forest for imbalanced hyperspectral remote sensing classification
    Tong Y.
    Feng W.
    Song Y.
    Quan Y.
    Huang W.
    Gao L.
    Zhu W.
    Xing M.
    National Remote Sensing Bulletin, 2022, 26 (11) : 2369 - 2381
  • [43] A histogram SMOTE-based sampling algorithm with incremental learning for imbalanced data classification
    Liaw, Lawrence Chuin Ming
    Tan, Shing Chiang
    Goh, Pey Yun
    Lim, Chee Peng
    INFORMATION SCIENCES, 2025, 686
  • [44] Improving the Classification Quality of the SVM Classifier for the Imbalanced Datasets on the Base of Ideas the SMOTE Algorithm
    Demidova, Liliya
    Klyueva, Irina
    2017 SEMINAR ON SYSTEMS ANALYSIS, 2017, 10
  • [45] Adaptive SV-Borderline SMOTE-SVM algorithm for imbalanced data classification
    Guo, Jiaqi
    Wu, Haiyan
    Chen, Xiaolei
    Lin, Weiguo
    APPLIED SOFT COMPUTING, 2024, 150
  • [46] Safe-level SMOTE method for handling the class imbalanced problem in electroencephalography dataset of adult anxious state
    Daud, Syarifah Noor Syakiylla Sayed
    Sudirman, Rubita
    Shing, Tee Wee
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 83
  • [47] Predicting Extreme Financial Risks on Imbalanced Dataset: A Combined Kernel FCM and Kernel SMOTE Based SVM Classifier
    Huang, Xun
    Zhang, Cheng-Zhao
    Yuan, Jia
    COMPUTATIONAL ECONOMICS, 2020, 56 (01) : 187 - 216
  • [48] Predicting Extreme Financial Risks on Imbalanced Dataset: A Combined Kernel FCM and Kernel SMOTE Based SVM Classifier
    Xun Huang
    Cheng-Zhao Zhang
    Jia Yuan
    Computational Economics, 2020, 56 : 187 - 216
  • [49] A New Cost-sensitive SVM Algorithm for Imbalanced Dataset
    Zheng Hengyu
    2021 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS AND COMPUTER ENGINEERING (ICCECE), 2021, : 402 - 407
  • [50] A-SMOTE: A New Preprocessing Approach for Highly Imbalanced Datasets by Improving SMOTE
    Ahmed Saad Hussein
    Tianrui Li
    Chubato Wondaferaw Yohannese
    Kamal Bashir
    International Journal of Computational Intelligence Systems, 2019, 12 : 1412 - 1422