Improved SMOTE algorithm for imbalanced dataset

被引:0
|
作者
Zheng Hengyu [1 ]
机构
[1] South China Univ Technol, Sch Automat Sci & Engn, Guangzhou, Peoples R China
关键词
SMOTE; Unbalanced dataset; SVM; Confusion Matrix;
D O I
10.1109/CAC51589.2020.9326603
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
When applying traditional classifiers to imbalanced dataset, the result might be bias towards the majority class, which leads to poor performance of classifiers. Synthetic Minority Oversampling Technique(SMOTE) is a popular algorithm to improve the classifier's performance through generating new minority samples and making dataset balanced. Based on SMOTE, two new over-sampling algorithms DSMOTE and ESMOTE are proposed in this paper. Being different with SMOTE which treats all minority samples equally, the two new over-sampling algorithms mainly synthesize new samples near the easily misclassified samples to improve the classification accuracy of minority class. Experiments show that DSMOTE and ESMOTE could both get better performance than SMOTE.
引用
收藏
页码:693 / 697
页数:5
相关论文
共 50 条
  • [1] Dealing with Imbalanced Dataset: A Re-sampling Method Based on the Improved SMOTE Algorithm
    Xue, Wei
    Zhang, Jing
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2016, 45 (04) : 1160 - 1172
  • [2] Research on data mining method for imbalanced dataset based on improved SMOTE
    Yang, Zhi-Ming
    Qiao, Li-Yan
    Peng, Xi-Yuan
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2007, 36 (SUPPL. 2): : 22 - 26
  • [3] Improving Emotion Classification in Imbalanced YouTube Dataset Using SMOTE Algorithm
    Sarakit, Phakhawat
    Theeramunkong, Thanaruk
    Haruechaiyasak, Choochart
    2015 2ND INTERNATIONAL CONFERENCE ON ADVANCED INFORMATICS: CONCEPTS, THEORY AND APPLICATIONS ICAICTA, 2015,
  • [4] Ensemble classification algorithm based improved SMOTE for imbalanced data
    Ning, Liu, 1600, Natsional'nyi Hirnychyi Universytet
  • [5] Improved SMOTE Algorithm to Deal with Imbalanced Activity Classes in Smart Homes
    Shikai Guo
    Yaqing Liu
    Rong Chen
    Xiao Sun
    Xiangxin Wang
    Neural Processing Letters, 2019, 50 : 1503 - 1526
  • [6] Improved SMOTE Algorithm to Deal with Imbalanced Activity Classes in Smart Homes
    Guo, Shikai
    Liu, Yaqing
    Chen, Rong
    Sun, Xiao
    Wang, Xiangxin
    NEURAL PROCESSING LETTERS, 2019, 50 (02) : 1503 - 1526
  • [7] Explainability of SMOTE Based Oversampling for Imbalanced Dataset Problems
    Patil, Aum
    Framewala, Aman
    Kazi, Faruk
    2020 3RD INTERNATIONAL CONFERENCE ON INFORMATION AND COMPUTER TECHNOLOGIES (ICICT 2020), 2020, : 41 - 45
  • [8] An Improved Random Forest Algorithm for classification in an imbalanced dataset.
    Jose, Christy
    Gopakumar, G.
    2019 URSI ASIA-PACIFIC RADIO SCIENCE CONFERENCE (AP-RASC), 2019,
  • [9] An Improved Measurement of the Imbalanced Dataset
    Zhang, Chunkai
    Zhou, Ying
    Chen, Yingyang
    Qi, Changqing
    Wang, Xuan
    Dong, Lifeng
    CLOUD COMPUTING - CLOUD 2018, 2018, 10967 : 365 - 376
  • [10] Tomek Link and SMOTE Approaches for Machine Fault Classification with an Imbalanced Dataset
    Swana, Elsie Fezeka
    Doorsamy, Wesley
    Bokoro, Pitshou
    SENSORS, 2022, 22 (09)