Improved SMOTE algorithm for imbalanced dataset

Cited by: 0
Authors
Zheng Hengyu [1]
Affiliations
[1] South China Univ Technol, Sch Automat Sci & Engn, Guangzhou, Peoples R China
Keywords
SMOTE; Unbalanced dataset; SVM; Confusion Matrix
DOI
10.1109/CAC51589.2020.9326603
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
When traditional classifiers are applied to an imbalanced dataset, the results may be biased towards the majority class, which degrades classifier performance. The Synthetic Minority Oversampling Technique (SMOTE) is a popular algorithm that improves classifier performance by generating new minority samples to balance the dataset. Based on SMOTE, this paper proposes two new over-sampling algorithms, DSMOTE and ESMOTE. Unlike SMOTE, which treats all minority samples equally, the two new algorithms synthesize new samples mainly near easily misclassified samples in order to improve classification accuracy on the minority class. Experiments show that both DSMOTE and ESMOTE achieve better performance than SMOTE.
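The interpolation step that SMOTE relies on can be sketched as follows. This is a minimal NumPy illustration of standard SMOTE, not the paper's DSMOTE or ESMOTE, whose details the abstract does not give. The function name `smote` and the `seed_weights` parameter are assumptions introduced here: `seed_weights` is a hypothetical hook showing how sampling could be concentrated on chosen minority points (for example, ones judged easy to misclassify), and how such weights would be computed is left open.

```python
import numpy as np

def smote(minority, n_new, k=5, seed_weights=None, rng=None):
    """Create n_new synthetic points by interpolating between minority samples.

    seed_weights is a hypothetical extension (not from the paper): per-sample
    weights that bias which minority points are used as interpolation seeds.
    """
    rng = np.random.default_rng(rng)
    minority = np.asarray(minority, dtype=float)
    n = len(minority)
    k = min(k, n - 1)  # cannot have more neighbours than other points

    # Pairwise Euclidean distances within the minority class.
    diff = minority[:, None, :] - minority[None, :, :]
    dist = np.linalg.norm(diff, axis=2)
    np.fill_diagonal(dist, np.inf)  # a point is not its own neighbour

    # Indices of the k nearest minority neighbours of every minority point.
    neighbours = np.argsort(dist, axis=1)[:, :k]

    # Standard SMOTE draws seeds uniformly; weights (if given) skew the draw.
    p = None
    if seed_weights is not None:
        p = np.asarray(seed_weights, dtype=float)
        p = p / p.sum()
    seeds = rng.choice(n, size=n_new, p=p)

    # For each seed, pick one of its k neighbours at random.
    picks = neighbours[seeds, rng.integers(0, k, size=n_new)]

    # New point = seed + gap * (neighbour - seed), with gap in [0, 1).
    gaps = rng.random((n_new, 1))
    return minority[seeds] + gaps * (minority[picks] - minority[seeds])
```

Calling `smote(X_min, n_new=100)` with uniform seeds corresponds to treating all minority samples equally, as the abstract describes for SMOTE; passing non-uniform `seed_weights` places more synthetic points around the favoured samples.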
Pages: 693-697
Page count: 5
Related papers
50 in total
  • [21] An Improved SMOTE Imbalanced Data Classification Method Based on Support Degree
    Li, Kewen
    Zhang, Wenrong
    Lu, Qinghua
    Fang, Xianghua
    2014 INTERNATIONAL CONFERENCE ON IDENTIFICATION, INFORMATION AND KNOWLEDGE IN THE INTERNET OF THINGS (IIKI 2014), 2014, : 34 - 38
  • [22] Classification of Imbalanced Data by Combining the Complementary Neural Network and SMOTE Algorithm
    Jeatrakul, Piyasak
    Wong, Kok Wai
    Fung, Chun Che
    NEURAL INFORMATION PROCESSING: MODELS AND APPLICATIONS, PT II, 2010, 6444 : 152 - 159
  • [23] Classification of imbalanced data by using the SMOTE algorithm and locally linear embedding
    Wang, Juanjuan
    Xu, Mantao
    Wang, Hui
    Zhang, Jiwu
    2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 1815 - +
  • [24] Selective Ensemble Learning Algorithm for Imbalanced Dataset
    Du, Hongle
    Zhang, Yan
    Zhang, Lin
    Chen, Yeh-Cheng
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2023, 20 (02) : 831 - 856
  • [25] A Novel Region Adaptive SMOTE Algorithm for Intrusion Detection on Imbalanced Problem
    Yan, BingHao
    Han, GuoDong
    Sun, MeiDong
    Ye, ShengZhao
    PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 1281 - 1286
  • [26] Bronze Inscriptions Classification Algorithm On Imbalanced Dataset
    He, Jiayuan
    Zhu, Qingting
    Chen, Youguang
    Nie, Fan
    2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, : 1715 - 1718
  • [27] A selective ensemble learning algorithm for imbalanced dataset
    Hongle, Du
    Yan, Zhang
    Gang, Ke
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021,
  • [28] A New Safe-Level Enabled Borderline-SMOTE for Condition Recognition of Imbalanced Dataset
    Chen, Chao
    Shen, Wei
    Yang, Chenhao
    Fan, Wei
    Liu, Xin
    Li, Ying
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [29] SVM Classification of Microaneurysms with Imbalanced Dataset Based on Borderline-SMOTE and Data Cleaning Techniques
    Wang, Qingjie
    Xin, Jingmin
    Wu, Jiayi
    Zheng, Nanning
    NINTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2016), 2017, 10341
  • [30] AGNES-SMOTE: An Oversampling Algorithm Based on Hierarchical Clustering and Improved SMOTE
    Wang, Xin
    Yang, Yue
    Chen, Mingsong
    Wang, Qin
    Qin, Qin
    Jiang, Hua
    Wang, Huijiao
    SCIENTIFIC PROGRAMMING, 2020, 2020