Comparison Of The Different Sampling Techniques For Imbalanced Classification Problems In Machine Learning

被引:5
|
作者
Peng Zhihao [1 ]
Yan Fenglong [1 ]
Li Xucheng [1 ]
机构
[1] Dalian Neusoft Univ Informat, Sch Comp & Software, Dalian 116626, Peoples R China
关键词
Machine Learning; Imbalanced Classification; Datasets;
D O I
10.1109/ICMTMA.2019.00101
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Imbalanced class distribution is a scenario where the number of observations belonging to one class is significantly lower than those belonging to the other ones. Machine learning algorithms are often designed to improve accuracy by reducing the errors. Thus, they do not consider the class distribution proportion or the balance of classes. In this paper, firstly, we describes the various approaches for solving such class imbalance problems, using various sampling techniques. Then we weigh each technique for its pros and cons. Finally, an approach purpose is revealed in which you can create a balanced class distribution and apply ensemble learning technique designed especially for imbalanced class distribution.
引用
收藏
页码:431 / 434
页数:4
相关论文
共 50 条
  • [41] IMBALANCED DATA CLASSIFICATION BASED ON EXTREME LEARNING MACHINE AUTOENCODER
    Shen, Chu
    Zhang, Su-Fang
    Zhai, Jun-Hal
    Luo, Ding-Sheng
    Chen, Jun-Fen
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL 2, 2018, : 399 - 404
  • [42] The Imbalanced Classification of Fraudulent Bank Transactions Using Machine Learning
    Ruchay, Alexey
    Feldman, Elena
    Cherbadzhi, Dmitriy
    Sokolov, Alexander
    MATHEMATICS, 2023, 11 (13)
  • [43] Laplacian least learning machine with dynamic updating for imbalanced classification
    Zhou, Jie
    Jiang, Zhibin
    Wang, Shitong
    APPLIED SOFT COMPUTING, 2020, 88 (88)
  • [44] Parameter-Free Extreme Learning Machine for Imbalanced Classification
    Li, Li
    Zhao, Kaiyi
    Sun, Ruizhi
    Gan, Jiangzhang
    Yuan, Gang
    Liu, Tong
    NEURAL PROCESSING LETTERS, 2020, 52 (03) : 1927 - 1944
  • [45] An improved weighted extreme learning machine for imbalanced data classification
    Lu, Chengbo
    Ke, Haifeng
    Zhang, Gaoyan
    Mei, Ying
    Xu, Huihui
    MEMETIC COMPUTING, 2019, 11 (01) : 27 - 34
  • [46] A multi-manifold learning based instance weighting and under-sampling for imbalanced data classification problems
    Tayyebe Feizi
    Mohammad Hossein Moattar
    Hamid Tabatabaee
    Journal of Big Data, 10
  • [47] A multi-manifold learning based instance weighting and under-sampling for imbalanced data classification problems
    Feizi, Tayyebe
    Moattar, Mohammad Hossein
    Tabatabaee, Hamid
    JOURNAL OF BIG DATA, 2023, 10 (01)
  • [48] Parameter-Free Extreme Learning Machine for Imbalanced Classification
    Li Li
    Kaiyi Zhao
    Ruizhi Sun
    Jiangzhang Gan
    Gang Yuan
    Tong Liu
    Neural Processing Letters, 2020, 52 : 1927 - 1944
  • [49] An improved weighted extreme learning machine for imbalanced data classification
    Chengbo Lu
    Haifeng Ke
    Gaoyan Zhang
    Ying Mei
    Huihui Xu
    Memetic Computing, 2019, 11 : 27 - 34
  • [50] HSDLM: A Hybrid Sampling With Deep Learning Method for Imbalanced Data Classification
    Hasib, Khan Md
    Towhid, Nurul Akter
    Islam, Md Rafiqul
    INTERNATIONAL JOURNAL OF CLOUD APPLICATIONS AND COMPUTING, 2021, 11 (04) : 1 - 13