Comparison Of The Different Sampling Techniques For Imbalanced Classification Problems In Machine Learning

被引:5
|
作者
Peng Zhihao [1 ]
Yan Fenglong [1 ]
Li Xucheng [1 ]
机构
[1] Dalian Neusoft Univ Informat, Sch Comp & Software, Dalian 116626, Peoples R China
关键词
Machine Learning; Imbalanced Classification; Datasets;
D O I
10.1109/ICMTMA.2019.00101
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Imbalanced class distribution is a scenario where the number of observations belonging to one class is significantly lower than those belonging to the other ones. Machine learning algorithms are often designed to improve accuracy by reducing the errors. Thus, they do not consider the class distribution proportion or the balance of classes. In this paper, firstly, we describes the various approaches for solving such class imbalance problems, using various sampling techniques. Then we weigh each technique for its pros and cons. Finally, an approach purpose is revealed in which you can create a balanced class distribution and apply ensemble learning technique designed especially for imbalanced class distribution.
引用
收藏
页码:431 / 434
页数:4
相关论文
共 50 条
  • [31] A review on over-sampling techniques in classification of multi-class imbalanced datasets: insights for medical problems
    Yang, Yuxuan
    Khorshidi, Hadi Akbarzadeh
    Aickelin, Uwe
    FRONTIERS IN DIGITAL HEALTH, 2024, 6
  • [32] Classification for Imbalanced and Overlapping Classes Using Outlier Detection and Sampling Techniques
    Yang, Zeping
    Gao, Daqi
    APPLIED MATHEMATICS & INFORMATION SCIENCES, 2013, 7 : 375 - 381
  • [33] Handling Imbalanced Classification Problems by Weighted Generalization Memorization Machine
    Dou, Chen
    Lv, Yan
    Wang, Zhen
    Bai, Lan
    APPLIED ARTIFICIAL INTELLIGENCE, 2024, 38 (01)
  • [34] A New Hybrid Under-sampling Approach to Imbalanced Classification Problems
    Peng, Chun-Yang
    Park, You-Jin
    APPLIED ARTIFICIAL INTELLIGENCE, 2022, 36 (01)
  • [35] Comparison of different machine learning techniques in river flow prediction
    Akbulut, Ugur
    Cifci, Mehmet Akif
    Isler, Buket
    Aslan, Zafer
    JOURNAL OF THE FACULTY OF ENGINEERING AND ARCHITECTURE OF GAZI UNIVERSITY, 2025, 40 (01): : 467 - 485
  • [36] Active learning with extreme learning machine for online imbalanced multiclass classification
    Qin, Jiongming
    Wang, Cong
    Zou, Qinhong
    Sun, Yubin
    Chen, Bin
    KNOWLEDGE-BASED SYSTEMS, 2021, 231
  • [37] Human locomotion classification for different terrains using machine learning techniques
    Negi S.
    Negi P.C.B.S.
    Sharma S.
    Sharma N.
    Critical Reviews in Biomedical Engineering, 2020, 48 (04) : 199 - 209
  • [38] An Empirical Comparison of Individual Machine Learning Techniques in Signature and Fingerprint Classification
    Abreu, Marjory
    Fairhurst, Michael
    BIOMETRICS AND IDENTITY MANAGEMENT, 2008, 5372 : 130 - 139
  • [39] Comparison of Supervised Machine Learning Techniques for PD Classification in Generator Insulation
    Herath, H. M. M. G. T.
    Kumara, J. R. S. S.
    Fernando, M. A. R. M.
    Bandara, K. M. K. S.
    Serina, Ivan
    2017 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS (ICIIS), 2017, : 290 - 295
  • [40] Classification of imbalanced datasets utilizing the synthetic minority oversampling method in conjunction with several machine learning techniques
    Shrayasi Datta
    Chinmoy Ghosh
    J. Pal Choudhury
    Iran Journal of Computer Science, 2025, 8 (1) : 51 - 68