Comparison Of The Different Sampling Techniques For Imbalanced Classification Problems In Machine Learning

被引:5
|
作者
Peng Zhihao [1 ]
Yan Fenglong [1 ]
Li Xucheng [1 ]
机构
[1] Dalian Neusoft Univ Informat, Sch Comp & Software, Dalian 116626, Peoples R China
关键词
Machine Learning; Imbalanced Classification; Datasets;
D O I
10.1109/ICMTMA.2019.00101
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Imbalanced class distribution is a scenario where the number of observations belonging to one class is significantly lower than those belonging to the other ones. Machine learning algorithms are often designed to improve accuracy by reducing the errors. Thus, they do not consider the class distribution proportion or the balance of classes. In this paper, firstly, we describes the various approaches for solving such class imbalance problems, using various sampling techniques. Then we weigh each technique for its pros and cons. Finally, an approach purpose is revealed in which you can create a balanced class distribution and apply ensemble learning technique designed especially for imbalanced class distribution.
引用
收藏
页码:431 / 434
页数:4
相关论文
共 50 条
  • [1] Exploring Data Sampling Techniques for Imbalanced Classification Problems
    Sui, Yu
    Zhang, Xiaohui
    Huan, Jiajia
    Hong, Haifeng
    FOURTH INTERNATIONAL WORKSHOP ON PATTERN RECOGNITION, 2019, 11198
  • [2] Sampling Approaches for Imbalanced Data Classification Problem in Machine Learning
    Tyagi, Shivani
    Mittal, Sangeeta
    PROCEEDINGS OF RECENT INNOVATIONS IN COMPUTING, ICRIC 2019, 2020, 597 : 209 - 221
  • [3] Experimental Comparison of Sampling Techniques for Imbalanced Datasets Using Various Classification Models
    Pattanayak, Sanjibani Sudha
    Rout, Minakhi
    PROGRESS IN ADVANCED COMPUTING AND INTELLIGENT ENGINEERING, VOL 2, 2018, 564 : 13 - 22
  • [4] Comparison of Different Classification Algorithms for Prediction of Heart Disease by Machine Learning Techniques
    Harshitha B.
    Maria Rufina P.
    Shilpa B.L.
    SN Computer Science, 4 (2)
  • [5] Comparison of Machine Learning Algorithms for Classification Problems
    Sekeroglu, Boran
    Hasan, Shakar Sherwan
    Abdullah, Saman Mirza
    ADVANCES IN COMPUTER VISION, VOL 2, 2020, 944 : 491 - 499
  • [6] A Comparison of Re-sampling Techniques for Pattern Classification in Imbalanced Data-Sets
    Saul, Marcia Amstelvina
    Rostami, Shahin
    ADVANCES IN COMPUTATIONAL INTELLIGENCE SYSTEMS (UKCI), 2019, 840 : 240 - 251
  • [7] Self-adaptive Weighted Extreme Learning Machine for Imbalanced Classification Problems
    Long, Hao
    He, Yulin
    Huang, Joshua Zhexue
    Wang, Qiang
    TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING, 2017, 2017, 10526 : 116 - 128
  • [8] Comparison of Machine Learning Techniques on Twitter Emotions Classification
    S. Santhosh Baboo
    M. Amirthapriya
    SN Computer Science, 2022, 3 (1)
  • [9] COMPARISON OF MACHINE LEARNING TECHNIQUES IN PHISHING WEBSITE CLASSIFICATION
    Hodzic, Adnan
    Kevric, Jasmin
    Karadag, Adem
    INTERNATIONAL CONFERENCE ON ECONOMIC AND SOCIAL STUDIES (ICESOS'16): REGIONAL ECONOMIC DEVELOPMENT: ENTREPNEURSHIP AND INNOVATION, 2016, : 249 - 256
  • [10] An empirical comparison of machine learning techniques for chant classification
    Kokkinidis, K.
    Mastoras, T.
    Tsagaris, A.
    Fotaris, P.
    2018 7TH INTERNATIONAL CONFERENCE ON MODERN CIRCUITS AND SYSTEMS TECHNOLOGIES (MOCAST), 2018,