Comparison Of The Different Sampling Techniques For Imbalanced Classification Problems In Machine Learning

被引:5
|
作者
Peng Zhihao [1 ]
Yan Fenglong [1 ]
Li Xucheng [1 ]
机构
[1] Dalian Neusoft Univ Informat, Sch Comp & Software, Dalian 116626, Peoples R China
关键词
Machine Learning; Imbalanced Classification; Datasets;
D O I
10.1109/ICMTMA.2019.00101
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Imbalanced class distribution is a scenario where the number of observations belonging to one class is significantly lower than those belonging to the other ones. Machine learning algorithms are often designed to improve accuracy by reducing the errors. Thus, they do not consider the class distribution proportion or the balance of classes. In this paper, firstly, we describes the various approaches for solving such class imbalance problems, using various sampling techniques. Then we weigh each technique for its pros and cons. Finally, an approach purpose is revealed in which you can create a balanced class distribution and apply ensemble learning technique designed especially for imbalanced class distribution.
引用
收藏
页码:431 / 434
页数:4
相关论文
共 50 条
  • [21] Comparison of Machine Learning Techniques for Fetal Heart Rate Classification
    Comert, Z.
    Kocamaz, A. F.
    ACTA PHYSICA POLONICA A, 2017, 132 (03) : 451 - 454
  • [22] Comparison on Some Machine Learning Techniques in Breast Cancer Classification
    Mashudi, Nurul Amirah
    Rossli, Syaidathul Amaleena
    Ahmad, Norulhusna
    Noor, Norliza Mohd
    2020 IEEE-EMBS CONFERENCE ON BIOMEDICAL ENGINEERING AND SCIENCES (IECBES 2020): LEADING MODERN HEALTHCARE TECHNOLOGY ENHANCING WELLNESS, 2021, : 499 - 504
  • [23] Predicting Fraud in Mobile Money Transactions using Machine Learning: The Effects of Sampling Techniques on the Imbalanced Dataset
    Botchey, Francis E.
    Qin, Zhen
    Hughes-Lartey, Kwesi
    Ampomah, Kwame E.
    INFORMATICA-AN INTERNATIONAL JOURNAL OF COMPUTING AND INFORMATICS, 2021, 45 (07): : 45 - 56
  • [24] A comparative analysis of machine learning techniques for imbalanced data
    Mrad, Ali Ben
    Lahiani, Amine
    Mefteh-Wali, Salma
    Mselmi, Nada
    ANNALS OF OPERATIONS RESEARCH, 2024,
  • [25] An Improved Extreme Learning Machine for Imbalanced Data Classification
    Zhang, Xiaopeng
    Qin, Liangxi
    IEEE ACCESS, 2022, 10 : 8634 - 8642
  • [26] Imbalanced Classification in Diabetics Using Ensembled Machine Learning
    Kumar, M. Sandeep
    Khan, Mohammad Zubair
    Rajendran, Sukumar
    Noor, Ayman
    Dass, A. Stephen
    Prabhu, J.
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (03): : 4397 - 4409
  • [27] A transfer weighted extreme learning machine for imbalanced classification
    Guo, Yinan
    Jiao, Botao
    Tan, Ying
    Zhang, Pei
    Tang, Fengzhen
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (10) : 7685 - 7705
  • [28] Machine Learning Techniques for Solving Classification Problems with Missing Input Data
    Garcia-Laencina, Pedro J.
    Sancho-Gomez, Jose-Luis
    Figueiras-Vidal, Anibal R.
    WMSCI 2008: 12TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL V, PROCEEDINGS, 2008, : 12 - +
  • [29] Imbalanced data classification: Using transfer learning and active sampling
    Liu, Yang
    Yang, Guoping
    Qiao, Shaojie
    Liu, Meiqi
    Qu, Lulu
    Han, Nan
    Wu, Tao
    Yuan, Guan
    Peng, Yuzhong
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 117
  • [30] Comparison of Sampling Methods for Imbalanced Data Classification in Random Forest
    Paing, May Phu
    Pintavirooj, C.
    Tungjitkusolmun, Supan
    Choomchuay, Somsak
    Hamamoto, Kazuhiko
    2018 11TH BIOMEDICAL ENGINEERING INTERNATIONAL CONFERENCE (BMEICON 2018), 2018,