Learning from Imbalanced Data Using Methods of Sample Selection

被引:0
|
作者
Chairi, Ikram [1 ]
Alaoui, Souad [1 ]
Lyhyaoui, Abdelouahid [1 ]
机构
[1] Abdelmalek Essaadi Univ, LTiLab, ENSA Tangier, Tanger Principal Tanger, Morocco
关键词
Imbalanced data; Multi-Layer Perceptron; sample selection;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The majority of Machine Learning (ML) habitually assume that the training sets used for learning are balanced. However, in real world application this hypothesis is not always true. The problem of between-class imbalance is a challenge that has attracted growing attention from both academia and industry because of his critical influence on the performance of machine learning. Many solutions are proposed to resolve this problem: Generally, the common practice for dealing with imbalanced data sets is to rebalance them artificially by using sampling methods. On the other hand, researches show that Sample Selection (SS) methods help to improve the accuracy during the learning process. The main idea of our work is to apply a technique of Sample Selection on the majority class to achieve an undersampling for the imbalanced data. This procedure consent to deal with the imbalance problem and to improve the performance of learning.
引用
收藏
页码:256 / 259
页数:4
相关论文
共 50 条
  • [41] SetConv: A New Approach for Learning from Imbalanced Data
    Gao, Yang
    Li, Yi-Fan
    Lin, Yu
    Aggarwal, Charu
    Khan, Latifur
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 1284 - 1294
  • [42] Positive-Unlabeled Learning from Imbalanced Data
    Su, Guangxin
    Chen, Weitong
    Xu, Miao
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 2995 - 3001
  • [43] Sample-level Data Selection for Federated Learning
    Li, Anran
    Zhang, Lan
    Tan, Juntao
    Qin, Yaxuan
    Wang, Junhao
    Li, Xiang-Yang
    IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2021), 2021,
  • [44] TRAINING SAMPLE SELECTION FOR DEEP LEARNING OF DISTRIBUTED DATA
    Jiang, Zheng
    Zhu, Xiaoqing
    Tan, Wai-tian
    Liston, Rob
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 2189 - 2193
  • [45] Online Learning From Incomplete and Imbalanced Data Streams
    You, Dianlong
    Xiao, Jiawei
    Wang, Yang
    Yan, Huigui
    Wu, Di
    Chen, Zhen
    Shen, Limin
    Wu, Xindong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (10) : 10650 - 10665
  • [46] Learning from the imbalanced data based on quantum evolutionary
    Zhang, C. (jsj_zcs@126.com), 1725, ICIC Express Letters Office (08):
  • [47] Learning from imbalanced data in surveillance of nosocomial infection
    Cohen, Gilles
    Hilario, Melanie
    Sax, Hugo
    Hugonnet, Stephane
    Geissbuhler, Antoine
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2006, 37 (01) : 7 - 18
  • [48] A New Evaluation Measure for Learning from Imbalanced Data
    Thai-Nghe, Nguyen
    Gantner, Zeno
    Schmidt-Thieme, Lars
    2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 537 - 542
  • [49] Detection of attacks on software defined networks using machine learning techniques and imbalanced data handling methods
    Hassan, Heba A.
    Hemdan, Ezz El-Din
    El-Shafai, Walid
    Shokair, Mona
    Abd El-Samie, Fathi E.
    SECURITY AND PRIVACY, 2024, 7 (02)
  • [50] Cervical Cancer Prediction Based on Imbalanced Data Using Machine Learning Algorithms with a Variety of Sampling Methods
    Muraru, Madalina Maria
    Simo, Zsuzsa
    Iantovics, Laszlo Barna
    APPLIED SCIENCES-BASEL, 2024, 14 (22):