Learning from Imbalanced Data Using Methods of Sample Selection

Cited by: 0
Authors
Chairi, Ikram [1]
Alaoui, Souad [1]
Lyhyaoui, Abdelouahid [1]
Affiliation
[1] Abdelmalek Essaadi Univ, LTiLab, ENSA Tangier, Tanger Principal Tanger, Morocco
Keywords
Imbalanced data; Multi-Layer Perceptron; sample selection
DOI
Not available
Chinese Library Classification
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Most machine learning (ML) algorithms assume that the training sets used for learning are balanced. In real-world applications, however, this assumption does not always hold. Between-class imbalance has attracted growing attention from both academia and industry because of its critical influence on the performance of machine learning. Many solutions have been proposed; the most common practice is to rebalance imbalanced data sets artificially using sampling methods. Separately, research shows that Sample Selection (SS) methods help to improve accuracy during the learning process. The main idea of our work is to apply a Sample Selection technique to the majority class in order to undersample the imbalanced data. This procedure makes it possible to deal with the imbalance problem and to improve learning performance.
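The abstract does not say which selection criterion the authors use, so the following is only a minimal sketch of the general idea of undersampling the majority class by selecting a subset of its samples. It assumes, purely for illustration, that the majority samples closest to the minority-class centroid are the ones worth keeping (a rough proxy for samples near the class boundary). The function name `select_majority_samples` and its parameters are hypothetical and do not come from the paper.

```python
import numpy as np


def select_majority_samples(X, y, majority_label, n_keep):
    """Undersample the majority class by sample selection.

    Assumption (not from the paper): keep the n_keep majority samples
    that lie closest to the centroid of the minority samples.
    All minority samples are kept unchanged.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)

    maj_idx = np.flatnonzero(y == majority_label)
    min_idx = np.flatnonzero(y != majority_label)

    # Centroid of the minority class in feature space.
    centroid = X[min_idx].mean(axis=0)

    # Euclidean distance of every majority sample to that centroid.
    dist = np.linalg.norm(X[maj_idx] - centroid, axis=1)

    # Indices of the n_keep closest majority samples.
    keep = maj_idx[np.argsort(dist, kind="stable")[:n_keep]]

    # Merge with the minority samples, preserving original order.
    selected = np.sort(np.concatenate([keep, min_idx]))
    return X[selected], y[selected]


# Synthetic example: 200 majority samples (label 0), 20 minority (label 1).
rng = np.random.default_rng(0)
X = np.vstack([
    rng.normal(0.0, 1.0, size=(200, 2)),
    rng.normal(2.0, 1.0, size=(20, 2)),
])
y = np.concatenate([np.zeros(200, dtype=int), np.ones(20, dtype=int)])

X_bal, y_bal = select_majority_samples(X, y, majority_label=0, n_keep=20)
```

The rebalanced arrays `X_bal` and `y_bal` could then be passed to any classifier, for instance a multi-layer perceptron as the keywords suggest; a different selection criterion can be swapped in by changing how `dist` is computed.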
Pages: 256-259
Number of pages: 4