Improving ANN Performance for Imbalanced Data Sets by Means of the NTIL Technique

被引:0
|
作者
Vivaracho-Pascual, Carlos [1 ]
Simon-Hurtado, Arancha [1 ]
机构
[1] Univ Valladolid, Dept Comp Sci, E-47002 Valladolid, Spain
关键词
RECOGNITION; CLASSIFIER;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper deals with the problem of training an Artificial Neural Network (ANN) when the data sets are very imbalanced. Most learning algorithms, including ANN, are designed for well-balanced data and do not work properly on imbalanced ones. Of the approaches proposed for dealing with this problem, we are interested in the re-sampling ones, since they are algorithm-independent. We have recently proposed a new under-sampling technique for the two-class problem, called Non-Target Incremental Learning (NTIL), which has shown a good performance with SVM, improving results and training speed. Here, the advantages of using this technique with ANN are shown. The performance with regard to other popular under-sampling techniques is compared.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] Suppressed possibilistic fuzzy c-means clustering based on shadow sets for noisy data with imbalanced sizes
    Yu, Haiyan
    Li, Honglei
    Xu, Xiaoyu
    Gao, Qian
    Lan, Rong
    APPLIED SOFT COMPUTING, 2024, 167
  • [42] Boosting support vector machines for imbalanced data sets
    Wang, Benjamin X.
    Japkowicz, Nathalie
    KNOWLEDGE AND INFORMATION SYSTEMS, 2010, 25 (01) : 1 - 20
  • [43] Handling imbalanced data sets with a modification of Decorate algorithm
    Kotsiantis, Sotiris B.
    INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2008, 33 (2-3) : 91 - 98
  • [44] A comparative study on noise filtering of imbalanced data sets
    Szeghalmy, Szilvia
    Fazekas, Attila
    KNOWLEDGE-BASED SYSTEMS, 2024, 301
  • [45] Boosting support vector machines for imbalanced data sets
    Wang, Benjamin X.
    Japkowicz, Nathalie
    FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2008, 4994 : 38 - 47
  • [46] Boosting support vector machines for imbalanced data sets
    Benjamin X. Wang
    Nathalie Japkowicz
    Knowledge and Information Systems, 2010, 25 : 1 - 20
  • [47] Clustering Based Bagging Algorithm on Imbalanced Data Sets
    Sun, Xiao-Yan
    Zhang, Hua-Xiang
    Wang, Zhi-Chao
    INTEGRATED UNCERTAINTY IN KNOWLEDGE MODELLING AND DECISION MAKING, 2011, 7027 : 179 - 186
  • [48] Online Nonlinear AUC Maximization for Imbalanced Data Sets
    Hu, Junjie
    Yang, Haiqin
    Lyu, Michael R.
    King, Irwin
    So, Anthony Man-Cho
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (04) : 882 - 895
  • [49] Improving class probability estimates for imbalanced data
    Byron C. Wallace
    Issa J. Dahabreh
    Knowledge and Information Systems, 2014, 41 : 33 - 52
  • [50] Hybrid kernel machine ensemble for imbalanced data sets
    Li, Peng
    Chan, Kap Luk
    Fang, Wen
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2006, : 1108 - +