Improving ANN Performance for Imbalanced Data Sets by Means of the NTIL Technique

被引:0
|
作者
Vivaracho-Pascual, Carlos [1 ]
Simon-Hurtado, Arancha [1 ]
机构
[1] Univ Valladolid, Dept Comp Sci, E-47002 Valladolid, Spain
关键词
RECOGNITION; CLASSIFIER;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper deals with the problem of training an Artificial Neural Network (ANN) when the data sets are very imbalanced. Most learning algorithms, including ANN, are designed for well-balanced data and do not work properly on imbalanced ones. Of the approaches proposed for dealing with this problem, we are interested in the re-sampling ones, since they are algorithm-independent. We have recently proposed a new under-sampling technique for the two-class problem, called Non-Target Incremental Learning (NTIL), which has shown a good performance with SVM, improving results and training speed. Here, the advantages of using this technique with ANN are shown. The performance with regard to other popular under-sampling techniques is compared.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Improving SVM training by means of NTIL when the data sets are imbalanced
    Vivaracho, Carlos E.
    FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2006, 4203 : 111 - 120
  • [2] Improving the classification performance on imbalanced data sets via new hybrid parameterisation model
    Mohamad, Masurah
    Selamat, Ali
    Subroto, Imam Much
    Krejcar, Ondrej
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2021, 33 (07) : 787 - 797
  • [3] FUZZY AND SMOTE RESAMPLING TECHNIQUE FOR IMBALANCED DATA SETS
    Zorkeflee, Maisarah
    Din, Aniza Mohamed
    Ku-Mahamud, Ku Ruhana
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON COMPUTING & INFORMATICS, 2015, : 638 - 643
  • [4] Improving SVM Classification on Imbalanced Data Sets in Distance Spaces
    Koeknar-Tezel, Suzan
    Latecki, Longin Jan
    2009 9TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2009, : 259 - +
  • [5] Evaluation Measures of the Classification Performance of Imbalanced Data Sets
    Gu, Qiong
    Zhu, Li
    Cai, Zhihua
    COMPUTATIONAL INTELLIGENCE AND INTELLIGENT SYSTEMS, 2009, 51 : 461 - +
  • [6] Imbalanced Data Classification Based on MBCDK-means Undersampling and GA-ANN
    Song, Anping
    Xu, Quanhua
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT II, 2018, 11140 : 349 - 358
  • [7] Exploring An Iterative Feature Selection Technique for Highly Imbalanced Data Sets
    Khoshgoftaar, Taghi M.
    Gao, Kehan
    Napolitano, Amri
    2012 IEEE 13TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IRI), 2012, : 101 - 108
  • [8] Contrastive dissimilarity: optimizing performance on imbalanced and limited data sets
    Teixeira, Lucas O.
    Bertolini, Diego
    Oliveira, Luiz S.
    Cavalcanti, George D. C.
    Costa, Yandre M. G.
    Neural Computing and Applications, 2024, 36 (32) : 20439 - 20456
  • [9] Data Mining on Imbalanced Data Sets
    Gu, Qiong
    Cai, Zhihua
    Zhu, Li
    Huang, Bo
    2008 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER THEORY AND ENGINEERING, 2008, : 1020 - 1024
  • [10] Improving SVM classification on imbalanced time series data sets with ghost points
    Koeknar-Tezel, Suzan
    Latecki, Longin Jan
    KNOWLEDGE AND INFORMATION SYSTEMS, 2011, 28 (01) : 1 - 23