Evaluation Measures of the Classification Performance of Imbalanced Data Sets

被引:185
|
作者
Gu, Qiong [1 ,2 ]
Zhu, Li [2 ]
Cai, Zhihua [2 ]
机构
[1] Xiangfan Univ, Fac Math & Comp Sci, Xiangfan 441053, Hubei, Peoples R China
[2] China Univ Geosci, Sch Comp, Wuhan 430074, Peoples R China
关键词
Evaluation; classification performance; imbalanced data sets;
D O I
10.1007/978-3-642-04962-0_53
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Discriminant Measures for Classification Performance play a critical role in guiding the design of classifiers, assessment methods and evaluation measures are at least as important as algorithm and are the first key stage to a successful data mining. We systematically summarized the evaluation measures of Imbalanced Data Sets (IDS). Several different type measures, such as commonly performance evaluation measures and visualizing classifier performance measures have been analyzed and compared. The problems of these measures towards IDS may lead to misunderstanding of classification results and even wrong strategy decision. Beside that, a series of complex numerical evaluation measures were also investigated which can also serve for evaluating classification performance of IDS.
引用
收藏
页码:461 / +
页数:2
相关论文
共 50 条
  • [21] An experimental comparison of classification algorithms for imbalanced credit scoring data sets
    Brown, Iain
    Mues, Christophe
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (03) : 3446 - 3453
  • [22] An Effective Over-sampling Method for Imbalanced Data Sets Classification
    Zhai Yun
    Ma Nan
    Ruan Da
    An Bing
    CHINESE JOURNAL OF ELECTRONICS, 2011, 20 (03): : 489 - 494
  • [23] SVM classification for imbalanced data sets using a multiobjective optimization framework
    Askan, Aysegul
    Sayin, Serpil
    ANNALS OF OPERATIONS RESEARCH, 2014, 216 (01) : 191 - 203
  • [24] A New Sampling Approach for Classification of Imbalanced Data sets with High Density
    Jia Pengfei
    Zhang Chunkai
    He Zhenyu
    2014 INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2014, : 217 - 222
  • [25] Time series transductive classification on imbalanced data sets: an experimental study
    de Sousa, Celso A. R.
    Souza, Vinicius M. A.
    Batista, Gustavo E. A. P. A.
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 3780 - 3785
  • [26] Classification of imbalanced data with random sets and mean-variance filtering
    Nikulin, Vladimir
    International Journal of Data Warehousing and Mining, 2008, 4 (02) : 63 - 78
  • [27] SVM classification for imbalanced data sets using a multiobjective optimization framework
    Ayşegül Aşkan
    Serpil Sayın
    Annals of Operations Research, 2014, 216 : 191 - 203
  • [28] NPC: Neighbors' Progressive Competition Algorithm for Classification of Imbalanced Data Sets
    Saryazdi, Soroush
    Nikpour, Bahareh
    Nezamabadi-Pour, Hossein
    2017 3RD IRANIAN CONFERENCE ON SIGNAL PROCESSING AND INTELLIGENT SYSTEMS (ICSPIS), 2017, : 28 - 33
  • [29] Classification of Imbalanced data sets using Multi Objective Genetic Programming
    Maheta, Hardik H.
    Dabhi, Vipul K.
    2015 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2015,
  • [30] Contrastive dissimilarity: optimizing performance on imbalanced and limited data sets
    Teixeira, Lucas O.
    Bertolini, Diego
    Oliveira, Luiz S.
    Cavalcanti, George D. C.
    Costa, Yandre M. G.
    Neural Computing and Applications, 2024, 36 (32) : 20439 - 20456