Evaluation Measures of the Classification Performance of Imbalanced Data Sets

被引:185
|
作者
Gu, Qiong [1 ,2 ]
Zhu, Li [2 ]
Cai, Zhihua [2 ]
机构
[1] Xiangfan Univ, Fac Math & Comp Sci, Xiangfan 441053, Hubei, Peoples R China
[2] China Univ Geosci, Sch Comp, Wuhan 430074, Peoples R China
关键词
Evaluation; classification performance; imbalanced data sets;
D O I
10.1007/978-3-642-04962-0_53
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Discriminant Measures for Classification Performance play a critical role in guiding the design of classifiers, assessment methods and evaluation measures are at least as important as algorithm and are the first key stage to a successful data mining. We systematically summarized the evaluation measures of Imbalanced Data Sets (IDS). Several different type measures, such as commonly performance evaluation measures and visualizing classifier performance measures have been analyzed and compared. The problems of these measures towards IDS may lead to misunderstanding of classification results and even wrong strategy decision. Beside that, a series of complex numerical evaluation measures were also investigated which can also serve for evaluating classification performance of IDS.
引用
收藏
页码:461 / +
页数:2
相关论文
共 50 条
  • [11] Monotonic classification: An overview on algorithms, performance measures and data sets
    Cano, Jose-Ramon
    Antonio Gutierrez, Pedro
    Krawczyk, Bartosz
    Wozniak, Michal
    Garcia, Salvador
    NEUROCOMPUTING, 2019, 341 : 168 - 182
  • [12] Empirical Assessment of Performance Measures for Preprocessing Moments in Imbalanced Data Classification Problem
    Szeszko, Pawel
    Topczewska, Magdalena
    COMPUTER INFORMATION SYSTEMS AND INDUSTRIAL MANAGEMENT, CISIM 2016, 2016, 9842 : 183 - 194
  • [13] An Improved Algorithm for SVMs Classification of Imbalanced Data Sets
    Castro, Cristiano Leite
    Carvalho, Mateus Araujo
    Braga, Antonio Padua
    ENGINEERING APPLICATIONS OF NEURAL NETWORKS, PROCEEDINGS, 2009, 43 : 108 - 118
  • [14] Classification of imbalanced marketing data with balanced random sets
    Nikulin, Vladimir
    McLachlan, Geoffrey J.
    Journal of Machine Learning Research, 2009, 7 : 89 - 100
  • [15] An evaluation of progressive sampling for imbalanced data sets
    Ng, Willie
    Dash, Manoranjan
    ICDM 2006: SIXTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, WORKSHOPS, 2006, : 657 - +
  • [16] Improving the classification performance on imbalanced data sets via new hybrid parameterisation model
    Mohamad, Masurah
    Selamat, Ali
    Subroto, Imam Much
    Krejcar, Ondrej
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2021, 33 (07) : 787 - 797
  • [17] Evaluation of the Classifiers in Multiparameter and Imbalanced Data Sets
    Piotrowska, Ewelina
    INFORMATION SYSTEMS ARCHITECTURE AND TECHNOLOGY, ISAT 2019, PT II, 2020, 1051 : 263 - 273
  • [18] A Study of Interestingness Measures for Associative Classification on Imbalanced Data
    Yang, Guangfei
    Cui, Xuejiao
    TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2015, 2015, 9441 : 141 - 151
  • [19] Improving SVM Classification on Imbalanced Data Sets in Distance Spaces
    Koeknar-Tezel, Suzan
    Latecki, Longin Jan
    2009 9TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2009, : 259 - +
  • [20] Classification performance assessment for imbalanced multiclass data
    Aguilar-Ruiz, Jesus S.
    Michalak, Marcin
    SCIENTIFIC REPORTS, 2024, 14 (01):