Evaluation Measures of the Classification Performance of Imbalanced Data Sets

Cited by: 185
Authors
Gu, Qiong [1, 2]
Zhu, Li [2]
Cai, Zhihua [2]
Affiliations
[1] Xiangfan Univ, Fac Math & Comp Sci, Xiangfan 441053, Hubei, Peoples R China
[2] China Univ Geosci, Sch Comp, Wuhan 430074, Peoples R China
Keywords
Evaluation; classification performance; imbalanced data sets
DOI
10.1007/978-3-642-04962-0_53
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Discriminant measures of classification performance play a critical role in guiding the design of classifiers. Assessment methods and evaluation measures are at least as important as the algorithm itself and are the first key stage of successful data mining. We systematically summarize the evaluation measures for imbalanced data sets (IDS). Several types of measures, such as common performance evaluation measures and measures that visualize classifier performance, are analyzed and compared. The problems these measures have on IDS may lead to misunderstanding of classification results and even to wrong strategic decisions. In addition, a series of complex numerical evaluation measures that can also serve to evaluate classification performance on IDS are investigated.
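The abstract does not list the individual measures, so the following is only an illustrative sketch of why a common measure can mislead on an imbalanced data set. The function name `imbalance_metrics`, the choice of measures (accuracy, recall, specificity, precision, F1, G-mean), and the confusion-matrix counts are all assumptions made for this example and are not taken from the paper.

```python
from math import sqrt


def imbalance_metrics(tp, fn, fp, tn):
    """Compute a few standard measures from binary confusion-matrix counts.

    The positive class is assumed to be the minority class.
    Ratios with a zero denominator are reported as 0.0.
    """
    total = tp + fn + fp + tn
    accuracy = (tp + tn) / total
    recall = tp / (tp + fn) if (tp + fn) else 0.0        # true positive rate
    specificity = tn / (tn + fp) if (tn + fp) else 0.0   # true negative rate
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    g_mean = sqrt(recall * specificity)
    return {
        "accuracy": accuracy,
        "recall": recall,
        "specificity": specificity,
        "precision": precision,
        "f1": f1,
        "g_mean": g_mean,
    }


# Hypothetical data set: 10 minority (positive) and 990 majority (negative) samples.
# Classifier A labels everything as negative.
always_negative = imbalance_metrics(tp=0, fn=10, fp=0, tn=990)

# Classifier B finds 8 of the 10 positives at the cost of 40 false alarms.
detector = imbalance_metrics(tp=8, fn=2, fp=40, tn=950)

for name, result in (("always_negative", always_negative), ("detector", detector)):
    print(name, {key: round(value, 3) for key, value in result.items()})
```

In this made-up case the classifier that never detects the minority class has the higher accuracy, while recall, F1, and G-mean all rank the second classifier higher, which is the kind of misreading the abstract warns about.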
Pages: 461 / +
Number of pages: 2