The use of the area under the roc curve in the evaluation of machine learning algorithms

被引:4663
|
作者
Bradley, AP [1 ]
机构
[1] UNIV QUEENSLAND,DEPT ELECT & COMP ENGN,COOPERAT RES CTR SENSOR SIGNAL & INFORMAT PROC,ST LUCIA,QLD 4072,AUSTRALIA
关键词
the ROC curve; the area under the ROC curve (AUC); accuracy measures; cross-validation; Wilcoxon statistic; standard error;
D O I
10.1016/S0031-3203(96)00142-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we investigate the use of the area under the receiver operating characteristic (ROC) curve (AUC) as a performance measure for machine learning algorithms. As a case study we evaluate six machine learning algorithms (C4.5, Multiscale Classifier, Perceptron, Multi-layer Perceptron, k-Nearest Neighbours, and a Quadratic Discriminant Function) on six ''real world'' medical diagnostics data sets. We compare and discuss the use of AUC to the more conventional overall accuracy and find that AUC exhibits a number of desirable properties when compared to overall accuracy: increased sensitivity in Analysis of Variance (ANOVA) tests; a standard error that decreased as both AUC and the number of test samples increased; decision threshold independent; and it is invariant to a priori class probabilities. The paper concludes with the recommendation that AUC be used in preference to overall accuracy for ''single number'' evaluation of machine learning algorithms. (C) 1997 Pattern Recognition Society.
引用
收藏
页码:1145 / 1159
页数:15
相关论文
共 50 条
  • [21] Active Sampling for Rank Learning via Optimizing the Area under the ROC Curve
    Donmez, Pinar
    Carbonell, Jaime G.
    ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2009, 5478 : 78 - +
  • [22] On the use of partial area under the ROC curve for comparison of two diagnostic tests
    Ma, Hua
    Bandos, Andriy I.
    Gur, David
    BIOMETRICAL JOURNAL, 2015, 57 (02) : 304 - 320
  • [23] Area under the ROC Curve has the most consistent evaluation for binary classification
    Li, Jing
    PLOS ONE, 2024, 19 (12):
  • [24] Evaluation of Area under the Constant Shape Bi-Weibull ROC Curve
    Pundir, Sudesh
    Amala, R.
    JOURNAL OF MODERN APPLIED STATISTICAL METHODS, 2014, 13 (01) : 305 - 328
  • [25] Regression analysis for the partial area under the ROC curve
    Cai, Tianxi
    Dodd, Lori E.
    STATISTICA SINICA, 2008, 18 (03) : 817 - 836
  • [26] On the limitations of the area under the ROC curve for NTCP modelling
    Bahn, Emanuel
    Alber, Markus
    RADIOTHERAPY AND ONCOLOGY, 2020, 144 : 148 - 151
  • [27] The area under the ROC curve as a measure of clustering quality
    Jaskowiak, Pablo A.
    Costa, Ivan G.
    Campello, Ricardo J. G. B.
    DATA MINING AND KNOWLEDGE DISCOVERY, 2022, 36 (03) : 1219 - 1245
  • [28] Analysis of Area under the ROC Curve of Energy Detection
    Atapattu, Saman
    Tellambura, Chintha
    Jiang, Hai
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2010, 9 (03) : 1216 - 1225
  • [29] Empirical likelihood inference for the area under the ROC curve
    Qin, GS
    Zhou, XH
    BIOMETRICS, 2006, 62 (02) : 613 - 622
  • [30] Nonparametric confidence intervals for the area under the ROC curve
    Adimari, Gianfranco
    STATISTICA, 2006, 66 (01) : 39 - 49