Assessing the accuracy of prediction algorithms for classification: an overview

被引:1580
|
作者
Baldi, P [1 ]
Brunak, S
Chauvin, Y
Andersen, CAF
Nielsen, H
机构
[1] Univ Calif Irvine, Dept Informat & Comp Sci, Irvine, CA 92697 USA
[2] Tech Univ Denmark, Ctr Biol Sequence Anal, DK-2800 Lyngby, Denmark
[3] Net ID Inc, San Francisco, CA 94107 USA
[4] Univ Calif Irvine, Dept Biol Sci, Irvine, CA 92697 USA
关键词
D O I
10.1093/bioinformatics/16.5.412
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
We provide a unified overview of methods that currently are widely used to assess the accuracy of prediction algorithms, from raw percentages, quadratic error measures and other distances, ann correlation coefficients, and to information theoretic measures such as relative entropy and mutual information. We briefly discuss the advantages and disadvantages of each approach. For classification tasks, we derive new learning algorithms for the design of prediction systems by directly optimising the correlation coefficient. We observe and prove several results relating sensitivity nod specificity of optimal systems. While the principles are general, we illustrate the applicability on specific problems such as protein secondary structure and signal peptide prediction.
引用
收藏
页码:412 / 424
页数:13
相关论文
共 50 条
  • [21] Evaluating the Accuracy and Efficiency of Complex Network Classification Algorithms
    Bray, Margaret
    Hertzberg, Vicki
    10TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY AND INTERNET-BASED SYSTEMS SITIS 2014, 2014, : 355 - 360
  • [22] Data complexity and classification accuracy correlation in oversampling algorithms
    Komorniczak, Joanna
    Ksieniewicz, Pawel
    Wozniak, Michal
    FOURTH INTERNATIONAL WORKSHOP ON LEARNING WITH IMBALANCED DOMAINS: THEORY AND APPLICATIONS, VOL 183, 2022, 183 : 175 - 186
  • [23] A method for improving the accuracy of data mining classification algorithms
    Mastrogiannis, Nikolaos
    Boutsinas, Basilis
    Giannikos, Ioannis
    COMPUTERS & OPERATIONS RESEARCH, 2009, 36 (10) : 2829 - 2839
  • [24] Increasing the Target Prediction Accuracy of MicroRNA Based on Combination of Prediction Algorithms
    Shatnawi, Mohammed Q.
    Alhammouri, Mohammad
    Mukdadi, Kholoud
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (06) : 408 - 419
  • [25] An overview of solder bump shape prediction algorithms with validations
    Chiang, KN
    Yuan, CA
    IEEE TRANSACTIONS ON ADVANCED PACKAGING, 2001, 24 (02): : 158 - 162
  • [26] Image registration algorithms and prediction models - overview and pitfalls
    Baroni, G.
    Paganelli, C.
    RADIOTHERAPY AND ONCOLOGY, 2020, 152 : S423 - S423
  • [27] Classification accuracy and correlation: LDA in failure prediction
    Laitinen, Erkki K.
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2007, 183 (01) : 210 - 225
  • [28] Prediction of Speech Recognition Accuracy for Utterance Classification
    Korenevsky, Maxim L.
    Smirnov, Andrey B.
    Mendelev, Valentin S.
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1275 - 1279
  • [29] Prediction of leptospirosis cases using classification algorithms
    Rocha Nery, Nivison Ruy, Jr.
    Claro, Daniela Barreiro
    Lindow, Janet C.
    IET SOFTWARE, 2017, 11 (03) : 93 - 99
  • [30] Prediction of Data Breaches using Classification Algorithms
    Kumari, Ankita
    Prakash, Prajish
    Umadevi, M.
    PROCEEDINGS OF THE 2021 FIFTH INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC 2021), 2021, : 1049 - 1054