Assessing the accuracy of prediction algorithms for classification: an overview

被引:1580
|
作者
Baldi, P [1 ]
Brunak, S
Chauvin, Y
Andersen, CAF
Nielsen, H
机构
[1] Univ Calif Irvine, Dept Informat & Comp Sci, Irvine, CA 92697 USA
[2] Tech Univ Denmark, Ctr Biol Sequence Anal, DK-2800 Lyngby, Denmark
[3] Net ID Inc, San Francisco, CA 94107 USA
[4] Univ Calif Irvine, Dept Biol Sci, Irvine, CA 92697 USA
关键词
D O I
10.1093/bioinformatics/16.5.412
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
We provide a unified overview of methods that currently are widely used to assess the accuracy of prediction algorithms, from raw percentages, quadratic error measures and other distances, ann correlation coefficients, and to information theoretic measures such as relative entropy and mutual information. We briefly discuss the advantages and disadvantages of each approach. For classification tasks, we derive new learning algorithms for the design of prediction systems by directly optimising the correlation coefficient. We observe and prove several results relating sensitivity nod specificity of optimal systems. While the principles are general, we illustrate the applicability on specific problems such as protein secondary structure and signal peptide prediction.
引用
收藏
页码:412 / 424
页数:13
相关论文
共 50 条
  • [1] Accuracy in prediction of soil orders using different classification algorithms
    Castrignanò, A
    Lopez, N
    ACCURACY 2000, PROCEEDINGS, 2000, : 99 - 103
  • [2] Comparison of the prediction accuracy of machine learning algorithms in crosslinguistic vowel classification
    Georgios P. Georgiou
    Scientific Reports, 13 (1)
  • [3] Comparison of the prediction accuracy of machine learning algorithms in crosslinguistic vowel classification
    Georgiou, Georgios P.
    SCIENTIFIC REPORTS, 2023, 13 (01):
  • [4] Survey of classification algorithms for formulating yield prediction accuracy in precision agriculture
    Savla, Anshal
    Dhawan, Parul
    Bhadada, Himtanaya
    Israni, Nivedita
    Mandholia, Alisha
    Bhardwaj, Sanya
    2015 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2015,
  • [5] Reinforcement Learning Algorithms: An Overview and Classification
    AlMahamid, Fadi
    Grolinger, Katarina
    2021 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2021,
  • [6] Data mining classification algorithms: An overview
    Bardab, Saeed Ngmaldin
    Ahmed, Tarig Mohamed
    Mohammed, Tarig Abdalkarim Abdalfadil
    INTERNATIONAL JOURNAL OF ADVANCED AND APPLIED SCIENCES, 2021, 8 (02): : 1 - 5
  • [7] Assessing the Impact of Land Cover Classification Methods on the Accuracy of Urban Land Change Prediction
    Zubair, Opeyemi A.
    Ji, Wei
    CANADIAN JOURNAL OF REMOTE SENSING, 2015, 41 (03) : 170 - 190
  • [8] Assessing the Accuracy of Multiple Classification Algorithms for Crop Classification Using Landsat-8 and Sentinel-2 Data
    Chakhar, Amal
    Ortega-Terol, Damian
    Hernandez-Lopez, David
    Ballesteros, Rocio
    Ortega, Jose E.
    Moreno, Miguel A.
    REMOTE SENSING, 2020, 12 (11)
  • [9] A METHOD OF ASSESSING ACCURACY OF A DIGITAL CLASSIFICATION
    QUIRK, BK
    SCARPACE, FL
    PHOTOGRAMMETRIC ENGINEERING AND REMOTE SENSING, 1980, 46 (11): : 1427 - 1431
  • [10] An Overview of Lung Cancer Classification Algorithms and their Performances
    Taher, F.
    Prakash, N.
    Shaffie, A.
    Soliman, A.
    El-Baz, A.
    IAENG International Journal of Computer Science, 2021, 48 (04)