Comparing classification models-a practical tutorial

Cited by: 6
Author
Walters, W. Patrick [1]
Affiliation
[1] Relay Therapeut, 399 Binney St, Cambridge, MA 02141 USA
Keywords
QSAR; Classification model; Statistical validation; Machine learning; Tutorial
DOI
10.1007/s10822-021-00417-2
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular biology]
Discipline classification code
071010; 081704
Abstract
While machine learning models have become a mainstay in cheminformatics, the field has yet to agree on standards for model evaluation and comparison. In many cases, authors compare methods by performing multiple folds of cross-validation and reporting the mean value of an evaluation metric such as the area under the receiver operating characteristic curve (ROC AUC). These comparisons of mean values often lack statistical rigor and can lead to inaccurate conclusions. In the interest of encouraging best practices, this tutorial provides an example of how multiple methods can be compared in a statistically rigorous fashion.
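To make the abstract's point concrete, the sketch below compares two methods on their per-fold scores rather than on their means alone. The abstract does not say which statistical test the tutorial uses; the exact paired sign-flip permutation test here is an assumed stand-in, the function name `paired_sign_flip_test` is hypothetical, and the per-fold AUC values are made-up illustrative numbers, not results from the paper.

```python
from itertools import product


def paired_sign_flip_test(scores_a, scores_b):
    """Exact two-sided paired permutation (sign-flip) test.

    scores_a and scores_b hold one metric value per cross-validation fold,
    with both methods evaluated on the same folds.
    Returns (mean difference, p-value).
    """
    if len(scores_a) != len(scores_b) or not scores_a:
        raise ValueError("both methods need scores for the same, non-empty folds")

    diffs = [a - b for a, b in zip(scores_a, scores_b)]
    n = len(diffs)
    mean_diff = sum(diffs) / n
    observed = abs(mean_diff)

    # Under the null hypothesis the sign of each paired difference is arbitrary,
    # so enumerate all 2**n sign assignments and count those at least as extreme.
    n_extreme = 0
    n_total = 0
    for signs in product((1, -1), repeat=n):
        permuted = abs(sum(s * d for s, d in zip(signs, diffs)) / n)
        if permuted >= observed - 1e-12:  # tolerance for floating-point ties
            n_extreme += 1
        n_total += 1
    return mean_diff, n_extreme / n_total


# Hypothetical per-fold ROC AUC values for two methods (illustrative only).
auc_method_a = [0.81, 0.79, 0.84, 0.80, 0.83, 0.78, 0.82, 0.85, 0.80, 0.81]
auc_method_b = [0.80, 0.80, 0.82, 0.79, 0.83, 0.79, 0.80, 0.84, 0.81, 0.80]

mean_diff, p_value = paired_sign_flip_test(auc_method_a, auc_method_b)
print(f"mean AUC difference = {mean_diff:.4f}, p = {p_value:.4f}")
```

Scores from cross-validation folds are not fully independent because the training sets overlap, so a p-value from a sketch like this should be read as a rough guide rather than a definitive result.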
Pages: 381-389
Page count: 9
Published in
Journal of Computer-Aided Molecular Design, 2022, 36: 381-389