The Difference Between the Accuracy of Real and the Corresponding Random Model is a Useful Parameter for Validation of Two-State Classification Model Quality

Cited by: 17
Authors
Batista, Jadranko [1]
Vikic-Topic, Drazen [2]
Lucic, Bono [2]
Affiliations
[1] Univ Mostar, Fac Sci & Educ, Mostar, Bosnia & Herceg
[2] Rudjer Boskovic Inst, NMR Ctr, POB 180, HR-10002 Zagreb, Croatia
Keywords
classification model; Q(2) accuracy; overall classification accuracy; random classification accuracy; classification accuracy difference; correct class estimation; under-prediction; over-prediction; class imbalance; membrane structure modeling; QSAR classification modeling; PREDICTION; TOPOLOGY; PROTEINS; INDEXES;
DOI
10.5562/cca3117
Chinese Library Classification number
O6 [Chemistry];
Discipline classification code
0703;
Abstract
The simplest and most commonly used measure for assessing classification model quality is the parameter Q(2) = 100 (p + n) / N (%), named the classification accuracy, where p, n and N are the total number of correctly predicted compounds in the first class, the total number of correctly predicted compounds in the second class, and the total number of elements of both classes (compounds) in the data set, respectively. Moreover, the most probable accuracy that can be obtained by a random model is calculated for a two-state model by the formula Q(2,rnd) = 100 [(p + u)(p + o) + (n + u)(n + o)] / N^2 (%), where u and o are the total numbers of under-predictions (when class 1 is predicted by the model as class 2) and over-predictions (when class 2 is predicted by the model as class 1) in the data set, respectively. Finally, the difference between these two parameters, Delta Q(2) = Q(2) - Q(2,rnd), is introduced, and it is suggested to compute and report Delta Q(2) for each two-state classification model to assess its contribution over the accuracy of the corresponding random model. When the data set is ideally balanced, having the same number of elements in both classes, the two-state classification problem is the most difficult, with maximal Q(2) = 100 % and Q(2,rnd) = 50 %, giving the maximal Delta Q(2) = 50 %. The usefulness of the Delta Q(2) parameter is illustrated in a comparative analysis of two-class classification models from the literature for prediction of secondary structure of membrane proteins and of several quantitative structure-property models. The real contributions of these models over the random level of accuracy are calculated, and their Delta Q(2) values are compared mutually and with the value of Delta Q(2) (= 50 %) for the most difficult two-state classification model.
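The three quantities defined in the abstract can be computed directly from the four confusion counts. The following is a minimal Python sketch, not the authors' code: the function name `q2_metrics` is hypothetical, and it assumes N = p + n + u + o, which follows from the definitions above if every compound is either correctly predicted, under-predicted or over-predicted.

```python
def q2_metrics(p, n, u, o):
    """Return (Q2, Q2_rnd, Delta_Q2) in percent for a two-state model.

    p, n: correctly predicted compounds in class 1 and class 2
    u:    under-predictions (class 1 predicted as class 2)
    o:    over-predictions  (class 2 predicted as class 1)
    Assumption: the data set size is N = p + n + u + o.
    """
    total = p + n + u + o
    if total <= 0:
        raise ValueError("the data set must contain at least one compound")

    # Overall classification accuracy: Q2 = 100 (p + n) / N
    q2 = 100.0 * (p + n) / total

    # Most probable accuracy of the corresponding random model:
    # Q2_rnd = 100 [(p + u)(p + o) + (n + u)(n + o)] / N^2
    q2_rnd = 100.0 * ((p + u) * (p + o) + (n + u) * (n + o)) / total**2

    # Contribution of the real model over the random level
    return q2, q2_rnd, q2 - q2_rnd


if __name__ == "__main__":
    # Perfect model on an ideally balanced set: the hardest case.
    print(q2_metrics(p=50, n=50, u=0, o=0))   # (100.0, 50.0, 50.0)
    # Imbalanced set where everything is assigned to class 1.
    print(q2_metrics(p=90, n=0, u=0, o=10))   # (90.0, 90.0, 0.0)
```

The second call illustrates why reporting Delta Q(2) alongside Q(2) may be informative: on an imbalanced data set, a model that assigns every compound to the majority class reaches Q(2) = 90 % yet contributes nothing over the random level.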
Pages: 527-534
Number of pages: 8