A study of statistical techniques and performance measures for genetics-based machine learning: accuracy and interpretability

Cited by: 0
Authors
S. García
A. Fernández
J. Luengo
F. Herrera
Affiliations
[1] University of Jaén, Department of Computer Science
[2] University of Granada, Department of Computer Science and Artificial Intelligence
Source
Soft Computing | 2009 / Vol. 13
Keywords
Genetics-based machine learning; Genetic algorithms; Statistical tests; Non-parametric tests; Cohen’s kappa; Interpretability; Classification
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
Experimental analysis of the performance of a proposed method is a crucial and necessary task in research. This paper focuses on the statistical analysis of results in the field of genetics-based machine learning. It presents a study of a set of techniques that can be used for a rigorous comparison among algorithms in terms of obtaining successful classification models. Two accuracy measures for multi-class problems are employed: classification rate and Cohen’s kappa. Two interpretability measures are also employed: size of the rule set and number of antecedents. We study whether the samples of results obtained by genetics-based classifiers, using the performance measures cited above, satisfy the necessary conditions for being analysed by means of parametric tests. The results show that the fulfillment of these conditions is problem-dependent and indefinite, which supports the use of non-parametric statistics in the experimental analysis. In addition, non-parametric tests can be satisfactorily employed for comparing generic classifiers over various data-sets considering any performance measure. Based on these facts, we propose the use of the most powerful non-parametric statistical tests to carry out multiple comparisons. However, the statistical analysis conducted on interpretability must be carefully considered.
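The two accuracy measures named in the abstract can be illustrated with a minimal sketch. It assumes the standard textbook definition of Cohen’s kappa, kappa = (p_o - p_e) / (1 - p_e), where p_o is the classification rate and p_e is the agreement expected by chance; the paper's own computation may differ in detail. The function names and the toy labels are hypothetical and not taken from the paper.

```python
from collections import Counter


def classification_rate(y_true, y_pred):
    """Fraction of examples whose predicted label equals the true label."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("label sequences must be non-empty and of equal length")
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def cohen_kappa(y_true, y_pred):
    """Cohen's kappa: observed agreement corrected for chance agreement."""
    n = len(y_true)
    p_o = classification_rate(y_true, y_pred)
    true_counts = Counter(y_true)
    pred_counts = Counter(y_pred)
    # Chance agreement: sum over classes of the product of the marginal frequencies.
    p_e = sum(true_counts[c] * pred_counts[c] for c in true_counts) / (n * n)
    if p_e == 1.0:
        # Kappa is undefined when both labelings use one and the same class.
        return float("nan")
    return (p_o - p_e) / (1.0 - p_e)


# Toy, made-up labels: a classifier that always predicts the majority class.
y_true = ["a", "a", "a", "a", "b", "b"]
y_pred = ["a", "a", "a", "a", "a", "a"]
rate = classification_rate(y_true, y_pred)
kappa = cohen_kappa(y_true, y_pred)
```

In this toy case the classification rate is 4/6 while kappa is 0, which suggests why a chance-corrected measure can be informative alongside plain accuracy on multi-class or imbalanced problems.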
Related papers
50 in total
  • [31] Machine learning for genetics-based classification and treatment response prediction in cancer of unknown primary
    Moon, Intae
    LoPiccolo, Jaclyn
    Baca, Sylvan C.
    Sholl, Lynette M.
    Kehl, Kenneth L.
    Hassett, Michael J.
    Liu, David
    Schrag, Deborah
    Gusev, Alexander
    NATURE MEDICINE, 2023, 29 (08) : 2057 - +
  • [32] Machine learning for genetics-based classification and treatment response prediction in cancer of unknown primary
    Intae Moon
    Jaclyn LoPiccolo
    Sylvan C. Baca
    Lynette M. Sholl
    Kenneth L. Kehl
    Michael J. Hassett
    David Liu
    Deborah Schrag
    Alexander Gusev
    Nature Medicine, 2023, 29 : 2057 - 2067
  • [33] USING TRANSPUTERS TO INCREASE SPEED AND FLEXIBILITY OF GENETICS-BASED MACHINE LEARNING-SYSTEMS
    DORIGO, M
    MICROPROCESSING AND MICROPROGRAMMING, 1992, 34 (1-5) : 147 - 152
  • [34] An adaptive classifier system tree for extending genetics-based machine learning in a dynamic environment
    Dongcheng Hu
    Rui Jiang
    Yupin Luo
    Artificial Life and Robotics, 2000, 4 (1) : 7 - 11
  • [35] Hybrid Fuzzy Genetics-based Machine Learning with Entropy-based Inhomogeneous Interval Discretization
    Takahashi, Yuji
    Nojima, Yusuke
    Ishibuchi, Hisao
    2014 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2014 : 1512 - 1517
  • [36] Machine-learning interpretability techniques for seismic performance assessment of infrastructure systems
    Mangalathu, Sujith
    Karthikeyan, Karthika
    Feng, De-Cheng
    Jeon, Jong-Su
    ENGINEERING STRUCTURES, 2022, 250
  • [37] Comparative Study of Machine Learning Techniques for Population Genetics
    Amin, Muhammad Arslan
    Hanif, Muhammad Kashif
    Sarwar, Muhammad Umer
    Abbas, Mohsin
    Jilani, Muhammad Haroon
    Nasir, Usman
    Sarwar, Muhammad Bilal
    Talha, Hafiz Muhammad
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2019, 19 (06) : 78 - 84
  • [38] Rotation Effects of Objective Functions in Parallel Distributed Multiobjective Fuzzy Genetics-Based Machine Learning
    Takahashi, Yuji
    Nojima, Yusuke
    Ishibuchi, Hisao
    2015 10TH ASIAN CONTROL CONFERENCE (ASCC), 2015
  • [39] GAssist vs. BioHEL: critical assessment of two paradigms of genetics-based machine learning
    Franco, Maria A.
    Krasnogor, Natalio
    Bacardit, Jaume
    SOFT COMPUTING, 2013, 17 (06) : 953 - 981
  • [40] Rule acquisition for production scheduling - A genetics-based machine learning approach to flexible shop scheduling
    Tamaki, H
    Sakakibara, K
    Murao, H
    Kitamura, S
    SICE 2003 ANNUAL CONFERENCE, VOLS 1-3, 2003 : 2762 - 2767