An Efficient Variable Selection Method for Predictive Discriminant Analysis

被引:0
|
作者
Iduseri A. [1 ]
Osemwenkhae J.E. [1 ]
机构
[1] Department of Mathematics, Faculty of Physical Sciences, University of Benin, P.M.B. 1154, Benin City, 300001, Edo State
关键词
Actual hit rate; Predictive discriminant analysis; Superior subset; Variable selection;
D O I
10.1007/s40745-015-0061-9
中图分类号
学科分类号
摘要
Seeking a subset of relevant predictor variables for use in predictive model construction in order to simplify the model, obtain shorter training time, as well as enhance generalization by reducing overfitting is a common preprocessing step prior to training a predictive model. In predictive discriminant analysis, the use of classic variable selection methods as a preprocessing step, may lead to “good” overall correct classification within the confusion matrix. However, in most cases, the obtained best subset of predictor variables are not superior (both in terms of the number and combination of the predictor variables, as well as the hit rate obtained when used as training sample) to all other subsets from the same historical sample. Hence the obtained maximum hit rate of the obtained predictive discriminant function is often not optimal even for the training sample that gave birth to it. This paper proposes an efficient variable selection method for obtaining a subset of predictors that will be superior to all other subsets from the same historical sample. In application to real life datasets, the obtained predictive function using our proposed method achieved an actual hit rate that was essentially equal to that of the all-possible-subset method, with a significantly less computational expense. © 2015, Springer-Verlag Berlin Heidelberg.
引用
收藏
页码:489 / 504
页数:15
相关论文
共 50 条
  • [11] Kernel Canonical Discriminant Analysis Based on Variable Selection
    Ikeda, Seiichi
    Sato, Yoshiharu
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2009, 13 (04) : 416 - 420
  • [12] Variable Selection in Canonical Discriminant Analysis for Family Studies
    Jin, Man
    Fang, Yixin
    BIOMETRICS, 2011, 67 (01) : 124 - 132
  • [13] Variable selection and error rate estimation in discriminant analysis
    Le Roux, NJ
    Steel, SJ
    Louw, N
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 1997, 59 (03) : 195 - 219
  • [14] Input variable selection in kernel Fisher discriminant analysis
    Louw, N
    Steel, SJ
    FROM DATA AND INFORMATION ANALYSIS TO KNOWLEDGE ENGINEERING, 2006, : 126 - +
  • [15] An efficient kernel discriminant analysis method
    Lu, JW
    Plataniotis, KN
    Venetsanopoulos, A
    Wang, J
    PATTERN RECOGNITION, 2005, 38 (10) : 1788 - 1790
  • [16] SIMULTANEOUS PROCEDURES FOR VARIABLE SELECTION IN MULTIPLE DISCRIMINANT-ANALYSIS
    MCKAY, RJ
    BIOMETRIKA, 1977, 64 (02) : 283 - 290
  • [17] A variable selection technique in discriminant analysis with application in marketing data
    Gupta, AK
    Logan, TP
    Chen, J
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 1999, 63 (02) : 187 - 199
  • [18] Stopping rule for variable selection using stepwise discriminant analysis
    C. S. Munita
    L. P. Barroso
    P. M. S. Oliveira
    Journal of Radioanalytical and Nuclear Chemistry, 2006, 269 : 335 - 338
  • [19] Ant colony optimization for variable selection in discriminant linear analysis
    Pontes, Aline S.
    Araujo, Alisson
    Marinho, Weverton
    Goncalves Dias Diniz, Paulo H.
    Gomes, Adriano de Araujo
    Goicoechea, Hector C.
    Silva, Edvan C.
    Araujo, Mario C. U.
    JOURNAL OF CHEMOMETRICS, 2020, 34 (12)
  • [20] Variable selection in discriminant analysis: measuring the influence of individual cases
    Steel, SJ
    Louw, N
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2001, 37 (02) : 249 - 260