An Efficient Variable Selection Method for Predictive Discriminant Analysis

被引:0
|
作者
Iduseri A. [1 ]
Osemwenkhae J.E. [1 ]
机构
[1] Department of Mathematics, Faculty of Physical Sciences, University of Benin, P.M.B. 1154, Benin City, 300001, Edo State
关键词
Actual hit rate; Predictive discriminant analysis; Superior subset; Variable selection;
D O I
10.1007/s40745-015-0061-9
中图分类号
学科分类号
摘要
Seeking a subset of relevant predictor variables for use in predictive model construction in order to simplify the model, obtain shorter training time, as well as enhance generalization by reducing overfitting is a common preprocessing step prior to training a predictive model. In predictive discriminant analysis, the use of classic variable selection methods as a preprocessing step, may lead to “good” overall correct classification within the confusion matrix. However, in most cases, the obtained best subset of predictor variables are not superior (both in terms of the number and combination of the predictor variables, as well as the hit rate obtained when used as training sample) to all other subsets from the same historical sample. Hence the obtained maximum hit rate of the obtained predictive discriminant function is often not optimal even for the training sample that gave birth to it. This paper proposes an efficient variable selection method for obtaining a subset of predictors that will be superior to all other subsets from the same historical sample. In application to real life datasets, the obtained predictive function using our proposed method achieved an actual hit rate that was essentially equal to that of the all-possible-subset method, with a significantly less computational expense. © 2015, Springer-Verlag Berlin Heidelberg.
引用
收藏
页码:489 / 504
页数:15
相关论文
共 50 条
  • [1] Variational discriminant analysis with variable selection
    Yu, Weichang
    Ormerod, John T.
    Stewart, Michael
    STATISTICS AND COMPUTING, 2020, 30 (04) : 933 - 951
  • [2] Variational discriminant analysis with variable selection
    Weichang Yu
    John T. Ormerod
    Michael Stewart
    Statistics and Computing, 2020, 30 : 933 - 951
  • [3] COMPUTATIONS FOR VARIABLE SELECTION IN DISCRIMINANT-ANALYSIS
    MCCABE, GP
    TECHNOMETRICS, 1975, 17 (01) : 103 - 109
  • [4] VARIABLE SELECTION IN HETEROSCEDASTIC DISCRIMINANT-ANALYSIS
    FATTI, LP
    HAWKINS, DM
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1986, 81 (394) : 494 - 500
  • [5] Variable selection in discriminant analysis in the presence of outliers
    Steel, SJ
    Louw, N
    ITI 2001: PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY INTERFACES, 2001, : 251 - 256
  • [6] Analysis of new variable selection methods for discriminant analysis
    Pacheco, Joaquin
    Casado, Silvia
    Nunez, Laura
    Gomez, Olga
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2006, 51 (03) : 1463 - 1478
  • [7] A consistent variable selection method in high-dimensional canonical discriminant analysis
    Oda, Ryoya
    Suzuki, Yuya
    Yanagihara, Hirokazu
    Fujikoshi, Yasunori
    JOURNAL OF MULTIVARIATE ANALYSIS, 2020, 175
  • [8] Variable Selection in PLS Discriminant Analysis via the Disco
    Simonetti, Biagio
    Lucadamo, Antonio
    Rodriguez, Maria R. G.
    CURRENT ANALYTICAL CHEMISTRY, 2012, 8 (02) : 266 - 272
  • [9] Variable selection in model-based discriminant analysis
    Maugis, C.
    Celeux, G.
    Martin-Magniette, M-L
    JOURNAL OF MULTIVARIATE ANALYSIS, 2011, 102 (10) : 1374 - 1387
  • [10] DALASS: Variable selection in discriminant analysis via the LASSO
    Trendafilov, Nickolay T.
    Jolliffe, Ian T.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 51 (08) : 3718 - 3736