Variable selection in discriminant analysis based on the location model for mixed variables

被引:10
|
作者
Mahat, Nor Idayu [1 ]
Krzanowski, Wojtek Janusz [2 ]
Hernandez, Adolfo [3 ]
机构
[1] Univ Utara Malaysia, Fac Quantitat Sci, Sintok 06010, Kedah, Malaysia
[2] Univ Exeter, Sch Engn Comp Sci & Math, Exeter EX4 4QE, Devon, England
[3] Univ Complutense, Escuela Univ Estudios Empresariales, Madrid 28003, Spain
关键词
Brier score; Cross-validation; Discriminant analysis; Error rate; Kullback-Leibler divergence; Location model; Non-parametric smoothing procedures; Variable selection;
D O I
10.1007/s11634-007-0009-9
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Non-parametric smoothing of the location model is a potential basis for discriminating between groups of objects using mixtures of continuous and categorical variables simultaneously. However, it may lead to unreliable estimates of parameters when too many variables are involved. This paper proposes a method for performing variable selection on the basis of distance between groups as measured by smoothed Kullback-Leibler divergence. Searching strategies using forward, backward and step-wise selections are outlined, and corresponding stopping rules derived from asymptotic distributional results are proposed. Results from a Monte Carlo study demonstrate the feasibility of the method. Examples on real data show that the method is generally competitive with, and sometimes is better than, other existing classification methods.
引用
收藏
页码:105 / 122
页数:18
相关论文
共 50 条
  • [41] Model Selection With Mixed Variables on the Lasso Path
    X. Jessie Jeng
    Huimin Peng
    Wenbin Lu
    Sankhya B, 2021, 83 : 170 - 184
  • [42] Model Selection With Mixed Variables on the Lasso Path
    Jeng, X. Jessie
    Peng, Huimin
    Lu, Wenbin
    SANKHYA-SERIES B-APPLIED AND INTERDISCIPLINARY STATISTICS, 2021, 83 (01): : 170 - 184
  • [43] SIMULTANEOUS PROCEDURES FOR VARIABLE SELECTION IN MULTIPLE DISCRIMINANT-ANALYSIS
    MCKAY, RJ
    BIOMETRIKA, 1977, 64 (02) : 283 - 290
  • [44] A variable selection technique in discriminant analysis with application in marketing data
    Gupta, AK
    Logan, TP
    Chen, J
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 1999, 63 (02) : 187 - 199
  • [45] Stopping rule for variable selection using stepwise discriminant analysis
    C. S. Munita
    L. P. Barroso
    P. M. S. Oliveira
    Journal of Radioanalytical and Nuclear Chemistry, 2006, 269 : 335 - 338
  • [46] Ant colony optimization for variable selection in discriminant linear analysis
    Pontes, Aline S.
    Araujo, Alisson
    Marinho, Weverton
    Goncalves Dias Diniz, Paulo H.
    Gomes, Adriano de Araujo
    Goicoechea, Hector C.
    Silva, Edvan C.
    Araujo, Mario C. U.
    JOURNAL OF CHEMOMETRICS, 2020, 34 (12)
  • [47] Variable selection in discriminant analysis: measuring the influence of individual cases
    Steel, SJ
    Louw, N
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2001, 37 (02) : 249 - 260
  • [48] Stopping rule for variable selection using stepwise discriminant analysis
    Munita, C. S.
    Barroso, L. P.
    Oliveira, P. M. S.
    JOURNAL OF RADIOANALYTICAL AND NUCLEAR CHEMISTRY, 2006, 269 (02) : 335 - 338
  • [49] Multivariate fault isolation via variable selection in discriminant analysis
    Kuang, Te-Hui
    Yan, Zhengbing
    Yao, Yuan
    JOURNAL OF PROCESS CONTROL, 2015, 35 : 30 - 40
  • [50] Exact and approximate algorithms for variable selection in linear discriminant analysis
    Brusco, Michael J.
    Steinley, Douglas
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2011, 55 (01) : 123 - 131