Variable selection in model-based discriminant analysis

被引:26
|
作者
Maugis, C. [1 ]
Celeux, G. [2 ]
Martin-Magniette, M-L [3 ,4 ]
机构
[1] Univ Toulouse, INSA Toulouse, Inst Math Toulouse, F-31077 Toulouse 4, France
[2] Inria Saclay Ile de France, Sophia Antipolis, France
[3] UMR AgroParisTech INRA MIA 518, Paris, France
[4] ERL CNRS 8196, UEVE, URGV UMR INRA 1165, Evry, France
关键词
Discriminant; redundant or independent variables; Variable selection; Gaussian classification models; Linear regression; BIC; CLASSIFICATION;
D O I
10.1016/j.jmva.2011.05.004
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
A general methodology for selecting predictors for Gaussian generative classification models is presented. The problem is regarded as a model selection problem. Three different roles for each possible predictor are considered: a variable can be a relevant classification predictor or not, and the irrelevant classification variables can be linearly dependent on a part of the relevant predictors or independent variables. This variable selection model was inspired by a previous work on variable selection in model-based clustering. A BIC-like model selection criterion is proposed. It is optimized through two embedded forward stepwise variable selection algorithms for classification and linear regression. The model identifiability and the consistency of the variable selection criterion are proved. Numerical experiments on simulated and real data sets illustrate the interest of this variable selection methodology. In particular, it is shown that this well ground variable selection model can be of great interest to improve the classification performance of the quadratic discriminant analysis in a high dimension context. (C) 2011 Elsevier Inc. All rights reserved.
引用
收藏
页码:1374 / 1387
页数:14
相关论文
共 50 条
  • [21] Variable selection in discriminant analysis based on Gram-Schmidt process
    Wang, Huiwen
    Chen, Meiling
    Saporta, Gilbert
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2011, 37 (08): : 958 - 961
  • [22] COMPUTATIONS FOR VARIABLE SELECTION IN DISCRIMINANT-ANALYSIS
    MCCABE, GP
    TECHNOMETRICS, 1975, 17 (01) : 103 - 109
  • [23] VARIABLE SELECTION IN HETEROSCEDASTIC DISCRIMINANT-ANALYSIS
    FATTI, LP
    HAWKINS, DM
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1986, 81 (394) : 494 - 500
  • [24] Variable selection in discriminant analysis in the presence of outliers
    Steel, SJ
    Louw, N
    ITI 2001: PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY INTERFACES, 2001, : 251 - 256
  • [25] Probabilistic model-based discriminant analysis and clustering methods in chemometrics
    Bouveyron, Charles
    JOURNAL OF CHEMOMETRICS, 2013, 27 (12) : 433 - 446
  • [26] Mixture model-based functional discriminant analysis for curve classification
    Chamroukhi, Faicel
    Glotin, Herve
    2012 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2012,
  • [27] Analysis of new variable selection methods for discriminant analysis
    Pacheco, Joaquin
    Casado, Silvia
    Nunez, Laura
    Gomez, Olga
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2006, 51 (03) : 1463 - 1478
  • [28] A simple model-based approach to variable selection in classification and clustering
    Partovi Nia, Vahid
    Davison, Anthony C.
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2015, 43 (02): : 157 - 175
  • [29] Probing for Sparse and Fast Variable Selection with Model-Based Boosting
    Thomas, Janek
    Hepp, Tobias
    Mayr, Andreas
    Bischl, Bernd
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2017, 2017
  • [30] Input variable selection for model-based production control and optimisation
    Miha Glavan
    Dejan Gradišar
    Maja Atanasijević-Kunc
    Stanko Strmčnik
    Gašper Mušič
    The International Journal of Advanced Manufacturing Technology, 2013, 68 : 2743 - 2759