Exact and approximate algorithms for variable selection in linear discriminant analysis

Cited by: 13
Authors
Brusco, Michael J. [1 ]
Steinley, Douglas [2 ]
Affiliations
[1] Florida State Univ, Coll Business, Dept Mkt, Tallahassee, FL 32306 USA
[2] Univ Missouri Columbia, Columbia, MO USA
Keywords
Linear discriminant analysis; Variable selection; Branch and bound; Tabu search; WELL-FORMULATED SUBSETS; POLYNOMIAL REGRESSION; MULTIPLE MEASUREMENTS; MULTIVARIATE-ANALYSIS; TABU SEARCH; MODELS; STEPWISE;
DOI
10.1016/j.csda.2010.05.027
Chinese Library Classification
TP39 [Computer applications];
Subject classification code
081203 ; 0835 ;
Abstract
Variable selection is a venerable problem in multivariate statistics. In the context of discriminant analysis, the goal is to select a subset of variables that accomplishes one of two objectives: (1) the provision of a parsimonious, yet descriptive, representation of group structure, or (2) the ability to correctly allocate new cases to groups. We present an exact (branch-and-bound) algorithm for variable selection in linear discriminant analysis that identifies subsets of variables that minimize Wilks' Λ. An important feature of this algorithm is a variable reordering scheme that greatly reduces computation time. We also present an approximate procedure based on tabu search, which can be implemented for a variety of objective criteria designed for either the descriptive or allocation goals associated with discriminant analysis. The tabu search heuristic is especially useful for maximizing the hit ratio (i.e., the percentage of correctly classified cases). Computational results for the proposed methods are provided for two data sets from the literature. (C) 2010 Elsevier B.V. All rights reserved.
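As a rough illustration of the criterion named in the abstract, the sketch below computes Wilks' Λ as det(W) / det(T), where W is the pooled within-group scatter matrix and T is the total scatter matrix, and then picks the subset of a given size with the smallest Λ. It assumes plain exhaustive enumeration as a stand-in; it is not the authors' branch-and-bound algorithm or their variable reordering scheme. The function names (`determinant`, `scatter`, `wilks_lambda`, `best_subset`) and the toy data are hypothetical and do not come from the paper.

```python
from itertools import combinations


def determinant(m):
    """Determinant by Gaussian elimination with partial pivoting."""
    a = [row[:] for row in m]
    n = len(a)
    det = 1.0
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(a[r][c]))
        if abs(a[p][c]) < 1e-12:
            return 0.0
        if p != c:
            a[c], a[p] = a[p], a[c]
            det = -det
        det *= a[c][c]
        for r in range(c + 1, n):
            f = a[r][c] / a[c][c]
            for k in range(c, n):
                a[r][k] -= f * a[c][k]
    return det


def scatter(rows, cols):
    """Sums of squares and cross-products about the mean, for columns in cols."""
    n = len(rows)
    means = [sum(r[j] for r in rows) / n for j in cols]
    return [[sum((r[i] - mi) * (r[j] - mj) for r in rows)
             for j, mj in zip(cols, means)]
            for i, mi in zip(cols, means)]


def wilks_lambda(X, y, cols):
    """Wilks' lambda = det(W) / det(T) for the variables in cols."""
    cols = list(cols)
    k = len(cols)
    total = scatter(X, cols)
    within = [[0.0] * k for _ in range(k)]
    for g in set(y):
        sg = scatter([x for x, lab in zip(X, y) if lab == g], cols)
        for i in range(k):
            for j in range(k):
                within[i][j] += sg[i][j]
    return determinant(within) / determinant(total)


def best_subset(X, y, size):
    """Exhaustive search (an assumption, not branch and bound) for minimum lambda."""
    p = len(X[0])
    return min(combinations(range(p), size),
               key=lambda c: wilks_lambda(X, y, c))


# Made-up toy data: two groups of three cases, three variables.
X = [[1.0, 5.0, 2.0], [1.2, 3.0, 2.5], [0.8, 4.0, 1.5],
     [5.0, 4.5, 3.0], [5.2, 3.5, 2.0], [4.8, 4.0, 3.5]]
y = [0, 0, 0, 1, 1, 1]

print(best_subset(X, y, 1))
print(wilks_lambda(X, y, [0, 2]))
```

Λ never increases when a variable is added to a subset, which is the kind of monotonicity a branch-and-bound search can exploit to prune candidate subsets; enumeration as written here is only practical for a small number of variables.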
Pages: 123-131
Page count: 9
Related papers
50 in total
  • [31] Variable selection and error rate estimation in discriminant analysis
    Le Roux, NJ
    Steel, SJ
    Louw, N
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 1997, 59 (03) : 195 - 219
  • [32] Input variable selection in kernel Fisher discriminant analysis
    Louw, N
    Steel, SJ
    FROM DATA AND INFORMATION ANALYSIS TO KNOWLEDGE ENGINEERING, 2006, : 126 - +
  • [33] Convergence Analysis on Trace Ratio Linear Discriminant Analysis Algorithms
    Ye, Qiaolin
    Yang, Jie
    Zheng, Hao
    Fu, Liyong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (02) : 3878 - 3881
  • [34] Online local learning algorithms for linear discriminant analysis
    Demir, GK
    Ozmehmet, K
    PATTERN RECOGNITION LETTERS, 2005, 26 (04) : 421 - 431
  • [35] The variable selection methods and algorithms in the multiple linear model
    Wei, Gongding
    Yu, Mingyuan
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2023, 52 (17) : 6232 - 6240
  • [36] Variable selection for linear regression in large databases: exact methods
    Pacheco, Joaquin
    Casado, Silvia
    APPLIED INTELLIGENCE, 2021, 51 (06) : 3736 - 3756
  • [38] Approximate and Exact Selection on GPUs
    Ribizel, Tobias
    Anzt, Hartwig
    2019 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2019, : 471 - 478
  • [39] Genetic algorithms combined with discriminant analysis for key variable identification
    Chiang, LH
    Pell, RJ
    JOURNAL OF PROCESS CONTROL, 2004, 14 (02) : 143 - 155
  • [40] An adapted linear discriminant analysis with variable selection for the classification in high-dimension, and an application to medical data
    Le, Khuyen T.
    Chaux, Caroline
    Richard, Frederic J. P.
    Guedj, Eric
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2020, 152 (152)