Penalized classification using Fisher's linear discriminant

被引:264
|
作者
Witten, Daniela M. [1 ]
Tibshirani, Robert [2 ]
机构
[1] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
[2] Stanford Univ, Stanford, CA 94305 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
Classification; Feature selection; High dimensional problems; Lasso; Linear discriminant analysis; Supervised learning; MULTICLASS CANCER-DIAGNOSIS; SHRUNKEN CENTROIDS; VARIABLE SELECTION; OPTIMIZATION; PREDICTION; REGRESSION;
D O I
10.1111/j.1467-9868.2011.00783.x
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider the supervised classification setting, in which the data consist of p features measured on n observations, each of which belongs to one of K classes. Linear discriminant analysis (LDA) is a classical method for this problem. However, in the high dimensional setting where p >> n, LDA is not appropriate for two reasons. First, the standard estimate for the within-class covariance matrix is singular, and so the usual discriminant rule cannot be applied. Second, when p is large, it is difficult to interpret the classification rule that is obtained from LDA, since it involves all p features. We propose penalized LDA, which is a general approach for penalizing the discriminant vectors in Fisher's discriminant problem in a way that leads to greater interpretability. The discriminant problem is not convex, so we use a minorization-maximization approach to optimize it efficiently when convex penalties are applied to the discriminant vectors. In particular, we consider the use of L-1 and fused lasso penalties. Our proposal is equivalent to recasting Fisher's discriminant problem as a biconvex problem. We evaluate the performances of the resulting methods on a simulation study, and on three gene expression data sets. We also survey past methods for extending LDA to the high dimensional setting and explore their relationships with our proposal.
引用
收藏
页码:753 / 772
页数:20
相关论文
共 50 条
  • [21] Infrared Point Target Detection with Fisher Linear Discriminant and Kernel Fisher Linear Discriminant
    Ruiming Liu
    Hongliang Zhi
    Journal of Infrared, Millimeter, and Terahertz Waves, 2010, 31 : 1491 - 1502
  • [22] Detection of human physiological state change using Fisher's linear discriminant
    Yu, L
    Hoover, A
    Muth, E
    Foundations of Augmented Cognition, Vol 11, 2005, : 284 - 292
  • [23] ADHERENTLY PENALIZED LINEAR DISCRIMINANT ANALYSIS
    Hino, Hideitsu
    Fujiki, Jun
    JOURNAL JAPANESE SOCIETY OF COMPUTATIONAL STATISTICS, 2015, 28 (01): : 125 - 137
  • [24] ON EXTENSIONS TO FISHER'S LINEAR DISCRIMINANT FUNCTION.
    Longstaff, Ian D.
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 1987, PAMI-9 (02) : 321 - 325
  • [25] Fisher's linear discriminant embedded metric learning
    Guo, Yiwen
    Ding, Xiaoqing
    Fang, Chi
    Xue, Jing-Hao
    NEUROCOMPUTING, 2014, 143 : 7 - 13
  • [26] Face recognition using recursive Fisher linear discriminant
    Xiang, C.
    Fan, X. A.
    Lee, T. H.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2006, 15 (08) : 2097 - 2105
  • [27] Support vector machine classification for large datasets using decision tree and Fisher linear discriminant
    Lopez Chau, Asdrubal
    Li, Xiaoou
    Yu, Wen
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2014, 36 : 57 - 65
  • [28] Face recognition using recursive Fisher Linear Discriminant
    Xiang, C
    Fan, XA
    Lee, TH
    2004 INTERNATIONAL CONFERENCE ON COMMUNICATION, CIRCUITS, AND SYSTEMS, VOLS 1 AND 2: VOL 1: COMMUNICATION THEORY AND SYSTEMS - VOL 2: SIGNAL PROCESSING, CIRCUITS AND SYSTEMS, 2004, : 800 - 804
  • [29] Gabor feature based classification using the enhanced Fisher linear discriminant model for face recognition
    Liu, CJ
    Wechsler, H
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2002, 11 (04) : 467 - 476
  • [30] Electrocorticogram Classification Based on Wavelet Variance and Fisher Linear Discriminant Analysis
    Yan, Shiyu
    Wang, Hong
    Liu, Chong
    Zhao, Haibin
    2015 27TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2015, : 5404 - 5408