Supervised principal component analysis: Visualization, classification and regression on subspaces and submanifolds

被引:209
|
作者
Barshan, Elnaz [1 ,3 ]
Ghodsi, Ali [2 ]
Azimifar, Zohreh [1 ,3 ]
Jahromi, Mansoor Zolghadri [1 ,3 ]
机构
[1] Shiraz Univ, Dept IT & Comp Engn, Sch Elect & Comp Engn, Shiraz, Iran
[2] Univ Waterloo, Dept Stat & Actuarial Sci, Sch Comp Sci, Waterloo, ON N2L 3G1, Canada
[3] Shiraz Univ, Dept Comp Sci & Engn, Shiraz, Iran
关键词
Dimensionality reduction; Principal component analysis (PCA); Kernel methods; Supervised learning; Visualization; Classification; Regression; SLICED INVERSE REGRESSION; DIMENSION REDUCTION; PREDICTION;
D O I
10.1016/j.patcog.2010.12.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose "supervised principal component analysis (supervised PCA)", a generalization of PCA that is uniquely effective for regression and classification problems with high-dimensional input data. It works by estimating a sequence of principal components that have maximal dependence on the response variable. The proposed supervised PCA is solvable in closed-form, and has a dual formulation that significantly reduces the computational complexity of problems in which the number of predictors greatly exceeds the number of observations (such as DNA microarray experiments). Furthermore, we show how the algorithm can be kernelized, which makes it applicable to non-linear dimensionality reduction tasks. Experimental results on various visualization, classification and regression problems show significant improvement over other supervised approaches both in accuracy and computational efficiency. (c) 2011 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1357 / 1371
页数:15
相关论文
共 50 条