Supervised principal component analysis: Visualization, classification and regression on subspaces and submanifolds

作者:

Highlights:

摘要

We propose “supervised principal component analysis (supervised PCA)”, a generalization of PCA that is uniquely effective for regression and classification problems with high-dimensional input data. It works by estimating a sequence of principal components that have maximal dependence on the response variable. The proposed supervised PCA is solvable in closed-form, and has a dual formulation that significantly reduces the computational complexity of problems in which the number of predictors greatly exceeds the number of observations (such as DNA microarray experiments). Furthermore, we show how the algorithm can be kernelized, which makes it applicable to non-linear dimensionality reduction tasks. Experimental results on various visualization, classification and regression problems show significant improvement over other supervised approaches both in accuracy and computational efficiency.

论文关键词:Dimensionality reduction,Principal component analysis (PCA),Kernel methods,Supervised learning,Visualization,Classification,Regression

论文评审过程:Received 20 April 2010, Revised 7 December 2010, Accepted 21 December 2010, Available online 29 December 2010.

论文官网地址:https://doi.org/10.1016/j.patcog.2010.12.015