Cohort-based kernel visualisation with scatter matrices

Abstract

Visualisation with good discrimination between data cohorts is important for exploratory data analysis and for decision support interfaces. This paper proposes a kernel extension of the cluster-based linear visualisation method described in Lisboa et al. [15]. Representing the data in dual form permits the application of the kernel trick, thereby projecting the data onto the orthonormalised cohort means in the feature space. The only parameters of the method are those of the kernel function. The method is shown to obtain well-discriminating visualisations of non-linearly separable data at low computational cost. The linearity of the visualisation was tested using nearest-neighbour and linear discriminant classifiers, which achieved significant improvements in classification accuracy over the original features, especially for high-dimensional data: 93% accuracy was obtained on the Splice-junction Gene Sequences data set from the UCI repository.
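The dual-form projection mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and it omits the scatter-matrix step named in the title; it only shows how data might be projected onto orthonormalised cohort means using kernel evaluations alone. The RBF kernel, the `gamma` value, the Cholesky-based orthonormalisation, the toy data, and all function names are assumptions made for illustration.

```python
import numpy as np

def rbf_kernel(X, Y, gamma=1.0):
    # Assumed kernel choice: k(x, y) = exp(-gamma * ||x - y||^2).
    sq = (X**2).sum(1)[:, None] + (Y**2).sum(1)[None, :] - 2.0 * X @ Y.T
    return np.exp(-gamma * np.maximum(sq, 0.0))

def fit_cohort_projection(X, y, gamma=1.0):
    """Return dual coefficients (n x C) of an orthonormal basis spanning
    the C cohort means in the kernel feature space."""
    cohorts = np.unique(y)
    # Column c of A averages the feature vectors of cohort c.
    A = (y[:, None] == cohorts[None, :]).astype(float)
    A /= A.sum(axis=0)
    K = rbf_kernel(X, X, gamma)
    G = A.T @ K @ A                 # Gram matrix of the cohort means (C x C)
    L = np.linalg.cholesky(G)       # G = L L^T; requires G positive definite
    W = np.linalg.inv(L).T          # W^T G W = I, so the new basis is orthonormal
    return A @ W

def project(X_train, coeffs, X_new, gamma=1.0):
    # Coordinates of X_new along the orthonormalised cohort means.
    return rbf_kernel(X_new, X_train, gamma) @ coeffs

# Toy data (hypothetical): an inner and an outer cohort, not linearly separable.
rng = np.random.default_rng(0)
X = rng.normal(size=(60, 2))
y = (np.linalg.norm(X, axis=1) > 1.0).astype(int)

coeffs = fit_cohort_projection(X, y, gamma=1.0)
Z = project(X, coeffs, X, gamma=1.0)   # one column per cohort
```

With two cohorts, `Z` has two columns and can be drawn directly as a scatter plot. The only tunable quantity in this sketch is the kernel parameter, which matches the abstract's statement that the method's parameters are those of the kernel function.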

Keywords: Visualisation, Discriminant analysis, Kernel method

Review history: Received 11 January 2010, Revised 15 September 2011, Accepted 21 September 2011, Available online 19 October 2011.

Official paper page: https://doi.org/10.1016/j.patcog.2011.09.025