Sign consistency for the linear programming discriminant rule
作者:
Highlights:
•
摘要
Linear discriminant analysis (LDA) is an important conventional model for data classification. Classical theory shows that LDA is Bayes consistent for a fixed data dimensionality p and a large training sample size n. However, in high-dimensional settings when p ≫ n, LDA is difficult due to the inconsistent estimation of the covariance matrix and the mean vectors of populations. Recently, a linear programming discriminant (LPD) rule was proposed for high-dimensional linear discriminant analysis, based on the sparsity assumption over the discriminant function. It is shown that the LPD rule is Bayes consistent in high-dimensional settings. In this paper, we further show that the LPD rule is sign consistent under the sparsity assumption. Such sign consistency ensures the LPD rule to select the optimal discriminative features for high-dimensional data classification problems. Evaluations on both synthetic and real data validate our result on the sign consistency of the LPD rule.
论文关键词:High-dimensional linear discriminant analysis,Sign consistency,Irrepresentability condition,Linear programming
论文评审过程:Received 1 November 2018, Revised 30 September 2019, Accepted 15 October 2019, Available online 5 November 2019, Version of Record 15 November 2019.
论文官网地址:https://doi.org/10.1016/j.patcog.2019.107083