Some statistical bounds for the accuracy of distance-based pattern classification

作者:

Highlights:

摘要

Before implementing a pattern recognition algorithm, a rational step is to estimate its validity by bounding the probability of error. The ability to make such an estimate impacts crucially on the satisfactoriness of the particular features used, on the number of samples required to train and test the system and on the overall paradigm. This study develops statistical upper and lower bounds for estimating the probability of error, in the one-dimensional case. The bounds are distribution-free except for requiring the existence of the relevant statistics and can be evaluated easily by hand or by computer. Many of the results are also applicable to other problems involving the estimation of an arbitrary distribution of a random variable. Some multidimensional generalizations may be feasible.

论文关键词:Classification error probability,Distribution-free inference,Distribution bounds,Confidence limits,Moments

论文评审过程:Received 31 May 1978, Revised 30 August 1978, Available online 19 May 2003.

论文官网地址:https://doi.org/10.1016/0031-3203(79)90055-4