Prediction, Learning, Uniform Convergence, and Scale-Sensitive Dimensions

作者:

Highlights:

摘要

We present a new general-purpose algorithm for learning classes of [0, 1]-valued functions in a generalization of the prediction model and prove a general upper bound on the expected absolute error of this algorithm in terms of a scale-sensitive generalization of the Vapnik dimension proposed by Alon, Ben-David, Cesa-Bianchi, and Haussler. We give lower bounds implying that our upper bounds cannot be improved by more than a constant factor in general. We apply this result, together with techniques due to Haussler and to Benedek and Itai, to obtain new upper bounds on packing numbers in terms of this scale-sensitive notion of dimension. Using a different technique, we obtain new bounds on packing numbers in terms of Kearns and Schapire's fat-shattering function. We show how to apply both packing bounds to obtain improved general bounds on the sample complexity of agnostic learning. For eachε>0, we establish weaker sufficient and stronger necessary conditions for a class of [0, 1]-valued functions to be agnostically learnable to withinεand to be anε-uniform Glivenko–Cantelli class.

论文关键词:

论文评审过程:Received 2 November 1995, Revised 24 March 1997, Available online 25 May 2002.

论文官网地址:https://doi.org/10.1006/jcss.1997.1557