Unlabeled sample compression schemes and corner peelings for ample and maximum classes

作者:

Highlights:

摘要

We examine connections between combinatorial notions that arise in machine learning and topological notions in cubical/simplicial geometry. These connections enable to export results from geometry to machine learning. Our first main result is based on a geometric construction by Tracy Hall (2004) [20] of a partial shelling of the cross-polytope which can not be extended. From it, we derive a maximum class of VC dimension 3 without corners. This refutes several previous works in machine learning. In particular, it implies that the previous constructions of optimal unlabeled sample compression schemes for maximum classes are erroneous. On the positive side we present a new construction of an optimal unlabeled sample compression scheme for maximum classes. We leave as open whether our unlabeled sample compression scheme extends to ample classes, which generalize maximum classes. Towards resolving this question, we provide a geometric characterization in terms of unique sink orientations of the associated 1-inclusion graph.

论文关键词:VC-dimension,Sample compression,Sauer-Shelah-Perles Lemma,Sandwich Lemma,Maximum class,Ample class,Extremal class,Corner peeling,Unique sink orientation

论文评审过程:Received 17 December 2020, Revised 17 January 2022, Accepted 23 January 2022, Available online 3 February 2022, Version of Record 17 February 2022.

论文官网地址:https://doi.org/10.1016/j.jcss.2022.01.003