Unlabeled sample compression schemes and corner peelings for ample and maximum classes

12/05/2018
by Jérémie Chalopin, et al.

We examine connections between combinatorial notions that arise in machine learning and topological notions in cubical/simplicial geometry. These connections make it possible to export results from geometry to machine learning. Our first main result is based on a geometric construction by Tracy Hall (2004) of a partial shelling of the cross-polytope that cannot be extended. We use it to derive a maximum class of VC dimension 3 that has no corners. This refutes several previous works in machine learning from the past 11 years. In particular, it implies that all previous constructions of optimal unlabeled sample compression schemes for maximum classes are erroneous. On the positive side, we present a new construction of an unlabeled sample compression scheme for maximum classes. We leave open whether our unlabeled sample compression scheme extends to ample (a.k.a. lopsided or extremal) classes, which represent a natural and far-reaching generalization of maximum classes. Towards resolving this question, we provide a geometric characterization in terms of unique sink orientations of the 1-skeletons of associated cubical complexes.
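
For readers unfamiliar with the terminology, the following is a minimal Python sketch of what "maximum class of VC dimension d" means, using the standard definitions: a class is maximum when its size meets the Sauer-Shelah bound for its VC dimension. It is not taken from the paper; the function names (`shatters`, `vc_dimension`, `is_maximum`) and the toy class are illustrative assumptions, and the brute-force search is only practical for very small domains.

```python
from itertools import combinations, product
from math import comb


def shatters(concepts, subset):
    """True if the concepts realize all 2^|subset| 0/1 patterns on `subset`."""
    patterns = {tuple(c[i] for i in subset) for c in concepts}
    return len(patterns) == 2 ** len(subset)


def vc_dimension(concepts, n):
    """Size of the largest shattered subset of the domain {0, ..., n-1}."""
    d = 0
    for k in range(1, n + 1):
        if any(shatters(concepts, s) for s in combinations(range(n), k)):
            d = k
        else:
            # Subsets of shattered sets are shattered, so we can stop here.
            break
    return d


def is_maximum(concepts, n):
    """True if |concepts| equals the Sauer-Shelah bound sum_{i<=d} C(n, i)."""
    d = vc_dimension(concepts, n)
    bound = sum(comb(n, i) for i in range(d + 1))
    return len(concepts) == bound


# Toy class (an assumption for illustration): all 0/1 vectors of length 4
# with at most two ones.
toy_class = {v for v in product((0, 1), repeat=4) if sum(v) <= 2}

# A class that is not maximum: only the all-zeros and all-ones vectors.
two_points = {(0, 0, 0, 0), (1, 1, 1, 1)}
```

Here `is_maximum(toy_class, 4)` checks the toy class against the bound for its own VC dimension, while `two_points` falls well short of the bound. The counterexample described in the abstract is a far more intricate maximum class of VC dimension 3 and is not reproduced by this sketch.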


research
11/18/2009

A Geometric Approach to Sample Compression

The Sample Compression Conjecture of Littlestone & Warmuth has remained ...
research
11/29/2018

Unlabeled Compression Schemes Exceeding the VC-dimension

In this note we disprove a conjecture of Kuzmin and Warmuth claiming tha...
research
01/29/2014

Bounding Embeddings of VC Classes into Maximum Classes

One of the earliest conjectures in computational learning theory-the Sam...
research
10/11/2022

Unlabelled Sample Compression Schemes for Intersection-Closed Classes and Extremal Classes

The sample compressibility of concept classes plays an important role in...
research
07/22/2020

PhishZip: A New Compression-based Algorithm for Detecting Phishing Websites

Phishing has grown significantly in the past few years and is predicted ...
research
05/25/2012

Measurability Aspects of the Compactness Theorem for Sample Compression Schemes

It was proved in 1998 by Ben-David and Litman that a concept space has a...
research
07/20/2018

Optimal Bounds on the VC-dimension

The VC-dimension of a set system is a way to capture its complexity and ...
