Prediction, Learning, Uniform Convergence, and Scale-sensitive Dimensions

04/21/2023
by   Peter L. Bartlett, et al.
0

We present a new general-purpose algorithm for learning classes of [0,1]-valued functions in a generalization of the prediction model, and prove a general upper bound on the expected absolute error of this algorithm in terms of a scale-sensitive generalization of the Vapnik dimension proposed by Alon, Ben-David, Cesa-Bianchi and Haussler. We give lower bounds implying that our upper bounds cannot be improved by more than a constant factor in general. We apply this result, together with techniques due to Haussler and to Benedek and Itai, to obtain new upper bounds on packing numbers in terms of this scale-sensitive notion of dimension. Using a different technique, we obtain new bounds on packing numbers in terms of Kearns and Schapire's fat-shattering function. We show how to apply both packing bounds to obtain improved general bounds on the sample complexity of agnostic learning. For each ϵ > 0, we establish weaker sufficient and stronger necessary conditions for a class of [0,1]-valued functions to be agnostically learnable to within ϵ, and to be an ϵ-uniform Glivenko-Cantelli class. This is a manuscript that was accepted by JCSS, together with a correction.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/13/2023

An Improved Uniform Convergence Bound with Fat-Shattering Dimension

The fat-shattering dimension characterizes the uniform convergence prope...
research
03/26/2022

Constant factor approximations for Lower and Upper bounded Clusterings

Clustering is one of the most fundamental problem in Machine Learning. R...
research
08/20/2017

A Capacity Scaling Law for Artificial Neural Networks

By assuming an ideal neural network with gating functions handling the w...
research
11/28/2017

When are epsilon-nets small?

In many interesting situations the size of epsilon-nets depends only on ...
research
12/28/2020

Minoration via Mixed Volumes and Cover's Problem for General Channels

We propose a method for establishing lower bounds on the supremum of pro...
research
05/25/2021

Prediction error quantification through probabilistic scaling – EXTENDED VERSION

In this paper, we address the probabilistic error quantification of a ge...
research
05/20/2022

Sample Complexity of Learning Heuristic Functions for Greedy-Best-First and A* Search

Greedy best-first search (GBFS) and A* search (A*) are popular algorithm...

Please sign up or login with your details

Forgot password? Click here to reset