Risk Bounds for the Majority Vote: From a PAC-Bayesian Analysis to a Learning Algorithm

03/28/2015
by Pascal Germain, et al.

We present an extensive analysis of the behavior of majority votes in binary classification. In particular, we introduce a risk bound for majority votes, called the C-bound, that takes into account the average quality of the voters and their average disagreement. We also provide an extensive PAC-Bayesian analysis that shows how the C-bound can be estimated from various observations contained in the training data. The analysis is intended to be self-contained and can be used as introductory material to PAC-Bayesian statistical learning theory. It starts from a general PAC-Bayesian perspective and ends with uncommon PAC-Bayesian bounds. Some of these bounds contain no Kullback-Leibler divergence, and others allow kernel functions to be used as voters (via the sample compression setting). Finally, building on this analysis, we propose the MinCq learning algorithm, which essentially minimizes the C-bound. MinCq reduces to a simple quadratic program. Aside from being theoretically grounded, MinCq achieves state-of-the-art performance, as shown in our extensive empirical comparison with both AdaBoost and the Support Vector Machine.
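As a rough illustration of how the C-bound combines voter quality and disagreement, the sketch below computes its empirical value on a small sample. It assumes the commonly stated form of the bound, 1 - (1 - 2r)^2 / (1 - 2d), where r is the Gibbs risk (the weighted average error of the voters) and d is their expected disagreement, and it assumes voters that output +1 or -1. The function name `c_bound` and the toy data are hypothetical, and the value computed here is only a sample estimate, not the PAC-Bayesian guarantee derived in the paper.

```python
def c_bound(votes, labels, weights):
    """Empirical C-bound of a weighted majority vote (illustrative sketch).

    votes[i][j] is the output (+1 or -1) of voter j on example i,
    labels[i] is the true label (+1 or -1), and weights are
    non-negative and sum to 1.
    """
    n = len(labels)
    first_moment = 0.0   # sample mean of the margin y * sum_j q_j * h_j(x)
    second_moment = 0.0  # sample mean of the squared margin
    for row, y in zip(votes, labels):
        margin = y * sum(q * h for q, h in zip(weights, row))
        first_moment += margin / n
        second_moment += margin ** 2 / n

    # For +1/-1 voters, both quantities follow from the margin moments.
    gibbs_risk = (1.0 - first_moment) / 2.0
    disagreement = (1.0 - second_moment) / 2.0

    # The bound is only meaningful when the voters are, on average,
    # better than random guessing.
    if gibbs_risk >= 0.5:
        raise ValueError("C-bound requires a Gibbs risk below 1/2")

    bound = 1.0 - (1.0 - 2.0 * gibbs_risk) ** 2 / (1.0 - 2.0 * disagreement)
    return gibbs_risk, disagreement, bound


# Hypothetical toy sample: three voters, four examples, uniform weights.
labels = [1, 1, -1, -1]
votes = [
    [1, 1, -1],
    [1, -1, 1],
    [-1, -1, 1],
    [1, -1, -1],
]
weights = [1 / 3, 1 / 3, 1 / 3]

gibbs_risk, disagreement, bound = c_bound(votes, labels, weights)
print(f"Gibbs risk: {gibbs_risk:.3f}")
print(f"Disagreement: {disagreement:.3f}")
print(f"Empirical C-bound: {bound:.3f}")
```

In this toy sample the voters err on a third of the examples on average, yet they disagree often enough that the majority is right on every example, and the empirical C-bound reflects this by coming out close to zero.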

research
10/23/2016

Simpler PAC-Bayesian Bounds for Hostile Data

PAC-Bayesian learning bounds are of the utmost interest to the learning ...
research
12/11/2007

PAC-Bayesian Bounds for Randomized Empirical Risk Minimizers

The aim of this paper is to generalize the PAC-Bayesian theorems proved ...
research
04/28/2021

Self-Bounding Majority Vote Learning Algorithms by the Direct Minimization of a Tight PAC-Bayesian C-Bound

In the PAC-Bayesian literature, the C-Bound refers to an insightful rela...
research
05/31/2019

PAC-Bayes Un-Expected Bernstein Inequality

We present a new PAC-Bayesian generalization bound. Standard bounds cont...
research
02/19/2021

A PAC-Bayes Analysis of Adversarial Robustness

We propose the first general PAC-Bayesian generalization bounds for adve...
research
08/06/2014

On the Generalization of the C-Bound to Structured Output Ensemble Methods

This paper generalizes an important result from the PAC-Bayesian literat...
research
03/29/2023

PAC-Bayesian bounds for learning LTI-ss systems with input from empirical loss

In this paper we derive a Probably Approximately Correct (PAC)-Bayesian ...
