On the Importance of Gradient Norm in PAC-Bayesian Bounds

10/12/2022
by Itai Gat et al.

Generalization bounds, which assess the difference between the true risk and the empirical risk, have been studied extensively. However, to obtain bounds, current techniques rely on strict assumptions such as a uniformly bounded or Lipschitz loss function. To avoid these assumptions, in this paper we follow an alternative approach: we relax the uniform boundedness assumptions by instead assuming an on-average bounded loss and an on-average bounded gradient norm. Following this relaxation, we propose a new generalization bound that exploits the contractivity of the log-Sobolev inequalities. These inequalities add a loss-gradient norm term to the generalization bound, which is intuitively a surrogate for the model complexity. We apply the proposed bound to Bayesian deep nets and empirically analyze the effect of this new loss-gradient norm term on different neural architectures.
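
For orientation, the classical Gaussian log-Sobolev inequality gives a sense of why a gradient-norm term can enter such a bound. What follows is the standard textbook form for the standard Gaussian measure, not necessarily the exact inequality used in the paper; the symbols f, \gamma, and d are introduced here purely for illustration.

```latex
% Gaussian log-Sobolev inequality (textbook form; illustrative only).
% \gamma: standard Gaussian measure on R^d; f: a smooth function on R^d.
\mathrm{Ent}_\gamma(f^2)
  := \mathbb{E}_\gamma\!\left[f^2 \log f^2\right]
   - \mathbb{E}_\gamma\!\left[f^2\right] \log \mathbb{E}_\gamma\!\left[f^2\right]
  \;\le\; 2\, \mathbb{E}_\gamma\!\left[\lVert \nabla f \rVert^2\right]
```

The right-hand side depends on f only through its expected squared gradient norm, so the control is on average and does not require a uniform Lipschitz constant.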


research
02/23/2020

On the generalization of Bayesian deep nets for multi-class classification

Generalization bounds which assess the difference between the true risk ...
research
10/16/2020

Failures of model-dependent generalization bounds for least-norm interpolation

We consider bounds on the generalization performance of the least-norm l...
research
10/07/2015

Efficient Per-Example Gradient Computations

This technical report describes an efficient technique for computing the...
research
01/29/2020

A Class of Lower Bounds for Bayesian Risk with a Bregman Loss

A general class of Bayesian lower bounds when the underlying loss functi...
research
02/12/2018

Dimension-free PAC-Bayesian bounds for the estimation of the mean of a random vector

In this paper, we present a new estimator of the mean of a random vector...
research
05/27/2022

Generalization Bounds for Gradient Methods via Discrete and Continuous Prior

Proving algorithm-dependent generalization error bounds for gradient-typ...
research
01/07/2022

The Green's function of the Lax-Wendroff and Beam-Warming schemes

We prove a sharp uniform generalized Gaussian bound for the Green's func...
