On Random Subset Generalization Error Bounds and the Stochastic Gradient Langevin Dynamics Algorithm

10/21/2020
by   Borja Rodríguez Gálvez, et al.
8

In this work, we unify several expected generalization error bounds based on random subsets using the framework developed by Hellström and Durisi [1]. First, we recover the bounds based on the individual sample mutual information from Bu et al. [2] and on a random subset of the dataset from Negrea et al. [3]. Then, we introduce their new, analogous bounds in the randomized subsample setting from Steinke and Zakynthinou [4], and we identify some limitations of the framework. Finally, we extend the bounds from Haghifam et al. [5] for Langevin dynamics to stochastic gradient Langevin dynamics and we refine them for loss functions with potentially large gradient norms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/27/2020

Sharpened Generalization Bounds based on Conditional Mutual Information and an Application to Noisy, Iterative Algorithms

The information-theoretic framework of Russo and J. Zou (2016) and Xu an...
research
01/09/2022

Stability Based Generalization Bounds for Exponential Family Langevin Dynamics

We study generalization bounds for noisy stochastic mini-batch iterative...
research
01/22/2021

Tighter expected generalization error bounds via Wasserstein distance

In this work, we introduce several expected generalization error bounds ...
research
02/02/2019

On Generalization Error Bounds of Noisy Gradient Methods for Non-Convex Learning

Generalization error (also known as the out-of-sample error) measures ho...
research
04/30/2019

Hitting Time of Stochastic Gradient Langevin Dynamics to Stationary Points: A Direct Analysis

Stochastic gradient Langevin dynamics (SGLD) is a fundamental algorithm ...
research
09/22/2022

Evaluating undercounts in epidemics: response to Maruotti et al. 2022

Maruotti et al. 2022 used a mark-recapture approach to estimate bounds o...
research
05/16/2020

Generalization Bounds via Information Density and Conditional Information Density

We present a general approach, based on an exponential inequality, to de...

Please sign up or login with your details

Forgot password? Click here to reset