Supermodular f-divergences and bounds on lossy compression and generalization error with mutual f-information

06/21/2022
by   Saeed Masiha, et al.
0

In this paper, we introduce super-modular -divergences and provide three applications for them: (i) we introduce Sanov's upper bound on the tail probability of sum of independent random variables based on super-modular -divergence and show that our generalized Sanov's bound strictly improves over ordinary one, (ii) we consider the lossy compression problem which studies the set of achievable rates for a given distortion and code length. We extend the rate-distortion function using mutual -information and provide new and strictly better bounds on achievable rates in the finite blocklength regime using super-modular -divergences, and (iii) we provide a connection between the generalization error of algorithms with bounded input/output mutual -information and a generalized rate-distortion problem. This connection allows us to bound the generalization error of learning algorithms using lower bounds on the rate-distortion function. Our bound is based on a new lower bound on the rate-distortion function that (for some examples) strictly improves over previously best-known bounds. Moreover, super-modular -divergences are utilized to reduce the dimension of the problem and obtain single-letter bounds.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/10/2021

Learning under Distribution Mismatch and Model Misspecification

We study learning algorithms when there is a mismatch between the distri...
research
04/15/2019

Exact Rate-Distortion in Autoencoders via Echo Noise

Compression is at the heart of effective representation learning. Howeve...
research
03/04/2022

Rate-Distortion Theoretic Generalization Bounds for Stochastic Learning Algorithms

Understanding generalization in modern machine learning settings has bee...
research
01/09/2019

Generalized Deduplication: Bounds, Convergence, and Asymptotic Properties

We study a generalization of deduplication, which enables lossless dedup...
research
05/10/2021

Neural Computation of Capacity Region of Memoryless Multiple Access Channels

This paper provides a numerical framework for computing the achievable r...
research
03/09/2023

Data-dependent Generalization Bounds via Variable-Size Compressibility

In this paper, we establish novel data-dependent upper bounds on the gen...
research
02/10/2022

Generalization Bounds via Convex Analysis

Since the celebrated works of Russo and Zou (2016,2019) and Xu and Ragin...

Please sign up or login with your details

Forgot password? Click here to reset