Information theoretic limits of learning a sparse rule

06/19/2020
by   Clément Luneau, et al.
0

We consider generalized linear models in regimes where the number of nonzerocomponents of the signal and accessible data points are sublinear with respect to the size of the signal. We prove a variational formula for the asymptotic mutual information per sample when the system size grows to infinity. This result allows us to heuristically derive an expression for the minimum mean-square error (MMSE)of the Bayesian estimator. We then find that, for discrete signals and suitable vanishing scalings of the sparsity and sampling rate, the MMSE displays an all-or-nothing phenomenon, namely, the MMSE sharply jumps from its maximum value to zero at a critical sampling rate. The all-or-nothing phenomenon has recently been proved to occur in high-dimensional linear regression. Our analysis goes beyond the linear case and applies to learning the weights of a perceptron with general activation function in a teacher-student scenario. In particular we discuss an all-or-nothing phenomenon for the generalization error with a sublinear set of training examples.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/27/2021

The fundamental limits of sparse linear regression with sublinear sparsity

We establish exact asymptotic expressions for the normalized mutual info...
research
05/07/2017

Performance Limits on the Classification of Kronecker-structured Models

Kronecker-structured (K-S) models recently have been proposed for the ef...
research
07/15/2019

Concentration of the matrix-valued minimum mean-square error in optimal Bayesian inference

We consider Bayesian inference of signals with vector-valued entries. Ex...
research
11/12/2019

0-1 phase transitions in sparse spiked matrix estimation

We consider statistical models of estimation of a rank-one matrix (the s...
research
01/25/2021

On maximum-likelihood estimation in the all-or-nothing regime

We study the problem of estimating a rank-1 additive deformation of a Ga...
research
10/25/2021

Maximum Correntropy Criterion Regression models with tending-to-zero scale parameters

Maximum correntropy criterion regression (MCCR) models have been well st...
research
12/04/2019

Landscape Complexity for the Empirical Risk of Generalized Linear Models

We present a method to obtain the average and the typical value of the n...

Please sign up or login with your details

Forgot password? Click here to reset