Fast rates for noisy interpolation require rethinking the effects of inductive bias

03/07/2022
by   Konstantin Donhauser, et al.
5

Good generalization performance on high-dimensional data crucially hinges on a simple structure of the ground truth and a corresponding strong inductive bias of the estimator. Even though this intuition is valid for regularized models, in this paper we caution against a strong inductive bias for interpolation in the presence of noise: Our results suggest that, while a stronger inductive bias encourages a simpler structure that is more aligned with the ground truth, it also increases the detrimental effect of noise. Specifically, for both linear regression and classification with a sparse ground truth, we prove that minimum ℓ_p-norm and maximum ℓ_p-margin interpolators achieve fast polynomial rates up to order 1/n for p > 1 compared to a logarithmic rate for p = 1. Finally, we provide experimental evidence that this trade-off may also play a crucial role in understanding non-linear interpolating models used in practice.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/18/2023

Strong inductive biases provably prevent harmless interpolation

Classical wisdom suggests that estimators should avoid fitting noise to ...
research
07/08/2022

A law of adversarial risk, interpolation, and label noise

In supervised learning, it has been shown that label noise in the data c...
research
06/04/2020

Inject Machine Learning into Significance Test for Misspecified Linear Models

Due to its strong interpretability, linear regression is widely used in ...
research
05/04/2021

Towards Error Measures which Influence a Learners Inductive Bias to the Ground Truth

Artificial intelligence is applied in a range of sectors, and is relied ...
research
04/03/2020

Orthogonal Inductive Matrix Completion

We propose orthogonal inductive matrix completion (OMIC), an interpretab...
research
06/10/2022

Intrinsic dimensionality and generalization properties of the ℛ-norm inductive bias

We study the structural and statistical properties of ℛ-norm minimizing ...
research
12/13/2021

Variational autoencoders in the presence of low-dimensional data: landscape and implicit bias

Variational Autoencoders (VAEs) are one of the most commonly used genera...

Please sign up or login with your details

Forgot password? Click here to reset