Near-Optimal Bounds for Learning Gaussian Halfspaces with Random Classification Noise

07/13/2023
by Ilias Diakonikolas, et al.

We study the problem of learning general (i.e., not necessarily homogeneous) halfspaces with Random Classification Noise under the Gaussian distribution. We establish nearly-matching algorithmic and Statistical Query (SQ) lower bound results revealing a surprising information-computation gap for this basic problem. Specifically, the sample complexity of this learning problem is Θ̃(d/ϵ), where d is the dimension and ϵ is the excess error. Our positive result is a computationally efficient learning algorithm with sample complexity Õ(d/ϵ + d/(max{p, ϵ})^2), where p quantifies the bias of the target halfspace. On the lower bound side, we show that any efficient SQ algorithm (or low-degree test) for the problem requires sample complexity at least Ω(d^{1/2}/(max{p, ϵ})^2). Our lower bound suggests that this quadratic dependence on 1/ϵ is inherent for efficient algorithms.
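To get a feel for the size of the gap, the three bounds quoted in the abstract can be evaluated numerically. The following is only a rough sketch: it drops all constants and logarithmic factors hidden by the asymptotic notation, the function names are hypothetical, and the values chosen for d, ϵ, and p are arbitrary illustrations rather than figures from the paper.

```python
import math

# All functions ignore constants and log factors; they only
# reproduce the leading terms stated in the abstract.

def info_theoretic_samples(d, eps):
    """Leading term of the sample complexity of the problem: d / eps."""
    return d / eps

def efficient_upper_bound(d, eps, p):
    """Leading terms for the efficient algorithm: d/eps + d/max(p, eps)^2."""
    return d / eps + d / max(p, eps) ** 2

def sq_lower_bound(d, eps, p):
    """Leading term of the SQ lower bound: sqrt(d) / max(p, eps)^2."""
    return math.sqrt(d) / max(p, eps) ** 2

# Illustrative (assumed) parameter values: a fixed dimension and target
# excess error, with the bias parameter p varied.
d, eps = 100, 0.01
for p in (0.5, 0.1, 0.01):
    print(
        f"p={p}: info={info_theoretic_samples(d, eps):.0f}, "
        f"efficient={efficient_upper_bound(d, eps, p):.0f}, "
        f"sq_lower={sq_lower_bound(d, eps, p):.0f}"
    )
```

In this sketch the d/ϵ term dominates the efficient bound when p is large, while for p at or below ϵ the 1/ϵ^2 term takes over, which is the regime where the abstract's quadratic dependence on 1/ϵ appears.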


research
06/28/2023

Information-Computation Tradeoffs for Learning Margin Halfspaces with Random Classification Noise

We study the problem of PAC learning γ-margin halfspaces with Random Cla...
research
08/19/2021

Threshold Phenomena in Learning Halfspaces with Massart Noise

We study the problem of PAC learning halfspaces on ℝ^d with Massart nois...
research
03/28/2022

A super-polynomial lower bound for learning nonparametric mixtures

We study the problem of learning nonparametric distributions in a finite...
research
07/22/2021

Learning Sparse Fixed-Structure Gaussian Bayesian Networks

Gaussian Bayesian networks (a.k.a. linear Gaussian structural equation m...
research
05/24/2018

Learning convex polytopes with margin

We present a near-optimal algorithm for properly learning convex polytop...
research
10/27/2021

Provable Lifelong Learning of Representations

In lifelong learning, the tasks (or classes) to be learned arrive sequen...
research
05/30/2022

Data Banzhaf: A Data Valuation Framework with Maximal Robustness to Learning Stochasticity

This paper studies the robustness of data valuation to noisy model perfo...
