Algorithms and SQ Lower Bounds for PAC Learning One-Hidden-Layer ReLU Networks

06/22/2020
by   Ilias Diakonikolas, et al.
0

We study the problem of PAC learning one-hidden-layer ReLU networks with k hidden units on ℝ^d under Gaussian marginals in the presence of additive label noise. For the case of positive coefficients, we give the first polynomial-time algorithm for this learning problem for k up to Õ(√(log d)). Previously, no polynomial time algorithm was known, even for k=3. This answers an open question posed by <cit.>. Importantly, our algorithm does not require any assumptions about the rank of the weight matrix and its complexity is independent of its condition number. On the negative side, for the more general task of PAC learning one-hidden-layer ReLU networks with arbitrary real coefficients, we prove a Statistical Query lower bound of d^Ω(k). Thus, we provide a separation between the two classes in terms of efficient learnability. Our upper and lower bounds are general, extending to broader families of activation functions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/10/2022

Hardness of Noise-Free Learning for Two-Hidden-Layer Neural Networks

We give superpolynomial statistical query (SQ) lower bounds for learning...
research
05/31/2022

Learning (Very) Simple Generative Models Is Hard

Motivated by the recent empirical successes of deep generative models, w...
research
11/05/2018

Learning Two Layer Rectified Neural Networks in Polynomial Time

Consider the following fundamental learning problem: given input example...
research
06/29/2020

Statistical-Query Lower Bounds via Functional Gradients

We give the first statistical-query lower bounds for agnostically learni...
research
07/24/2023

Efficiently Learning One-Hidden-Layer ReLU Networks via Schur Polynomials

We study the problem of PAC learning a linear combination of k ReLU acti...
research
10/18/2022

SQ Lower Bounds for Learning Single Neurons with Massart Noise

We study the problem of PAC learning a single neuron in the presence of ...
research
09/18/2017

Learning Depth-Three Neural Networks in Polynomial Time

We give a polynomial-time algorithm for learning neural networks with on...

Please sign up or login with your details

Forgot password? Click here to reset