Learning (Very) Simple Generative Models Is Hard

05/31/2022
by Sitan Chen, et al.

Motivated by the recent empirical successes of deep generative models, we study the computational complexity of the following unsupervised learning problem. For an unknown neural network F:ℝ^d→ℝ^d', let D be the distribution over ℝ^d' given by pushing the standard Gaussian 𝒩(0,Id_d) through F. Given i.i.d. samples from D, the goal is to output any distribution close to D in statistical distance. We show under the statistical query (SQ) model that no polynomial-time algorithm can solve this problem even when the output coordinates of F are one-hidden-layer ReLU networks with log(d) neurons. Previously, the best lower bounds for this problem simply followed from lower bounds for supervised learning and required at least two hidden layers and poly(d) neurons [Daniely-Vardi '21, Chen-Gollakota-Klivans-Meka '22]. The key ingredient in our proof is an ODE-based construction of a compactly supported, piecewise-linear function f with polynomially-bounded slopes such that the pushforward of 𝒩(0,1) under f matches all low-degree moments of 𝒩(0,1).
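To make the problem setup concrete, here is a minimal sketch of how samples from such a distribution D could be drawn. It is not the paper's construction: the random weight initialization, the presence of bias terms, the base-2 logarithm for the neuron count, and all function names (make_network, apply_network, sample_pushforward) are illustrative assumptions. It only shows the shape of the generative model described above, where each output coordinate of F is a one-hidden-layer ReLU network with about log(d) neurons.

```python
import math
import random


def relu(x):
    """Rectified linear unit."""
    return x if x > 0.0 else 0.0


def make_network(d, d_out, rng):
    """Build a random F: R^d -> R^d_out (hypothetical helper).

    Each output coordinate is its own one-hidden-layer ReLU network with
    k = ceil(log2(d)) hidden neurons. The base of the logarithm and the
    Gaussian initialization are assumptions made for illustration.
    """
    k = max(1, math.ceil(math.log2(d)))
    net = []
    for _ in range(d_out):
        weights = [[rng.gauss(0.0, 1.0) / math.sqrt(d) for _ in range(d)]
                   for _ in range(k)]
        biases = [rng.gauss(0.0, 1.0) for _ in range(k)]
        outer = [rng.gauss(0.0, 1.0) for _ in range(k)]
        net.append((weights, biases, outer))
    return net


def apply_network(net, z):
    """Evaluate F(z), one output coordinate at a time."""
    out = []
    for weights, biases, outer in net:
        hidden = [relu(sum(w_i * z_i for w_i, z_i in zip(w, z)) + b)
                  for w, b in zip(weights, biases)]
        out.append(sum(a * h for a, h in zip(outer, hidden)))
    return out


def sample_pushforward(net, d, n, rng):
    """Draw n i.i.d. samples from D: push z ~ N(0, Id_d) through F."""
    return [apply_network(net, [rng.gauss(0.0, 1.0) for _ in range(d)])
            for _ in range(n)]


rng = random.Random(0)
d, d_out = 16, 4
net = make_network(d, d_out, rng)
samples = sample_pushforward(net, d, 5, rng)
```

Generating samples this way is easy. The hardness result concerns the reverse direction: given only such samples, outputting any distribution close to D in statistical distance.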

research
06/22/2020

Algorithms and SQ Lower Bounds for PAC Learning One-Hidden-Layer ReLU Networks

We study the problem of PAC learning one-hidden-layer ReLU networks with...
research
06/29/2020

Near-Optimal SQ Lower Bounds for Agnostically Learning Halfspaces and ReLUs under Gaussian Marginals

We study the fundamental problems of agnostically learning halfspaces an...
research
04/08/2022

Learning Polynomial Transformations

We consider the problem of learning high dimensional polynomial transfor...
research
06/22/2023

SQ Lower Bounds for Learning Bounded Covariance GMMs

We study the complexity of learning mixtures of separated Gaussians with...
research
10/18/2022

SQ Lower Bounds for Learning Single Neurons with Massart Noise

We study the problem of PAC learning a single neuron in the presence of ...
research
06/29/2020

Statistical-Query Lower Bounds via Functional Gradients

We give the first statistical-query lower bounds for agnostically learni...
research
07/24/2023

Efficiently Learning One-Hidden-Layer ReLU Networks via Schur Polynomials

We study the problem of PAC learning a linear combination of k ReLU acti...
