Learning Two layer Networks with Multinomial Activation and High Thresholds

03/21/2019
by   Surbhi Goel, et al.
0

Giving provable guarantees for learning neural networks is a core challenge of machine learning theory. Most prior work gives parameter recovery guarantees for one hidden layer networks. In this work we study a two layer network where the top node instead of a sum (one layer) is a well-behaved multivariate polynomial in all its inputs. We show that if the thresholds (biases) of the first layer neurons are higher than Ω(√( d)) for d being the input dimension, then the weights are learnable under the gaussian input. Furthermore even for lower thresholds, we can learn the lowest layer using polynomial sample complexity although exponential time. As an application of our results, we give a polynomial time algorithm for learning an intersection of halfspaces that are Ω(√( d)) far from the origin for gaussian input distribution. Finally for deep networks with depth larger than two, assuming the layers two onwards can be expressed as a polynomial by simply using the taylor series, we can learn the lowest layer under the conditions required by our assumptions.

READ FULL TEXT
research
09/18/2017

Learning Depth-Three Neural Networks in Polynomial Time

We give a polynomial-time algorithm for learning neural networks with on...
research
04/30/2020

Binary autoencoder with random binary weights

Here is presented an analysis of an autoencoder with binary activations ...
research
04/08/2022

Learning Polynomial Transformations

We consider the problem of learning high dimensional polynomial transfor...
research
11/05/2018

Learning Two Layer Rectified Neural Networks in Polynomial Time

Consider the following fundamental learning problem: given input example...
research
04/20/2023

Learning Narrow One-Hidden-Layer ReLU Networks

We consider the well-studied problem of learning a linear combination of...
research
11/22/2019

Neural Networks Learning and Memorization with (almost) no Over-Parameterization

Many results in recent years established polynomial time learnability of...
research
06/02/2017

Learning causal Bayes networks using interventional path queries in polynomial time and sample complexity

Causal discovery from empirical data is a fundamental problem in many sc...

Please sign up or login with your details

Forgot password? Click here to reset