Achieving Small Test Error in Mildly Overparameterized Neural Networks

04/24/2021
by   Shiyu Liang, et al.
0

Recent theoretical works on over-parameterized neural nets have focused on two aspects: optimization and generalization. Many existing works that study optimization and generalization together are based on neural tangent kernel and require a very large width. In this work, we are interested in the following question: for a binary classification problem with two-layer mildly over-parameterized ReLU network, can we find a point with small test error in polynomial time? We first show that the landscape of loss functions with explicit regularization has the following property: all local minima and certain other points which are only stationary in certain directions achieve small test error. We then prove that for convolutional neural nets, there is an algorithm which finds one of these points in polynomial time (in the input dimension and the number of data points). In addition, we prove that for a fully connected neural net, with an additional assumption on the data distribution, there is a polynomial time algorithm.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/27/2018

Complexity of Training ReLU Neural Network

In this paper, we explore some basic questions on the complexity of trai...
research
11/28/2021

Generalization Performance of Empirical Risk Minimization on Over-parameterized Deep ReLU Nets

In this paper, we study the generalization performance of global minima ...
research
06/10/2020

All Local Minima are Global for Two-Layer ReLU Neural Networks: The Hidden Convex Optimization Landscape

We are interested in two-layer ReLU neural networks from an optimization...
research
08/19/2017

Unbounded product-form Petri nets

Computing steady-state distributions in infinite-state stochastic system...
research
05/18/2021

The Computational Complexity of ReLU Network Training Parameterized by Data Dimensionality

Understanding the computational complexity of training simple neural net...
research
11/29/2019

Tropical Polynomial Division and Neural Networks

In this work, we examine the process of Tropical Polynomial Division, a ...

Please sign up or login with your details

Forgot password? Click here to reset