Optimal Rate of Convergence for Deep Neural Network Classifiers under the Teacher-Student Setting

01/19/2020
by   Tianyang Hu, et al.
0

Classifiers built with neural networks handle large-scale high-dimensional data, such as facial images from computer vision, extremely well while traditional statistical methods often fail miserably. In this paper, we attempt to understand this empirical success in high dimensional classification by deriving the convergence rates of excess risk. In particular, a teacher-student framework is proposed that assumes the Bayes classifier to be expressed as ReLU neural networks. In this setup, we obtain a dimension-independent and un-improvable rate of convergence, i.e., O(n^-2/3), for classifiers trained based on either 0-1 loss or hinge loss. This rate can be further improved to O(n^-1) when data is separable. Here, n represents the sample size.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/04/2023

Minimax optimal high-dimensional classification using deep neural networks

High-dimensional classification is a fundamentally important research pr...
research
03/03/2020

On the rate of convergence of image classifiers based on convolutional neural networks

Image classifiers based on convolutional neural networks are defined, an...
research
05/30/2022

Excess Risk of Two-Layer ReLU Neural Networks in Teacher-Student Settings and its Superiority to Kernel Methods

While deep learning has outperformed other methods for various tasks, th...
research
07/04/2022

Minimax Optimal Deep Neural Network Classifiers Under Smooth Decision Boundary

Deep learning has gained huge empirical successes in large-scale classif...
research
03/22/2022

Learning curves for the multi-class teacher-student perceptron

One of the most classical results in high-dimensional learning theory pr...
research
09/28/2018

A theoretical framework for deep locally connected ReLU network

Understanding theoretical properties of deep and locally connected nonli...
research
07/31/2023

Classification with Deep Neural Networks and Logistic Loss

Deep neural networks (DNNs) trained with the logistic loss (i.e., the cr...

Please sign up or login with your details

Forgot password? Click here to reset