Classification of Data Generated by Gaussian Mixture Models Using Deep ReLU Networks

08/15/2023
by   Tian-Yi Zhou, et al.
0

This paper studies the binary classification of unbounded data from ℝ^d generated under Gaussian Mixture Models (GMMs) using deep ReLU neural networks. We obtain x2013 for the first time x2013 non-asymptotic upper bounds and convergence rates of the excess risk (excess misclassification error) for the classification without restrictions on model parameters. The convergence rates we derive do not depend on dimension d, demonstrating that deep ReLU networks can overcome the curse of dimensionality in classification. While the majority of existing generalization analysis of classification algorithms relies on a bounded domain, we consider an unbounded domain by leveraging the analyticity and fast decay of Gaussian distributions. To facilitate our analysis, we give a novel approximation error bound for general analytic functions using ReLU networks, which may be of independent interest. Gaussian distributions can be adopted nicely to model data arising in applications, e.g., speeches, images, and texts; our results provide a theoretical verification of the observed efficiency of deep neural networks in practical classification problems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/31/2023

Classification with Deep Neural Networks and Logistic Loss

Deep neural networks (DNNs) trained with the logistic loss (i.e., the cr...
research
11/13/2021

Deep Learning in High Dimension: Neural Network Approximation of Analytic Functions in L^2(ℝ^d,γ_d)

For artificial deep neural networks, we prove expression rates for analy...
research
08/02/2021

Convergence rates of deep ReLU networks for multiclass classification

For classification problems, trained deep neural networks return probabi...
research
11/10/2021

Collocation approximation by deep neural ReLU networks for parametric elliptic PDEs with lognormal inputs

We obtained convergence rates of the collocation approximation by deep R...
research
05/27/2020

Fast Risk Assessment for Autonomous Vehicles Using Learned Models of Agent Futures

This paper presents fast non-sampling based methods to assess the risk o...
research
09/21/2021

Fast nonlinear risk assessment for autonomous vehicles using learned conditional probabilistic models of agent futures

This paper presents fast non-sampling based methods to assess the risk f...
research
03/11/2021

On Finite-Sample Analysis of Offline Reinforcement Learning with Deep ReLU Networks

This paper studies the statistical theory of offline reinforcement learn...

Please sign up or login with your details

Forgot password? Click here to reset