Generalization for multiclass classification with overparameterized linear models

06/03/2022
by   Vignesh Subramanian, et al.
0

Via an overparameterized linear model with Gaussian features, we provide conditions for good generalization for multiclass classification of minimum-norm interpolating solutions in an asymptotic setting where both the number of underlying features and the number of classes scale with the number of training points. The survival/contamination analysis framework for understanding the behavior of overparameterized learning problems is adapted to this setting, revealing that multiclass classification qualitatively behaves like binary classification in that, as long as there are not too many classes (made precise in the paper), it is possible to generalize well even in some settings where the corresponding regression tasks would not generalize. Besides various technical challenges, it turns out that the key difference from the binary classification setting is that there are relatively fewer positive training examples of each class in the multiclass setting as the number of classes increases, making the multiclass problem "harder" than the binary one.

READ FULL TEXT

page 7

page 11

research
11/16/2020

Theoretical Insights Into Multiclass Classification: A High-dimensional Asymptotic View

Contemporary machine learning applications often involve classification ...
research
09/06/2012

Multiclass Learning with Simplex Coding

In this paper we discuss a novel framework for multiclass learning, defi...
research
06/23/2023

Precise Asymptotic Generalization for Multiclass Classification with Overparameterized Linear Models

We study the asymptotic generalization of an overparameterized linear mo...
research
10/04/2018

Improved generalization bounds for robust learning

We consider a model of robust learning in an adversarial environment. Th...
research
02/18/2023

The Generalization Error of Stochastic Mirror Descent on Over-Parametrized Linear Models

Despite being highly over-parametrized, and having the ability to fully ...
research
06/15/2016

Logarithmic Time One-Against-Some

We create a new online reduction of multiclass classification to binary ...
research
06/20/2023

A Model-free Closeness-of-influence Test for Features in Supervised Learning

Understanding the effect of a feature vector x ∈ℝ^d on the response valu...

Please sign up or login with your details

Forgot password? Click here to reset