Learning Gaussian Mixtures with Generalised Linear Models: Precise Asymptotics in High-dimensions

06/07/2021
by   Bruno Loureiro, et al.

Generalised linear models for multi-class classification are among the fundamental building blocks of modern machine learning. In this manuscript, we characterise the learning of a mixture of K Gaussians with generic means and covariances via empirical risk minimisation (ERM) with any convex loss and regularisation. In particular, we prove exact asymptotics characterising the ERM estimator in high dimensions, extending several previous results on Gaussian mixture classification in the literature. We exemplify our result in two tasks of interest in statistical learning: a) classification for a mixture with sparse means, where we study the efficiency of the ℓ_1 penalty with respect to ℓ_2; b) max-margin multi-class classification, where we characterise the phase transition on the existence of the multi-class logistic maximum likelihood estimator for K>2. Finally, we discuss how our theory can be applied beyond the scope of synthetic data, showing that in different cases Gaussian mixtures closely capture the learning curve of classification tasks on real data sets.
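
To make the setting concrete, the following is a minimal numerical sketch of ERM on a Gaussian mixture. It is not the authors' code or method of analysis. It assumes a special case of the setting in the abstract: equal-weight classes, identity covariances, the cross-entropy loss, an ℓ_2 penalty, a 1/sqrt(d) scaling of the logits, and plain gradient descent as the solver. All function names and parameter values (`sample_mixture`, `fit_erm`, `lam`, `lr`, and so on) are illustrative choices.

```python
import numpy as np

def sample_mixture(n, means, rng):
    """Draw n points from an equal-weight mixture of K Gaussians.

    Assumption: identity covariance for every cluster (the abstract
    allows generic covariances; this is the simplest special case).
    """
    K, d = means.shape
    labels = rng.integers(0, K, size=n)
    X = means[labels] + rng.standard_normal((n, d))
    return X, labels

def softmax(Z):
    """Row-wise softmax, shifted for numerical stability."""
    Z = Z - Z.max(axis=1, keepdims=True)
    E = np.exp(Z)
    return E / E.sum(axis=1, keepdims=True)

def objective(W, X, labels, lam):
    """Mean cross-entropy plus (lam / 2) * squared Frobenius norm of W."""
    n, d = X.shape
    P = softmax(X @ W / np.sqrt(d))
    cross_entropy = -np.log(P[np.arange(n), labels]).mean()
    return cross_entropy + 0.5 * lam * np.sum(W ** 2)

def fit_erm(X, labels, K, lam=0.1, lr=0.5, steps=500):
    """Minimise the regularised empirical risk by gradient descent."""
    n, d = X.shape
    Y = np.eye(K)[labels]          # one-hot labels, shape (n, K)
    W = np.zeros((d, K))
    for _ in range(steps):
        P = softmax(X @ W / np.sqrt(d))
        grad = X.T @ (P - Y) / (n * np.sqrt(d)) + lam * W
        W -= lr * grad
    return W

rng = np.random.default_rng(0)
K, d, n, lam = 3, 50, 200, 0.1      # illustrative sizes only
means = rng.standard_normal((K, d)) # hypothetical cluster means

X_train, y_train = sample_mixture(n, means, rng)
X_test, y_test = sample_mixture(2000, means, rng)

W = fit_erm(X_train, y_train, K, lam=lam)
train_obj = objective(W, X_train, y_train, lam)
test_acc = np.mean(np.argmax(X_test @ W, axis=1) == y_test)

print(f"training objective: {train_obj:.4f}")
print(f"test accuracy:      {test_acc:.3f}")
```

Repeating this experiment over a grid of sample sizes n at fixed d gives an empirical learning curve, which is the kind of quantity the asymptotic characterisation is meant to predict. Swapping the ℓ_2 term for an ℓ_1 penalty would require a different solver (for example a proximal method), which this sketch does not include.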


research
02/17/2023

Universality laws for Gaussian mixtures in generalized linear models

Let (x_i, y_i)_{i=1,…,n} denote independent samples from a general mixture...
research
04/28/2021

Risk Bounds for Over-parameterized Maximum Margin Classification on Sub-Gaussian Mixtures

Modern machine learning systems such as deep neural networks are often h...
research
09/15/2022

Upper bounds on the Natarajan dimensions of some function classes

The Natarajan dimension is a fundamental tool for characterizing multi-c...
research
08/19/2020

Self-regularizing Property of Nonparametric Maximum Likelihood Estimator in Mixture Models

Introduced by Kiefer and Wolfowitz, the nonparametric maximum lik...
research
12/26/2014

Exploring Sparsity in Multi-class Linear Discriminant Analysis

Recent studies in the literature have paid much attention to the sparsit...
research
06/07/2018

Scalable Multi-Class Bayesian Support Vector Machines for Structured and Unstructured Data

We introduce a new Bayesian multi-class support vector machine by formul...
research
04/06/2023

Classification of Superstatistical Features in High Dimensions

We characterise the learning of a mixture of two clouds of data points w...
