Learning Mixtures of Gaussians Using the DDPM Objective

07/03/2023
by   Kulin Shah, et al.
0

Recent works have shown that diffusion models can learn essentially any distribution provided one can perform score estimation. Yet it remains poorly understood under what settings score estimation is possible, let alone when practical gradient-based algorithms for this task can provably succeed. In this work, we give the first provably efficient results along these lines for one of the most fundamental distribution families, Gaussian mixture models. We prove that gradient descent on the denoising diffusion probabilistic model (DDPM) objective can efficiently recover the ground truth parameters of the mixture model in the following two settings: 1) We show gradient descent with random initialization learns mixtures of two spherical Gaussians in d dimensions with 1/poly(d)-separated centers. 2) We show gradient descent with a warm start learns mixtures of K spherical Gaussians with Ω(√(log(min(K,d))))-separated centers. A key ingredient in our proofs is a new connection between score-based methods and two other approaches to distribution learning, the EM algorithm and spectral methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/01/2023

The Parametric Stability of Well-separated Spherical Gaussian Mixtures

We quantify the parameter stability of a spherical Gaussian Mixture Mode...
research
04/13/2020

Learning Mixtures of Spherical Gaussians via Fourier Analysis

Suppose that we are given independent, identically distributed samples x...
research
10/31/2017

On Learning Mixtures of Well-Separated Gaussians

We consider the problem of efficiently learning mixtures of a large numb...
research
11/20/2017

Mixture Models, Robustness, and Sum of Squares Proofs

We use the Sum of Squares method to develop new efficient algorithms for...
research
10/05/2022

A Fourier Approach to Mixture Learning

We revisit the problem of learning mixtures of spherical Gaussians. Give...
research
08/26/2023

Large-scale gradient-based training of Mixtures of Factor Analyzers

Gaussian Mixture Models (GMMs) are a standard tool in data analysis. How...
research
09/24/2020

A Rigorous Link Between Self-Organizing Maps and Gaussian Mixture Models

This work presents a mathematical treatment of the relation between Self...

Please sign up or login with your details

Forgot password? Click here to reset