Fit Like You Sample: Sample-Efficient Generalized Score Matching from Fast Mixing Markov Chains

06/15/2023
by   Yilong Qin, et al.
0

Score matching is an approach to learning probability distributions parametrized up to a constant of proportionality (e.g. Energy-Based Models). The idea is to fit the score of the distribution, rather than the likelihood, thus avoiding the need to evaluate the constant of proportionality. While there's a clear algorithmic benefit, the statistical "cost” can be steep: recent work by Koehler et al. 2022 showed that for distributions that have poor isoperimetric properties (a large Poincaré or log-Sobolev constant), score matching is substantially statistically less efficient than maximum likelihood. However, many natural realistic distributions, e.g. multimodal distributions as simple as a mixture of two Gaussians in one dimension – have a poor Poincaré constant. In this paper, we show a close connection between the mixing time of an arbitrary Markov process with generator ℒ and an appropriately chosen generalized score matching loss that tries to fit 𝒪 p/p. If ℒ corresponds to a Markov process corresponding to a continuous version of simulated tempering, we show the corresponding generalized score matching loss is a Gaussian-convolution annealed score matching loss, akin to the one proposed in Song and Ermon 2019. Moreover, we show that if the distribution being learned is a finite mixture of Gaussians in d dimensions with a shared covariance, the sample complexity of annealed score matching is polynomial in the ambient dimension, the diameter the means, and the smallest and largest eigenvalues of the covariance – obviating the Poincaré constant-based lower bounds of the basic score matching loss shown in Koehler et al. 2022. This is the first result characterizing the benefits of annealing for score matching – a crucial component in more sophisticated score-based approaches like Song and Ermon 2019.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/03/2022

Statistical Efficiency of Score Matching: The View from Isoperimetry

Deep generative models parametrized up to a normalizing constant (e.g. e...
research
06/03/2023

Provable benefits of score matching

Score matching is an alternative to maximum likelihood (ML) for estimati...
research
12/26/2018

Generalized Score Matching for Non-Negative Data

A common challenge in estimating parameters of probability density funct...
research
02/18/2018

Graphical Models for Non-Negative Data Using Generalized Score Matching

A common challenge in estimating parameters of probability density funct...
research
03/18/2022

Generalized Score Matching for Regression

Many probabilistic models that have an intractable normalizing constant ...
research
10/24/2018

Notes on asymptotics of sample eigenstructure for spiked covariance models with non-Gaussian data

These expository notes serve as a reference for an accompanying post Mor...
research
10/01/2022

Pitfalls of Gaussians as a noise distribution in NCE

Noise Contrastive Estimation (NCE) is a popular approach for learning pr...

Please sign up or login with your details

Forgot password? Click here to reset