Your diffusion model secretly knows the dimension of the data manifold

12/23/2022
by Georgios Batzolis, et al.

In this work, we propose a novel framework for estimating the dimension of the data manifold using a trained diffusion model. A diffusion model approximates the score function, i.e., the gradient of the log density of a noise-corrupted version of the target distribution, for varying levels of corruption. If the data concentrates around a manifold embedded in the high-dimensional ambient space, then as the level of corruption decreases, the score function points towards the manifold, as this direction becomes the direction of maximal likelihood increase. Therefore, for small levels of corruption, the diffusion model provides us with access to an approximation of the normal bundle of the data manifold. This allows us to estimate the dimension of the tangent space and thus the intrinsic dimension of the data manifold. To the best of our knowledge, our method is the first deep-learning-based estimator of the data manifold dimension, and it outperforms well-established statistical estimators in controlled experiments on both Euclidean and image data.
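The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: instead of a trained diffusion model it uses the closed-form score of a toy distribution (standard Gaussian data on a k-dimensional linear subspace of R^d, corrupted with Gaussian noise of standard deviation sigma), and the function names `linear_subspace_score` and `estimate_intrinsic_dim`, the sample count, and the "largest gap in the singular values" rule are all assumptions made for illustration. Scores evaluated at slightly perturbed copies of a point on the manifold lie mostly in the normal space, so the number of dominant singular values of the stacked score matrix approximates the normal-space dimension.

```python
import numpy as np


def linear_subspace_score(x, basis, sigma):
    """Analytic score of N(0, B B^T + sigma^2 I); stands in for a trained model.

    x: (n, d) points, basis: (d, k) matrix with orthonormal columns.
    """
    tangent = x @ basis @ basis.T      # component inside the subspace
    normal = x - tangent               # component orthogonal to the subspace
    # Inverse covariance scales tangent parts by 1/(1+sigma^2), normal parts by 1/sigma^2.
    return -tangent / (1.0 + sigma**2) - normal / sigma**2


def estimate_intrinsic_dim(score_fn, x0, sigma, n_samples, rng):
    """Estimate the manifold dimension at x0 from the spectrum of sampled scores."""
    d = x0.shape[0]
    # Perturb x0 with a small amount of noise and evaluate the score at each sample.
    x = x0 + sigma * rng.standard_normal((n_samples, d))
    scores = score_fn(x)                                   # shape (n_samples, d)
    s = np.linalg.svd(scores, compute_uv=False)            # singular values, descending
    # Assumed rule: the largest drop separates normal from tangent directions.
    gaps = s[:-1] - s[1:]
    normal_dim = int(np.argmax(gaps)) + 1
    return d - normal_dim


rng = np.random.default_rng(0)
d, k, sigma = 10, 3, 1e-2                                  # toy ambient / intrinsic dims
basis, _ = np.linalg.qr(rng.standard_normal((d, k)))       # random orthonormal basis
x0 = basis @ rng.standard_normal(k)                        # a point on the subspace
estimated_dim = estimate_intrinsic_dim(
    lambda x: linear_subspace_score(x, basis, sigma), x0, sigma, 500, rng
)
print(estimated_dim)
```

With a real diffusion model, `score_fn` would be replaced by the network evaluated at a small noise level; how to choose that level and the number of samples is left open here.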


research
05/25/2021

Density estimation on low-dimensional manifolds: an inflation-deflation approach

Normalizing Flows (NFs) are universal density estimators based on Neuron...
research
07/16/2003

Manifold Learning with Geodesic Minimal Spanning Trees

In the manifold learning problem one seeks to discover a smooth low dime...
research
02/11/2021

Quadric hypersurface intersection for manifold learning in feature space

The knowledge that data lies close to a particular submanifold of the am...
research
02/01/2023

Local transfer learning from one data space to another

A fundamental problem in manifold learning is to approximate a functiona...
research
11/02/2017

Approximation of Functions over Manifolds: A Moving Least-Squares Approach

We present an algorithm for approximating a function defined over a d-di...
research
02/25/2021

Diffusion Earth Mover's Distance and Distribution Embeddings

We propose a new fast method of measuring distances between large number...
research
03/06/2022

Diffusion Maps : Using the Semigroup Property for Parameter Tuning

Diffusion maps (DM) constitute a classic dimension reduction technique, ...
