Estimating Distributions with Low-dimensional Structures Using Mixtures of Generative Models

01/02/2023
by Rong Tang, et al.

There is growing interest in statistical inference from data satisfying the so-called manifold hypothesis, which assumes that data points in a high-dimensional ambient space lie in close vicinity of a submanifold of much lower dimension. In machine learning, generative modeling approaches based on encoder-decoder pairs have been successful in learning complicated high-dimensional distributions, such as those over images and text, by explicitly imposing a low-dimensional manifold structure. In this work, we introduce a new approach for estimating distributions on unknown submanifolds via mixtures of generative models. We show that conventional generative modeling approaches using a single encoder-decoder pair are generally unable to capture data distributions under the manifold hypothesis unless the underlying manifold admits a global parametrization; however, this issue can be resolved by using a collection of encoder-decoder pairs to learn different local patches of the manifold supporting the data. We develop a rigorous theoretical analysis demonstrating that the proposed estimator attains the minimax-optimal rate of convergence for the implicit estimation of data distributions with manifold structures. Our experiments show that, by utilizing parameter sharing, the proposed method can significantly improve the performance of conventional autoencoder-based generative modeling approaches with minimal additional computational effort.
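To make the global-parametrization issue concrete, consider the unit circle: it is not homeomorphic to any interval of the real line, so a single continuous, invertible decoder from a one-dimensional latent space cannot cover it, whereas two local patches can. The sketch below is only an illustration of sampling from a mixture of local generators, not the estimator proposed in the paper. The names `make_arc_decoder` and `sample_mixture`, the two half-circle patches, the uniform latent variable, and the equal mixture weights are all assumptions chosen for the example.

```python
import math
import random


def make_arc_decoder(start, end):
    """Return a hypothetical local decoder mapping a latent z in [0, 1)
    to the arc of the unit circle with angles in [start, end)."""
    def decode(z):
        theta = start + (end - start) * z
        return (math.cos(theta), math.sin(theta))
    return decode


def sample_mixture(decoders, weights, n, rng):
    """Draw n samples from a mixture of local generators.

    Each draw first picks a component index k according to the weights,
    then pushes uniform latent noise through the k-th decoder.
    Returns a list of (component index, (x, y)) pairs.
    """
    samples = []
    for _ in range(n):
        k = rng.choices(range(len(decoders)), weights=weights, k=1)[0]
        z = rng.random()
        samples.append((k, decoders[k](z)))
    return samples


# Two patches (assumed for illustration): the upper and lower half-circles.
decoders = [
    make_arc_decoder(0.0, math.pi),
    make_arc_decoder(math.pi, 2.0 * math.pi),
]

# Equal weights: each patch carries half of the probability mass.
samples = sample_mixture(decoders, [0.5, 0.5], 1000, random.Random(0))
```

Every sample lies on the circle, and the component label records which patch generated it. In a learned model, the fixed arc maps would presumably be replaced by trained decoders and the weights estimated from data; those details are not specified in the abstract.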


