A Mathematical Framework for Learning Probability Distributions

12/22/2022
by Hongkang Yang, et al.

The modeling of probability distributions, specifically generative modeling and density estimation, has become an immensely popular subject in recent years by virtue of its outstanding performance on sophisticated data such as images and text. Nevertheless, a theoretical understanding of its success is still incomplete. One mystery is the paradox between memorization and generalization: in theory, the model is trained to match the empirical distribution of the finite samples exactly, whereas in practice, the trained model can generate new samples or estimate the likelihood of unseen samples. Likewise, the overwhelming diversity of distribution learning models calls for a unified perspective on this subject. This paper provides a mathematical framework in which all the well-known models can be derived from simple principles. To demonstrate its efficacy, we present a survey of our results on the approximation error, training error and generalization error of these models, all of which can be established within this framework. In particular, the aforementioned paradox is resolved by proving that these models enjoy implicit regularization during training, so that the generalization error under early stopping avoids the curse of dimensionality. Furthermore, we provide some new results on landscape analysis and the mode collapse phenomenon.
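The memorization side of the paradox can be illustrated with a toy discrete example. The sketch below is not taken from the paper: the sample values and the helper names `empirical_distribution` and `log_likelihood` are assumptions made for illustration. It shows that a model equal to the empirical distribution fits the training samples but assigns zero probability to any unseen sample.

```python
from collections import Counter
import math


def empirical_distribution(samples):
    """Return the empirical distribution: each observed value gets mass count / n."""
    n = len(samples)
    counts = Counter(samples)
    return {value: count / n for value, count in counts.items()}


def log_likelihood(dist, samples):
    """Average log-likelihood of samples under dist; -inf if any sample has zero mass."""
    total = 0.0
    for x in samples:
        p = dist.get(x, 0.0)
        if p == 0.0:
            return float("-inf")
        total += math.log(p)
    return total / len(samples)


# Hypothetical data: the value 4 never appears in the training set.
train = [1, 2, 2, 3]
test = [2, 4]

p_hat = empirical_distribution(train)
train_ll = log_likelihood(p_hat, train)  # finite: the model fits its own samples
test_ll = log_likelihood(p_hat, test)    # -inf: the unseen value 4 has zero mass

print(p_hat)
print(train_ll, test_ll)
```

A model that generalizes must therefore stop short of this exact fit, which is the role the abstract attributes to implicit regularization and early stopping.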

research
11/29/2020

Generalization and Memorization: The Bias Potential Model

Models for learning probability distributions such as generative models ...
research
07/08/2021

Generalization Error of GAN from the Discriminator's Perspective

The generative adversarial network (GAN) is a well-known model for learn...
research
03/09/2021

An Introduction to Deep Generative Modeling

Deep generative models (DGM) are neural networks with many hidden layers...
research
05/26/2023

Quantum Kernel Mixtures for Probabilistic Deep Learning

This paper presents a novel approach to probabilistic deep learning (PDL...
research
12/02/2021

HyperSPNs: Compact and Expressive Probabilistic Circuits

Probabilistic circuits (PCs) are a family of generative models which all...
research
03/07/2022

Robust Modeling of Unknown Dynamical Systems via Ensemble Averaged Learning

Recent work has focused on data-driven learning of the evolution of unkn...
research
06/06/2019

A Look at the Effect of Sample Design on Generalization through the Lens of Spectral Analysis

This paper provides a general framework to study the effect of sampling ...
