Auto-Encoding Goodness of Fit

10/12/2022
by   Aaron Palmer, et al.
0

For generative autoencoders to learn a meaningful latent representation for data generation, a careful balance must be achieved between reconstruction error and how close the distribution in the latent space is to the prior. However, this balance is challenging to achieve due to a lack of criteria that work both at the mini-batch (local) and aggregated posterior (global) level. Goodness of fit (GoF) hypothesis tests provide a measure of statistical indistinguishability between the latent distribution and a target distribution class. In this work, we develop the Goodness of Fit Autoencoder (GoFAE), which incorporates hypothesis tests at two levels. At the mini-batch level, it uses GoF test statistics as regularization objectives. At a more global level, it selects a regularization coefficient based on higher criticism, i.e., a test on the uniformity of the local GoF p-values. We justify the use of GoF tests by providing a relaxed L_2-Wasserstein bound on the distance between the latent distribution and target prior. We propose to use GoF tests and prove that optimization based on these tests can be done with stochastic gradient (SGD) descent on a compact Riemannian manifold. Empirically, we show that our higher criticism parameter selection procedure balances reconstruction and generation using mutual information and uniformity of p-values respectively. Finally, we show that GoFAE achieves comparable FID scores and mean squared errors with competing deep generative models while retaining statistical indistinguishability from Gaussian in the latent space based on a variety of hypothesis tests.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/14/2019

Adversarially Approximated Autoencoder for Image Generation and Manipulation

Regularized autoencoders learn the latent codes, a structure with the re...
research
07/20/2020

Generalizing Variational Autoencoders with Hierarchical Empirical Bayes

Variational Autoencoders (VAEs) have experienced recent success as data-...
research
09/23/2020

Generative Model without Prior Distribution Matching

Variational Autoencoder (VAE) and its variations are classic generative ...
research
06/24/2021

Symmetric Wasserstein Autoencoders

Leveraging the framework of Optimal Transport, we introduce a new family...
research
06/17/2021

Spectral goodness-of-fit tests for complete and partial network data

Networks describe the, often complex, relationships between individual a...
research
06/05/2018

On Latent Distributions Without Finite Mean in Generative Models

We investigate the properties of multidimensional probability distributi...
research
05/30/2019

One-element Batch Training by Moving Window

Several deep models, esp. the generative, compare the samples from two d...

Please sign up or login with your details

Forgot password? Click here to reset