Maximum Likelihood Training of Autoencoders

06/02/2023
by   Peter Sorrenson, et al.
0

Maximum likelihood training has favorable statistical properties and is popular for generative modeling, especially with normalizing flows. On the other hand, generative autoencoders promise to be more efficient than normalizing flows due to the manifold hypothesis. In this work, we introduce successful maximum likelihood training of unconstrained autoencoders for the first time, bringing the two paradigms together. To do so, we identify and overcome two challenges: Firstly, existing maximum likelihood estimators for free-form networks are unacceptably slow, relying on iteration schemes whose cost scales linearly with latent dimension. We introduce an improved estimator which eliminates iteration, resulting in constant cost (roughly double the runtime per batch of a vanilla autoencoder). Secondly, we demonstrate that naively applying maximum likelihood to autoencoders can lead to divergent solutions and use this insight to motivate a stable maximum likelihood training objective. We perform extensive experiments on toy, tabular and image data, demonstrating the competitive performance of the resulting model. We call our model the maximum likelihood autoencoder (MLAE).

READ FULL TEXT

page 2

page 28

page 33

page 34

research
07/12/2022

Sliced-Wasserstein normalizing flows: beyond maximum likelihood training

Despite their advantages, normalizing flows generally suffer from severa...
research
05/19/2022

Closing the gap: Exact maximum likelihood training of generative autoencoders using invertible layers

In this work, we provide an exact likelihood alternative to the variatio...
research
06/14/2022

Energy Flows: Towards Determinant-Free Training of Normalizing Flows

Normalizing flows are a popular approach for constructing probabilistic ...
research
05/24/2018

Implicit Autoencoders

In this paper, we describe the "implicit autoencoder" (IAE), a generativ...
research
12/19/2017

Improving End-to-End Speech Recognition with Policy Learning

Connectionist temporal classification (CTC) is widely used for maximum l...
research
10/19/2010

Maximum Likelihood Mosaics

The majority of the approaches to the automatic recovery of a panoramic ...
research
11/16/2015

How (not) to Train your Generative Model: Scheduled Sampling, Likelihood, Adversary?

Modern applications and progress in deep learning research have created ...

Please sign up or login with your details

Forgot password? Click here to reset