Autoencoding beyond pixels using a learned similarity metric

12/31/2015 ∙ by Anders Boesen Lindbo Larsen, et al.

We present an autoencoder that leverages learned representations to better measure similarities in data space. By combining a variational autoencoder (VAE) with a generative adversarial network (GAN), we can use learned feature representations in the GAN discriminator as the basis for the VAE reconstruction objective. Thereby, we replace element-wise errors with feature-wise errors to better capture the data distribution while offering invariance towards, e.g., translation. We apply our method to images of faces and show that it outperforms VAEs with element-wise similarity measures in terms of visual fidelity. Moreover, we show that the method learns an embedding in which high-level abstract visual features (e.g. wearing glasses) can be modified using simple arithmetic.
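To make the contrast between element-wise and feature-wise errors concrete, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not the authors' implementation: `toy_features` is a hypothetical, hand-written stand-in for a hidden layer of the GAN discriminator (a few fixed 1-D filters, a ReLU, and global max pooling), whereas in the method described above those features are learned. The signals, filters, and function names are all invented for this example.

```python
def elementwise_error(x, x_rec):
    """Sum of squared differences taken position by position (pixel-wise)."""
    return sum((a - b) ** 2 for a, b in zip(x, x_rec))


def toy_features(x, filters):
    """Hypothetical stand-in for a discriminator layer (NOT learned here).

    Each filter is slid over the signal (valid convolution), the strongest
    response is kept (global max pooling), and negatives are clipped (ReLU).
    Global pooling is what makes these toy features shift-invariant.
    """
    feats = []
    for f in filters:
        k = len(f)
        responses = [
            sum(f[j] * x[i + j] for j in range(k))
            for i in range(len(x) - k + 1)
        ]
        feats.append(max(0.0, max(responses)))
    return feats


def featurewise_error(x, x_rec, filters):
    """Sum of squared differences taken in feature space instead of data space."""
    fx = toy_features(x, filters)
    fr = toy_features(x_rec, filters)
    return sum((a - b) ** 2 for a, b in zip(fx, fr))


# Assumed toy filters: rising edge, falling edge, and local mass.
FILTERS = [[1.0, -1.0], [-1.0, 1.0], [1.0, 1.0]]

x = [0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0]        # original "image"
x_shift = [0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0]  # same shape, shifted by one
x_blur = [0.0, 0.0, 0.5, 0.5, 0.0, 0.0, 0.0, 0.0]   # right place, washed out

if __name__ == "__main__":
    for name, rec in [("shifted", x_shift), ("blurred", x_blur)]:
        print(
            name,
            "element-wise:", elementwise_error(x, rec),
            "feature-wise:", featurewise_error(x, rec, FILTERS),
        )
```

With these toy inputs, the element-wise error penalizes the sharp but shifted reconstruction more than the washed-out one, while the feature-wise error ranks them the other way round. That is the kind of behaviour the abstract alludes to when it mentions invariance towards translation. With a learned discriminator the features, and therefore the exact numbers, would of course differ.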






Code Repositories


Variational Autoencoder using a similarity metric learned by a generative adversarial network



Keras / Tensorflow implementation of Larsen,
