Generating Diverse High-Fidelity Images with VQ-VAE-2

06/02/2019
by   Ali Razavi, et al.
4

We explore the use of Vector Quantized Variational AutoEncoder (VQ-VAE) models for large scale image generation. To this end, we scale and enhance the autoregressive priors used in VQ-VAE to generate synthetic samples of much higher coherence and fidelity than possible before. We use simple feed-forward encoder and decoder networks, making our model an attractive candidate for applications where the encoding and/or decoding speed is critical. Additionally, VQ-VAE requires sampling an autoregressive model only in the compressed latent space, which is an order of magnitude faster than sampling in the pixel space, especially for large images. We demonstrate that a multi-scale hierarchical organization of VQ-VAE, augmented with powerful priors over the latent codes, is able to generate samples with quality that rivals that of state of the art Generative Adversarial Networks on multifaceted datasets such as ImageNet, while not suffering from GAN's known shortcomings such as mode collapse and lack of diversity.

READ FULL TEXT

page 1

page 4

page 7

page 8

page 9

research
06/22/2020

generating annotated high-fidelity images containing multiple coherent objects

Recent developments related to generative models have made it possible t...
research
07/20/2020

Incorporating Reinforced Adversarial Learning in Autoregressive Image Generation

Autoregressive models recently achieved comparable results versus state-...
research
02/19/2018

Degeneration in VAE: in the Light of Fisher Information Loss

Variational Autoencoder (VAE) is one of the most popular generative mode...
research
06/22/2020

Hierarchical Patch VAE-GAN: Generating Diverse Videos from a Single Sample

We consider the task of generating diverse and novel videos from a singl...
research
03/23/2023

High Fidelity Image Synthesis With Deep VAEs In Latent Space

We present fast, realistic image generation on high-resolution, multimod...
research
03/06/2019

Hierarchical Autoregressive Image Models with Auxiliary Decoders

Autoregressive generative models of images tend to be biased towards cap...
research
12/04/2018

Generating High Fidelity Images with Subscale Pixel Networks and Multidimensional Upscaling

The unconditional generation of high fidelity images is a longstanding b...

Please sign up or login with your details

Forgot password? Click here to reset