Learning Diverse Image Colorization

12/06/2016
by   Aditya Deshpande, et al.
0

Colorization is an ambiguous problem, with multiple viable colorizations for a single grey-level image. However, previous methods only produce the single most probable colorization. Our goal is to model the diversity intrinsic to the problem of colorization and produce multiple colorizations that display long-scale spatial co-ordination. We learn a low dimensional embedding of color fields using a variational autoencoder (VAE). We construct loss terms for the VAE decoder that avoid blurry outputs and take into account the uneven distribution of pixel colors. Finally, we build a conditional model for the multi-modal distribution between grey-level image and the color field embeddings. Samples from this conditional model result in diverse colorization. We demonstrate that our method obtains better diverse colorizations than a standard conditional variational autoencoder (CVAE) model, as well as a recently proposed conditional generative adversarial network (cGAN).

READ FULL TEXT

page 2

page 6

page 7

page 8

page 10

page 11

page 12

research
07/02/2019

MIDI-Sandwich: Multi-model Multi-task Hierarchical Conditional VAE-GAN networks for Symbolic Single-track Music Generation

Most existing neural network models for music generation explore how to ...
research
02/07/2022

Multi-modal data generation with a deep metric variational autoencoder

We present a deep metric variational autoencoder for multi-modal data ge...
research
05/29/2023

Autoencoding Conditional Neural Processes for Representation Learning

Conditional neural processes (CNPs) are a flexible and efficient family ...
research
12/01/2016

CDVAE: Co-embedding Deep Variational Auto Encoder for Conditional Variational Generation

Problems such as predicting a new shading field (Y) for an image (X) are...
research
10/04/2019

Conditional out-of-sample generation for unpaired data using trVAE

While generative models have shown great success in generating high-dime...
research
08/05/2021

RockGPT: Reconstructing three-dimensional digital rocks from single two-dimensional slice from the perspective of video generation

Random reconstruction of three-dimensional (3D) digital rocks from two-d...
research
09/11/2021

Conditional Generation of Synthetic Geospatial Images from Pixel-level and Feature-level Inputs

Training robust supervised deep learning models for many geospatial appl...

Please sign up or login with your details

Forgot password? Click here to reset