Unpaired Image-to-Image Translation via Latent Energy Transport

12/01/2020
by   fcq, et al.
8

Image-to-image translation aims to preserve source contents while translating to discriminative target styles between two visual domains. Most works apply adversarial learning in the ambient image space, which could be computationally expensive and challenging to train. In this paper, we propose to deploy an energy-based model (EBM) in the latent space of a pretrained autoencoder for this task. The pretrained autoencoder serves as both a latent code extractor and an image reconstruction worker. Our model is based on the assumption that two domains share the same latent space, where latent representation is implicitly decomposed as a content code and a domain-specific style code. Instead of explicitly extracting the two codes and applying adaptive instance normalization to combine them, our latent EBM can implicitly learn to transport the source style code to the target style code while preserving the content code, which is an advantage over existing image translation methods. This simplified solution also brings us far more efficiency in the one-sided unpaired image translation setting. Qualitative and quantitative comparisons demonstrate superior translation quality and faithfulness for content preservation. To the best of our knowledge, our model is the first to be applicable to 1024×1024-resolution unpaired image translation.

READ FULL TEXT

page 2

page 6

page 7

page 8

page 12

page 13

page 14

page 15

research
10/27/2021

Separating Content and Style for Unsupervised Image-to-Image Translation

Unsupervised image-to-image translation aims to learn the mapping betwee...
research
07/23/2021

Image-to-Image Translation with Low Resolution Conditioning

Most image-to-image translation methods focus on learning mappings acros...
research
08/13/2020

Powers of layers for image-to-image translation

We propose a simple architecture to address unpaired image-to-image tran...
research
06/11/2021

GANs N' Roses: Stable, Controllable, Diverse Image to Image Translation (works for videos too!)

We show how to learn a map that takes a content code, derived from a fac...
research
05/08/2023

Domain Agnostic Image-to-image Translation using Low-Resolution Conditioning

Generally, image-to-image translation (i2i) methods aim at learning mapp...
research
10/11/2021

LSC-GAN: Latent Style Code Modeling for Continuous Image-to-image Translation

Image-to-image (I2I) translation is usually carried out among discrete d...
research
03/20/2020

Unsupervised Latent Space Translation Network

One task that is often discussed in a computer vision is the mapping of ...

Please sign up or login with your details

Forgot password? Click here to reset