Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation

08/03/2020
by   Elad Richardson, et al.

We present a generic image-to-image translation framework, Pixel2Style2Pixel (pSp). Our pSp framework is based on a novel encoder network that directly generates a series of style vectors, which are fed into a pretrained StyleGAN generator and together form the extended W+ latent space. We first show that our encoder can directly embed real images into W+, with no additional optimization. We further introduce a dedicated identity loss, which is shown to improve the reconstruction of an input image. We demonstrate pSp to be a simple architecture that, by leveraging a well-trained, fixed generator network, can be easily applied to a wide range of image-to-image translation tasks. Solving these tasks through the style representation results in a global approach that does not rely on local pixel-to-pixel correspondence and that further supports multi-modal synthesis via the resampling of styles. Notably, we demonstrate that pSp can be trained to align a face image to a frontal pose without any labeled data, generate multi-modal results for ambiguous tasks such as conditional face generation from segmentation maps, and construct high-resolution images from corresponding low-resolution images.
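
The ideas in the abstract can be illustrated with a minimal, dependency-free sketch. It is not the authors' implementation. The function names are hypothetical. The identity loss is assumed to take a common form, one minus the cosine similarity between face-recognition embeddings, which the abstract does not spell out. The random style vector is a Gaussian stand-in for a vector that would normally come from the generator's mapping network.

```python
import math
import random


def num_style_vectors(resolution: int) -> int:
    """Number of style inputs of a StyleGAN generator (two per resolution
    level from 4x4 upward), e.g. 18 at 1024x1024. A W+ code holds one
    style vector per input."""
    return 2 * int(math.log2(resolution)) - 2


def identity_loss(feat_src, feat_rec):
    """Assumed form of an identity loss: 1 - cosine similarity between the
    embeddings of the input image and of its reconstruction."""
    dot = sum(a * b for a, b in zip(feat_src, feat_rec))
    norm_src = math.sqrt(sum(a * a for a in feat_src))
    norm_rec = math.sqrt(sum(b * b for b in feat_rec))
    return 1.0 - dot / (norm_src * norm_rec)


def resample_styles(w_plus, start_layer, rng, alpha=1.0):
    """Return a copy of a W+ code in which every layer from `start_layer`
    onward is blended with one randomly drawn style vector.

    `alpha` = 1.0 replaces those layers outright; smaller values keep part
    of the encoded style. The Gaussian draw is a placeholder: a real
    pipeline would sample z and map it to w with the mapping network.
    """
    dim = len(w_plus[0])
    w_rand = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    mixed = []
    for i, w in enumerate(w_plus):
        if i < start_layer:
            mixed.append(list(w))  # early layers keep the encoded styles
        else:
            mixed.append([alpha * r + (1.0 - alpha) * v
                          for r, v in zip(w_rand, w)])
    return mixed


# Usage: an all-zero W+ code with tiny 4-dimensional styles for readability.
rng = random.Random(0)
w_plus = [[0.0] * 4 for _ in range(num_style_vectors(1024))]
mixed = resample_styles(w_plus, start_layer=7, rng=rng)
```

Calling `resample_styles` repeatedly with different random draws yields several distinct codes that share the same early-layer styles, which is one way to read "multi-modal synthesis via the resampling of styles". The choice of `start_layer=7` here is arbitrary.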


