Rethinking CycleGAN: Improving Quality of GANs for Unpaired Image-to-Image Translation

03/28/2023
by   Dmitrii Torbunov, et al.
0

An unpaired image-to-image (I2I) translation technique seeks to find a mapping between two domains of data in a fully unsupervised manner. While the initial solutions to the I2I problem were provided by the generative adversarial neural networks (GANs), currently, diffusion models (DM) hold the state-of-the-art status on the I2I translation benchmarks in terms of FID. Yet, they suffer from some limitations, such as not using data from the source domain during the training, or maintaining consistency of the source and translated images only via simple pixel-wise errors. This work revisits the classic CycleGAN model and equips it with recent advancements in model architectures and model training procedures. The revised model is shown to significantly outperform other advanced GAN- and DM-based competitors on a variety of benchmarks. In the case of Male2Female translation of CelebA, the model achieves over 40 state-of-the-art results. This work also demonstrates the ineffectiveness of the pixel-wise I2I translation faithfulness metrics and suggests their revision. The code and trained models are available at https://github.com/LS4GAN/uvcgan2

READ FULL TEXT

page 3

page 6

page 13

page 16

page 17

page 18

page 19

page 20

research
12/14/2019

Asymmetric Generative Adversarial Networks for Image-to-Image Translation

State-of-the-art models for unpaired image-to-image translation with Gen...
research
03/04/2022

UVCGAN: UNet Vision Transformer cycle-consistent GAN for unpaired image-to-image translation

Image-to-image translation has broad applications in art, design, and sc...
research
04/18/2023

Look ATME: The Discriminator Mean Entropy Needs Attention

Generative adversarial networks (GANs) are successfully used for image s...
research
05/29/2019

Batch weight for domain adaptation with mass shift

Unsupervised domain transfer is the task of transferring or translating ...
research
01/12/2019

SteganoGAN: High Capacity Image Steganography with GANs

Image steganography is a procedure for hiding messages inside pictures. ...
research
06/02/2021

Sound-to-Imagination: Unsupervised Crossmodal Translation Using Deep Dense Network Architecture

The motivation of our research is to develop a sound-to-image (S2I) tran...

Please sign up or login with your details

Forgot password? Click here to reset