Towards Pragmatic Semantic Image Synthesis for Urban Scenes

05/16/2023
by   George Eskandar, et al.
0

The need for large amounts of training and validation data is a huge concern in scaling AI algorithms for autonomous driving. Semantic Image Synthesis (SIS), or label-to-image translation, promises to address this issue by translating semantic layouts to images, providing a controllable generation of photorealistic data. However, they require a large amount of paired data, incurring extra costs. In this work, we present a new task: given a dataset with synthetic images and labels and a dataset with unlabeled real images, our goal is to learn a model that can generate images with the content of the input mask and the appearance of real images. This new task reframes the well-known unsupervised SIS task in a more practical setting, where we leverage cheaply available synthetic data from a driving simulator to learn how to generate photorealistic images of urban scenes. This stands in contrast to previous works, which assume that labels and images come from the same domain but are unpaired during training. We find that previous unsupervised works underperform on this task, as they do not handle distribution shifts between two different domains. To bypass these problems, we propose a novel framework with two main contributions. First, we leverage the synthetic image as a guide to the content of the generated image by penalizing the difference between their high-level features on a patch level. Second, in contrast to previous works which employ one discriminator that overfits the target domain semantic distribution, we employ a discriminator for the whole image and multiscale discriminators on the image patches. Extensive comparisons on the benchmarks GTA-V → Cityscapes and GTA-V → Mapillary show the superior performance of the proposed model against state-of-the-art on this task.

READ FULL TEXT

page 1

page 2

page 3

page 7

research
06/23/2023

A Semi-Paired Approach For Label-to-Image Translation

Data efficiency, or the ability to generalize from a few labeled data, r...
research
05/16/2023

Wavelet-based Unsupervised Label-to-Image Translation

Semantic Image Synthesis (SIS) is a subclass of image-to-image translati...
research
09/29/2021

USIS: Unsupervised Semantic Image Synthesis

Semantic Image Synthesis (SIS) is a subclass of image-to-image translati...
research
08/19/2023

Controllable Multi-domain Semantic Artwork Synthesis

We present a novel framework for multi-domain synthesis of artwork from ...
research
04/16/2018

DCAN: Dual Channel-wise Alignment Networks for Unsupervised Scene Adaptation

Harvesting dense pixel-level annotations to train deep neural networks f...
research
05/16/2023

Urban-StyleGAN: Learning to Generate and Manipulate Images of Urban Scenes

A promise of Generative Adversarial Networks (GANs) is to provide cheap ...

Please sign up or login with your details

Forgot password? Click here to reset