PUG: Photorealistic and Semantically Controllable Synthetic Data for Representation Learning

08/08/2023
by   Florian Bordes, et al.
0

Synthetic image datasets offer unmatched advantages for designing and evaluating deep neural networks: they make it possible to (i) render as many data samples as needed, (ii) precisely control each scene and yield granular ground truth labels (and captions), (iii) precisely control distribution shifts between training and testing to isolate variables of interest for sound experimentation. Despite such promise, the use of synthetic image data is still limited – and often played down – mainly due to their lack of realism. Most works therefore rely on datasets of real images, which have often been scraped from public images on the internet, and may have issues with regards to privacy, bias, and copyright, while offering little control over how objects precisely appear. In this work, we present a path to democratize the use of photorealistic synthetic data: we develop a new generation of interactive environments for representation learning research, that offer both controllability and realism. We use the Unreal Engine, a powerful game engine well known in the entertainment industry, to produce PUG (Photorealistic Unreal Graphics) environments and datasets for representation learning. In this paper, we demonstrate the potential of PUG to enable more rigorous evaluations of vision models.

READ FULL TEXT

page 2

page 4

page 6

page 8

page 21

page 22

page 24

page 32

research
05/10/2019

Ship classification from overhead imagery using synthetic data and domain adaptation

In this paper, we revisit the problem of classifying ships (maritime ves...
research
11/29/2022

Procedural Image Programs for Representation Learning

Learning image representations using synthetic data allows training neur...
research
04/23/2021

UnrealROX+: An Improved Tool for Acquiring Synthetic Data from Virtual 3D Environments

Synthetic data generation has become essential in last years for feeding...
research
07/16/2018

Unlimited Road-scene Synthetic Annotation (URSA) Dataset

In training deep neural networks for semantic segmentation, the main lim...
research
12/16/2021

CrossLoc: Scalable Aerial Localization Assisted by Multimodal Synthetic Data

We present a visual localization system that learns to estimate camera p...
research
09/05/2016

UnrealCV: Connecting Computer Vision to Unreal Engine

Computer graphics can not only generate synthetic images and ground trut...
research
03/21/2020

NeuCrowd: Neural Sampling Network for Representation Learning with Crowdsourced Labels

Representation learning approaches require a massive amount of discrimin...

Please sign up or login with your details

Forgot password? Click here to reset