Adjusting for Bias with Procedural Data

04/03/2022
by   Shesh Narayan Gupta, et al.
0

3D softwares are now capable of producing highly realistic images that look nearly indistinguishable from the real images. This raises the question: can real datasets be enhanced with 3D rendered data? We investigate this question. In this paper we demonstrate the use of 3D rendered data, procedural, data for the adjustment of bias in image datasets. We perform error analysis of images of animals which shows that the misclassification of some animal breeds is largely a data issue. We then create procedural images of the poorly classified breeds and that model further trained on procedural data can better classify poorly performing breeds on real data. We believe that this approach can be used for the enhancement of visual data for any underrepresented group, including rare diseases, or any data bias potentially improving the accuracy and fairness of models. We find that the resulting representations rival or even out-perform those learned directly from real data, but that good performance requires care in the 3D rendered procedural data generation. 3D image dataset can be viewed as a compressed and organized copy of a real dataset, and we envision a future where more and more procedural data proliferate while datasets become increasingly unwieldy, missing, or private. This paper suggests several techniques for dealing with visual representation learning in such a future.

READ FULL TEXT

page 2

page 3

page 4

page 6

research
06/09/2021

Generative Models as a Data Source for Multiview Representation Learning

Generative models are now capable of producing highly realistic images t...
research
11/07/2019

This dataset does not exist: training models from generated images

Current generative networks are increasingly proficient in generating hi...
research
06/10/2021

Learning to See by Looking at Noise

Current vision systems are trained on huge datasets, and these datasets ...
research
10/14/2021

Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset

Visually-grounded spoken language datasets can enable models to learn cr...
research
11/16/2021

Language bias in Visual Question Answering: A Survey and Taxonomy

Visual question answering (VQA) is a challenging task, which has attract...
research
12/05/2022

A Dataless FaceSwap Detection Approach Using Synthetic Images

Face swapping technology used to create "Deepfakes" has advanced signifi...

Please sign up or login with your details

Forgot password? Click here to reset