Attribute Descent: Simulating Object-Centric Datasets on the Content Level and Beyond

02/28/2022
by   Yue Yao, et al.
0

This article aims to use graphic engines to simulate a large number of training data that have free annotations and possibly strongly resemble to real-world data. Between synthetic and real, a two-level domain gap exists, involving content level and appearance level. While the latter is concerned with appearance style, the former problem arises from a different mechanism, i.e., content mismatch in attributes such as camera viewpoint, object placement and lighting conditions. In contrast to the widely-studied appearance-level gap, the content-level discrepancy has not been broadly studied. To address the content-level misalignment, we propose an attribute descent approach that automatically optimizes engine attributes to enable synthetic data to approximate real-world data. We verify our method on object-centric tasks, wherein an object takes up a major portion of an image. In these tasks, the search space is relatively small, and the optimization of each attribute yields sufficiently obvious supervision signals. We collect a new synthetic asset VehicleX, and reformat and reuse existing the synthetic assets ObjectX and PersonX. Extensive experiments on image classification and object re-identification confirm that adapted synthetic data can be effectively used in three scenarios: training with synthetic data only, training data augmentation and numerically understanding dataset content.

READ FULL TEXT

page 1

page 3

page 5

page 6

page 7

page 11

page 12

page 15

research
12/18/2019

Simulating Content Consistent Vehicle Datasets with Attribute Descent

We simulate data using a graphic engine to augment real-world datasets, ...
research
06/25/2020

Learning to simulate complex scenes

Data simulation engines like Unity are becoming an increasingly importan...
research
09/22/2021

Less is More: Learning from Synthetic Data with Fine-grained Attributes for Person Re-Identification

Person re-identification (re-ID) plays an important role in applications...
research
12/02/2022

PASTA: Proportional Amplitude Spectrum Training Augmentation for Syn-to-Real Domain Generalization

Synthetic data offers the promise of cheap and bountiful training data f...
research
06/12/2020

Attribute analysis with synthetic dataset for person re-identification

Person re-identification (re-ID) plays an important role in applications...
research
04/01/2020

Objects of violence: synthetic data for practical ML in human rights investigations

We introduce a machine learning workflow to search for, identify, and me...
research
12/13/2019

Joint Viewpoint and Keypoint Estimation with Real and Synthetic Data

The estimation of viewpoints and keypoints effectively enhance object de...

Please sign up or login with your details

Forgot password? Click here to reset