SemAug: Semantically Meaningful Image Augmentations for Object Detection Through Language Grounding

08/15/2022
by Morgan Heisler, et al.

Data augmentation is an essential technique for improving the generalization of deep neural networks. Most existing image-domain augmentations rely either on geometric and structural transformations or on photometric distortions. In this paper, we propose an effective image augmentation technique that injects contextually meaningful knowledge into scenes. Our method, SemAug (semantically meaningful image augmentation for object detection via language grounding), first determines which semantically appropriate new objects can be placed into an image and where they belong (the what and where problems). It then embeds these objects at their target locations, thereby increasing the diversity of the object instance distribution. The method can introduce new object instances and categories that may not exist in the training set at all. Furthermore, it does not require the additional overhead of training a context network, so it can be easily added to existing architectures. Our comprehensive evaluations show that the proposed method improves generalization effectively with negligible overhead. In particular, across a wide range of model architectures, our method achieved ∼2-4% and ∼1-2% mAP improvements for object detection on the Pascal VOC and COCO datasets, respectively.
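
The what and where steps can be pictured with a small sketch. This is only an illustration under stated assumptions, not the authors' implementation: the three-dimensional embedding vectors are made up, the function names `pick_what` and `pick_where` are hypothetical, and the placement rule (put the new object just to the right of an existing box, bottom-aligned and clamped to the image) is a naive stand-in for whatever placement logic the paper actually uses.

```python
import math

# Toy "word embeddings". The values are invented for illustration only;
# a real system would load pretrained language embeddings instead.
TOY_EMBEDDINGS = {
    "dog": (0.9, 0.1, 0.0),
    "cat": (0.8, 0.2, 0.1),
    "sofa": (0.1, 0.9, 0.2),
    "airplane": (0.0, 0.1, 0.9),
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def pick_what(scene_labels, bank_labels, embeddings=TOY_EMBEDDINGS):
    """Hypothetical 'what' step: choose the object-bank category whose
    embedding is, on average, closest to the labels already in the scene."""
    def mean_similarity(candidate):
        sims = [cosine(embeddings[candidate], embeddings[s]) for s in scene_labels]
        return sum(sims) / len(sims)

    # Only consider categories that are not already present in the scene.
    candidates = [c for c in bank_labels if c not in scene_labels]
    return max(candidates, key=mean_similarity)


def pick_where(anchor_box, image_size, patch_size):
    """Hypothetical 'where' step: top-left corner for the new patch, placed
    right of the anchor box, bottom-aligned, and clamped inside the image.
    Assumes the patch is no larger than the image."""
    _, _, x1, y1 = anchor_box          # anchor box is (x0, y0, x1, y1)
    img_w, img_h = image_size
    patch_w, patch_h = patch_size
    x = min(max(x1, 0), img_w - patch_w)
    y = min(max(y1 - patch_h, 0), img_h - patch_h)
    return x, y


if __name__ == "__main__":
    new_label = pick_what(["dog"], ["cat", "sofa", "airplane"])
    corner = pick_where((10, 20, 50, 80), (100, 100), (30, 30))
    print(new_label, corner)
```

With the toy vectors above, a scene containing only a dog selects "cat" from the bank, since its embedding is the nearest neighbour. Because selection depends only on label embeddings, a category absent from the training annotations can still be chosen, which matches the claim that new categories can be introduced.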


