CLIP the Gap: A Single Domain Generalization Approach for Object Detection

01/13/2023
by Vidit Vidit, et al.

Single Domain Generalization (SDG) tackles the problem of training a model on a single source domain so that it generalizes to any unseen target domain. While this has been well studied for image classification, the literature on SDG object detection remains almost non-existent. To address the challenges of simultaneously learning robust object localization and representation, we propose to leverage a pre-trained vision-language model to introduce semantic domain concepts via textual prompts. We achieve this via a semantic augmentation strategy acting on the features extracted by the detector backbone, as well as a text-based classification loss. Our experiments demonstrate the benefits of our approach, which outperforms the only existing SDG object detection method, Single-DGOD [49], by 10% on its own diverse weather-driving benchmark.
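The two ingredients named in the abstract, a semantic augmentation applied to backbone features and a text-based classification loss, can be pictured with a small toy sketch. This is not the authors' implementation: the vectors below are hand-made stand-ins for embeddings that a vision-language model would produce, and the function names, the additive form of the augmentation, and the temperature value are all assumptions made for illustration.

```python
import math

def normalize(v):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]

def text_logits(feature, class_text_embeddings, temperature=0.07):
    """Cosine similarity between one feature and each class text embedding.

    The temperature is a hypothetical value chosen for this sketch.
    """
    f = normalize(feature)
    return [
        sum(a * b for a, b in zip(f, normalize(t))) / temperature
        for t in class_text_embeddings
    ]

def cross_entropy(logits, target):
    """Softmax cross-entropy for a single sample (numerically stable)."""
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(l - m) for l in logits))
    return log_sum - logits[target]

def augment(feature, offset):
    """Assumed additive augmentation: shift a feature by a domain offset."""
    return [f + o for f, o in zip(feature, offset)]

# Toy stand-ins for text embeddings of two class prompts (index 0 and 1).
class_text_embeddings = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]

# Toy stand-in for a region feature of an object of class 0.
feature = [0.9, 0.1, 0.0]

# Hypothetical offset standing in for a prompt-derived domain shift.
domain_offset = [0.0, 0.0, 0.5]

augmented = augment(feature, domain_offset)
loss_clean = cross_entropy(text_logits(feature, class_text_embeddings), 0)
loss_augmented = cross_entropy(text_logits(augmented, class_text_embeddings), 0)
print(loss_clean, loss_augmented)
```

In this toy setting the shifted feature is still classified against the same class text embeddings, which is the general idea of training on features pushed toward other domains while keeping the class supervision anchored in text.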


research
03/07/2022

An Unsupervised Domain Adaptive Approach for Multimodal 2D Object Detection in Adverse Weather Conditions

Integrating different representations from complementary sensing modalit...
research
05/05/2022

InvNorm: Domain Generalization for Object Detection in Gastrointestinal Endoscopy

Domain Generalization is a challenging topic in computer vision, especia...
research
01/11/2023

Adversarial Alignment for Source Free Object Detection

Source-free object detection (SFOD) aims to transfer a detector pre-trai...
research
03/10/2022

Domain Generalisation for Object Detection

Domain generalisation aims to promote the learning of domain-invariant f...
research
07/04/2023

SRCD: Semantic Reasoning with Compound Domains for Single-Domain Generalized Object Detection

This paper provides a novel framework for single-domain generalized obje...
research
08/18/2022

Prompt Vision Transformer for Domain Generalization

Though vision transformers (ViTs) have exhibited impressive ability for ...
research
12/19/2022

Million-scale Object Detection with Large Vision Model

Over the past few years, developing a broad, universal, and general-purp...
