Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding

06/07/2022
by   Lingchen Meng, et al.

Leveraging large-scale data can introduce performance gains on many computer vision tasks. Unfortunately, this does not happen in object detection when a single model is trained on multiple datasets together. We observe two main obstacles: taxonomy differences and bounding box annotation inconsistency, which introduce domain gaps between datasets and prevent joint training. In this paper, we show that these two challenges can be effectively addressed by simply adapting object queries on the language embedding of categories per dataset. We design a detection hub to dynamically adapt queries on category embeddings based on the different distributions of datasets. Unlike previous methods that attempt to learn a joint embedding for all datasets, our adaptation method uses the language embedding as semantic centers for common categories, while learning the semantic bias towards specific categories belonging to different datasets to handle annotation differences and bridge the domain gaps. These improvements enable us to train a single detector end-to-end on multiple datasets simultaneously and take full advantage of them. Further experiments on joint training on multiple datasets demonstrate significant performance gains over separately fine-tuned individual detectors.
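
The "semantic center plus per-dataset bias" idea in the abstract can be illustrated with a toy sketch. This is not the paper's architecture. It only assumes that each category has a shared language embedding, that each dataset contributes an additive offset for the categories it annotates, and that a query feature is scored by cosine similarity. All names and numbers (CATEGORY_CENTERS, DATASET_BIAS, dataset_a, dataset_b) are hypothetical, and in a real system the centers would come from a pretrained text encoder and the offsets would be learned.

```python
import math

# Toy stand-ins for language embeddings of category names (hypothetical values;
# a real system would obtain these from a pretrained text encoder).
CATEGORY_CENTERS = {
    "person": [1.0, 0.0, 0.0],
    "car": [0.0, 1.0, 0.0],
    "dog": [0.0, 0.0, 1.0],
}

# Hypothetical per-dataset offsets ("semantic bias"). During training these
# would be learned parameters; here they are fixed numbers for illustration.
DATASET_BIAS = {
    "dataset_a": {"person": [0.0, 0.1, 0.0], "car": [0.0, 0.0, 0.0]},
    "dataset_b": {"person": [0.0, 0.0, 0.2], "dog": [0.1, 0.0, 0.0]},
}


def adapted_embeddings(dataset):
    """Return center + bias for every category annotated in `dataset`."""
    adapted = {}
    for name, bias in DATASET_BIAS[dataset].items():
        center = CATEGORY_CENTERS[name]
        adapted[name] = [c + b for c, b in zip(center, bias)]
    return adapted


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def classify(query_feature, dataset):
    """Pick the category whose adapted embedding best matches the query."""
    scores = {
        name: cosine(query_feature, emb)
        for name, emb in adapted_embeddings(dataset).items()
    }
    return max(scores, key=scores.get)


if __name__ == "__main__":
    print(classify([0.9, 0.1, 0.0], "dataset_a"))
    print(classify([0.0, 0.0, 1.0], "dataset_b"))
```

In this sketch the shared centers keep "person" close across both datasets, while the offsets let each dataset shift its own notion of the category slightly. Each dataset is also scored only against the categories it actually annotates.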

research
10/06/2021

Decoupled Adaptation for Cross-Domain Object Detection

Cross-domain object detection is more challenging than object classifica...
research
04/07/2023

Language-aware Multiple Datasets Detection Pretraining for DETRs

Pretraining on large-scale datasets can boost the performance of object ...
research
07/05/2021

Web-Scale Generic Object Detection at Microsoft Bing

In this paper, we present Generic Object Detection (GenOD), one of the l...
research
07/18/2014

LSDA: Large Scale Detection Through Adaptation

A major challenge in scaling object detection is the difficulty of obtai...
research
02/27/2023

LMSeg: Language-guided Multi-dataset Segmentation

It's a meaningful and attractive topic to build a general and inclusive ...
research
03/13/2023

Uni3D: A Unified Baseline for Multi-dataset 3D Object Detection

Current 3D object detection models follow a single dataset-specific trai...
research
02/25/2021

Simple multi-dataset detection

How do we build a general and broad object detection system? We use all ...
