VEIL: Vetting Extracted Image Labels from In-the-Wild Captions for Weakly-Supervised Object Detection

03/16/2023
by   Arushi Rai, et al.
0

The use of large-scale vision-language datasets is limited for object detection due to the negative impact of label noise on localization. Prior methods have shown how such large-scale datasets can be used for pretraining, which can provide initial signal for localization, but is insufficient without clean bounding-box data for at least some categories. We propose a technique to "vet" labels extracted from noisy captions. Our method trains a classifier that predicts if an extracted label is actually present in the image or not. Our classifier generalizes across dataset boundaries and shows promise for generalizing across categories as well. We compare the classifier to eleven baselines on five datasets, and demonstrate that it can improve weakly-supervised detection without label vetting by 80 evaluated on PASCAL VOC).

READ FULL TEXT

page 1

page 6

page 7

page 14

page 16

page 17

research
11/20/2020

Open-Vocabulary Object Detection Using Captions

Despite the remarkable accuracy of deep neural networks in object detect...
research
07/31/2020

Weakly supervised one-stage vision and language disease detection using large scale pneumonia and pneumothorax studies

Detecting clinically relevant objects in medical images is a challenge d...
research
07/23/2019

Cap2Det: Learning to Amplify Weak Caption Supervision for Object Detection

Learning to localize and name object instances is a fundamental problem ...
research
07/27/2017

Exploiting Web Images for Weakly Supervised Object Detection

In recent years, the performance of object detection has advanced signif...
research
06/05/2021

GLSD: The Global Large-Scale Ship Database and Baseline Evaluations

In this paper, we introduce a challenging global large-scale ship databa...
research
08/30/2022

Weakly Supervised Faster-RCNN+FPN to classify animals in camera trap images

Camera traps have revolutionized the animal research of many species tha...
research
03/20/2023

Boosting Weakly Supervised Object Detection using Fusion and Priors from Hallucinated Depth

Despite recent attention and exploration of depth for various tasks, it ...

Please sign up or login with your details

Forgot password? Click here to reset