Improving Object Detection with Selective Self-supervised Self-training

07/17/2020
by Yandong Li, et al.

We study how to leverage Web images to augment human-curated object detection datasets. Our approach is two-pronged. First, we retrieve Web images by image-to-image search, which incurs less domain shift from the curated data than other search methods. The Web images are diverse, supplying a wide variety of object poses, appearances, and interactions with the surrounding context. Second, we propose a novel learning method motivated by two parallel lines of work that exploit unlabeled data for image classification: self-training and self-supervised learning. In their vanilla forms, both fail to improve object detectors because of the domain gap between the Web images and the curated datasets. To tackle this challenge, we propose a selective net that rectifies the supervision signals in Web images. It not only identifies positive bounding boxes but also creates a safe zone for mining hard negative boxes. We report state-of-the-art results on detecting backpacks and chairs in everyday scenes, along with other challenging object classes.

research
02/16/2021

Instance Localization for Self-supervised Detection Pretraining

Prior research on self-supervised learning has led to considerable progr...
research
08/12/2020

Co-training for On-board Deep Object Detection

Providing ground truth supervision to train visual models has been a bot...
research
11/27/2020

Self-EMD: Self-Supervised Object Detection without ImageNet

In this paper, we propose a novel self-supervised representation learnin...
research
03/22/2020

Exploring Bottom-up and Top-down Cues with Attentive Learning for Webly Supervised Object Detection

Fully supervised object detection has achieved great success in recent y...
research
06/08/2021

DETReg: Unsupervised Pretraining with Region Priors for Object Detection

Unsupervised pretraining has recently proven beneficial for computer vis...
research
03/10/2018

Webly Supervised Learning with Category-level Semantic Information

As tons of photos are being uploaded to public websites (e.g., Flickr, B...
research
05/29/2019

Disentangling Monocular 3D Object Detection

In this paper we propose an approach for monocular 3D object detection f...
