Weakly Supervised Dataset Collection for Robust Person Detection

03/27/2020
by   Munetaka Minoguchi, et al.
27

To construct an algorithm that can provide robust person detection, we present a dataset with over 8 million images that was produced in a weakly supervised manner. Through labor-intensive human annotation, the person detection research community has produced relatively small datasets containing on the order of 100,000 images, such as the EuroCity Persons dataset, which includes 240,000 bounding boxes. Therefore, we have collected 8.7 million images of persons based on a two-step collection process, namely person detection with an existing detector and data refinement for false positive suppression. According to the experimental results, the Weakly Supervised Person Dataset (WSPD) is simple yet effective for person detection pre-training. In the context of pre-trained person detection algorithms, our WSPD pre-trained model has 13.38 and 6.38 trained on the fully supervised ImageNet and EuroCity Persons datasets, respectively, when verified with the Caltech Pedestrian.

READ FULL TEXT

page 2

page 3

page 4

page 5

page 8

research
03/30/2021

DAP: Detection-Aware Pre-training with Weak Supervision

This paper presents a detection-aware pre-training (DAP) approach, which...
research
03/08/2022

Language Matters: A Weakly Supervised Pre-training Approach for Scene Text Detection and Spotting

Recently, Vision-Language Pre-training (VLP) techniques have greatly ben...
research
06/19/2021

Exploring Visual Context for Weakly Supervised Person Search

Person search has recently emerged as a challenging task that jointly ad...
research
09/13/2021

Weakly Supervised Person Search with Region Siamese Networks

Supervised learning is dominant in person search, but it requires elabor...
research
07/14/2020

Tackling the Problem of Limited Data and Annotations in Semantic Segmentation

In this work, the case of semantic segmentation on a small image dataset...
research
10/24/2020

Weakly-supervised VisualBERT: Pre-training without Parallel Images and Captions

Pre-trained contextual vision-and-language (V L) models have brought i...
research
04/08/2019

Weakly Supervised Person Re-identification: Cost-effective Learning with A New Benchmark

Person re-identification (ReID) benefits greatly from the accurate annot...

Please sign up or login with your details

Forgot password? Click here to reset