Cheaper Pre-training Lunch: An Efficient Paradigm for Object Detection

04/25/2020
by   Dongzhan Zhou, et al.
1

In this paper, we propose a general and efficient pre-training paradigm, Jigsaw pre-training, for object detection. Jigsaw pre-training needs only the target detection dataset while taking only 1/4 computational resources compared to the widely adopted ImageNet pre-training. To build such an efficient paradigm, we reduce the potential redundancy by carefully extracting useful samples from the original images, assembling samples in a Jigsaw manner as input, and using an ERF-adaptive dense classification strategy for model pre-training. These designs include not only a new input pattern to improve the spatial utilization but also a novel learning objective to expand the effective receptive field of the pre-trained model. The efficiency and superiority of Jigsaw pre-training are validated by extensive experiments on the MS-COCO dataset, where the results indicate that the models using Jigsaw pre-training are able to achieve on-par or even better detection performances compared with the ImageNet pre-trained counterparts.

READ FULL TEXT

page 6

page 10

research
11/21/2018

Rethinking ImageNet Pre-training

We report competitive results on object detection and instance segmentat...
research
11/23/2018

Revisiting Pre-training: An Efficient Training Method for Image Classification

The training method of repetitively feeding all samples into a pre-defin...
research
06/11/2020

Rethinking Pre-training and Self-training

Pre-training is a dominant paradigm in computer vision. For example, sup...
research
06/08/2022

Delving into the Pre-training Paradigm of Monocular 3D Object Detection

The labels of monocular 3D object detection (M3OD) are expensive to obta...
research
03/24/2022

BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training

Multiple datasets and open challenges for object detection have been int...
research
11/17/2022

Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information

To effectively exploit the potential of large-scale models, various pre-...
research
05/14/2020

Temperate Fish Detection and Classification: a Deep Learning based Approach

A wide range of applications in marine ecology extensively uses underwat...

Please sign up or login with your details

Forgot password? Click here to reset