Proper Reuse of Image Classification Features Improves Object Detection

04/01/2022
by Cristina Vasconcelos, et al.

A common practice in transfer learning is to initialize the downstream model weights by pre-training on a data-abundant upstream task. In object detection specifically, the feature backbone is typically initialized with ImageNet classifier weights and then fine-tuned on the detection task. Recent works show that this is not strictly necessary under longer training regimes and provide recipes for training the backbone from scratch. We investigate the opposite direction of this end-to-end training trend: we show that an extreme form of knowledge preservation, namely freezing the classifier-initialized backbone, consistently improves many different detection models and leads to considerable resource savings. We hypothesize, and corroborate experimentally, that the capacity and structure of the remaining detector components are crucial factors in leveraging the frozen backbone. Immediate applications of our findings include performance improvements on hard cases such as the detection of long-tail object classes, as well as computational and memory savings that help make the field more accessible to researchers with fewer computational resources.
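To make the idea of a frozen backbone concrete, the following is a minimal, framework-free sketch; it is not the authors' implementation. It assumes a toy detector whose parameters are grouped by name prefix (`backbone.`, `neck.`, `head.`), and every name in it (`Param`, `freeze`, `sgd_step`, the layer names, the weight values) is hypothetical. It only illustrates that frozen parameters are skipped during the update, which is where the savings in gradient and optimizer state would come from.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Param:
    """A toy parameter: a flat list of weights plus a trainable flag."""
    values: List[float]
    trainable: bool = True


def freeze(params: Dict[str, Param], prefix: str) -> None:
    """Mark every parameter whose name starts with `prefix` as frozen."""
    for name, p in params.items():
        if name.startswith(prefix):
            p.trainable = False


def sgd_step(params: Dict[str, Param],
             grads: Dict[str, List[float]],
             lr: float) -> None:
    """Plain SGD update that skips frozen parameters entirely."""
    for name, p in params.items():
        if not p.trainable:
            continue  # frozen: weights stay exactly as initialized
        p.values = [w - lr * g for w, g in zip(p.values, grads[name])]


def trainable_count(params: Dict[str, Param]) -> int:
    """Number of scalar weights that still receive updates."""
    return sum(len(p.values) for p in params.values() if p.trainable)


# Hypothetical detector: a backbone standing in for classifier-initialized
# weights, plus freshly initialized neck and head components.
params = {
    "backbone.conv1": Param([0.5, -0.25, 1.0]),
    "backbone.conv2": Param([0.75, 0.5]),
    "neck.fpn": Param([0.0, 0.0]),
    "head.box": Param([0.0]),
}

# Dummy gradients of 1.0 for every weight (assumption, for illustration only).
grads = {name: [1.0] * len(p.values) for name, p in params.items()}

freeze(params, "backbone.")      # keep the pre-trained features untouched
sgd_step(params, grads, lr=0.5)  # only the neck and head are updated
```

After the step, the `backbone.*` weights are unchanged while the neck and head weights have moved. In a real detector, the same pattern would typically be expressed through the framework's own mechanism for excluding parameters from gradient computation.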


research
09/21/2023

DEYOv3: DETR with YOLO for Real-time Object Detection

Recently, end-to-end object detectors have gained significant attention ...
research
03/30/2022

Exploring Plain Vision Transformer Backbones for Object Detection

We explore the plain, non-hierarchical Vision Transformer (ViT) as a bac...
research
04/17/2018

DetNet: A Backbone network for Object Detection

Recent CNN based object detectors, no matter one-stage methods like YOLO...
research
02/09/2022

GiraffeDet: A Heavy-Neck Paradigm for Object Detection

In conventional object detection frameworks, a backbone body inherited f...
research
06/14/2022

Efficient Decoder-free Object Detection with Transformers

Vision transformers (ViTs) are changing the landscape of object detectio...
research
03/26/2023

Mind the Backbone: Minimizing Backbone Distortion for Robust Object Detection

Building object detectors that are robust to domain shifts is critical f...
research
10/22/2020

Efficient Scale-Permuted Backbone with Learned Resource Distribution

Recently, SpineNet has demonstrated promising results on object detectio...
