Simple Training Strategies and Model Scaling for Object Detection

06/30/2021
by   Xianzhi Du, et al.
0

The speed-accuracy Pareto curve of object detection systems have advanced through a combination of better model architectures, training and inference methods. In this paper, we methodically evaluate a variety of these techniques to understand where most of the improvements in modern detection systems come from. We benchmark these improvements on the vanilla ResNet-FPN backbone with RetinaNet and RCNN detectors. The vanilla detectors are improved by 7.7 accuracy while being 30 strategies to generate family of models that form two Pareto curves, named RetinaNet-RS and Cascade RCNN-RS. These simple rescaled detectors explore the speed-accuracy trade-off between the one-stage RetinaNet detectors and two-stage RCNN detectors. Our largest Cascade RCNN-RS models achieve 52.9 with a ResNet152-FPN backbone and 53.6 we show the ResNet architecture, with three minor architectural changes, outperforms EfficientNet as the backbone for object detection and instance segmentation systems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/09/2019

CBNet: A Novel Composite Backbone Network Architecture for Object Detection

In existing CNN based detectors, the backbone network is a very importan...
research
08/11/2022

Optimizing Anchor-based Detectors for Autonomous Driving Scenes

This paper summarizes model improvements and inference-time optimization...
research
07/26/2023

YOLOBench: Benchmarking Efficient Object Detectors on Embedded Systems

We present YOLOBench, a benchmark comprised of 550+ YOLO-based object de...
research
03/13/2021

Revisiting ResNets: Improved Training and Scaling Strategies

Novel computer vision architectures monopolize the spotlight, but the im...
research
07/09/2018

Pooling Pyramid Network for Object Detection

We'd like to share a simple tweak of Single Shot Multibox Detector (SSD)...
research
04/22/2019

An Energy and GPU-Computation Efficient Backbone Network for Real-Time Object Detection

As DenseNet conserves intermediate features with diverse receptive field...
research
02/05/2019

Revisiting a single-stage method for face detection

Although accurate, two-stage face detectors usually require more inferen...

Please sign up or login with your details

Forgot password? Click here to reset