Pooling Pyramid Network for Object Detection

07/09/2018
by   Pengchong Jin, et al.
0

We'd like to share a simple tweak of Single Shot Multibox Detector (SSD) family of detectors, which is effective in reducing model size while maintaining the same quality. We share box predictors across all scales, and replace convolution between scales with max pooling. This has two advantages over vanilla SSD: (1) it avoids score miscalibration across scales; (2) the shared predictor sees the training data over all scales. Since we reduce the number of predictors to one, and trim all convolutions between them, model size is significantly smaller. We empirically show that these changes do not hurt model quality compared to vanilla SSD.

READ FULL TEXT

page 1

page 2

page 3

research
12/04/2017

FSSD: Feature Fusion Single Shot Multibox Detector

SSD (Single Shot Multibox Detetor) is one of the best object detection a...
research
03/22/2018

Single-Shot Bidirectional Pyramid Networks for High-Quality Object Detection

Recent years have witnessed many exciting achievements for object detect...
research
06/30/2021

Simple Training Strategies and Model Scaling for Object Detection

The speed-accuracy Pareto curve of object detection systems have advance...
research
06/16/2022

Delving into the Scale Variance Problem in Object Detection

Object detection has made substantial progress in the last decade, due t...
research
07/02/2019

CSSegNet: Fine-Grained Cardiac Structures Segmentation Using Dilated Pyramid Pooling in U-net

Cardiac structure segmentation plays an important role in medical analys...
research
06/29/2023

BinaryViT: Pushing Binary Vision Transformers Towards Convolutional Models

With the increasing popularity and the increasing size of vision transfo...

Please sign up or login with your details

Forgot password? Click here to reset