AutoFocus: Efficient Multi-Scale Inference

12/04/2018
by   Mahyar Najibi, et al.
4

This paper describes AutoFocus, an efficient multi-scale inference algorithm for deep-learning based object detectors. Instead of processing an entire image pyramid, AutoFocus adopts a coarse to fine approach and only processes regions which are likely to contain small objects at finer scales. This is achieved by predicting category agnostic segmentation maps for small objects at coarser scales, called FocusPixels. FocusPixels can be predicted with high recall, and in many cases, they only cover a small fraction of the entire image. To make efficient use of FocusPixels, an algorithm is proposed which generates compact rectangular FocusChips which enclose FocusPixels. The detector is only applied inside FocusChips, which reduces computation while processing finer scales. Different types of error can arise when detections from FocusChips of multiple scales are combined, hence techniques to correct them are proposed. AutoFocus obtains an mAP of 47.9 processing 6.4 images per second on a Titan X (Pascal) GPU. This is 2.5X faster than our multi-scale baseline detector and matches its mAP. The number of pixels processed in the pyramid can be reduced by 5X with a 1 AutoFocus obtains more than 10 same speed with the same ResNet-101 backbone.

READ FULL TEXT

page 3

page 4

page 5

page 8

research
02/10/2021

Scale Normalized Image Pyramids with AutoFocus for Object Detection

We present an efficient foveal framework to perform object detection. A ...
research
05/23/2018

SNIPER: Efficient Multi-Scale Training

We present SNIPER, an algorithm for performing efficient multi-scale tra...
research
02/19/2022

MSSNet: Multi-Scale-Stage Network for Single Image Deblurring

Most of traditional single image deblurring methods before deep learning...
research
07/28/2019

It's All About The Scale -- Efficient Text Detection Using Adaptive Scaling

"Text can appear anywhere". This property requires us to carefully proce...
research
06/15/2019

EXTD: Extremely Tiny Face Detector via Iterative Filter Reuse

In this paper, we propose a new multi-scale face detector having an extr...
research
07/10/2018

Big-Little Net: An Efficient Multi-Scale Feature Representation for Visual and Speech Recognition

In this paper, we propose a novel Convolutional Neural Network (CNN) arc...
research
01/31/2021

Tone Mapping Based on Multi-scale Histogram Synthesis

In this paper, we present a novel tone mapping algorithm that can be use...

Please sign up or login with your details

Forgot password? Click here to reset