Speed/accuracy trade-offs for modern convolutional object detectors

by Jonathan Huang, et al.

The goal of this paper is to serve as a guide for selecting a detection architecture that achieves the right speed/memory/accuracy balance for a given application and platform. To this end, we investigate various ways to trade accuracy for speed and memory usage in modern convolutional object detection systems. A number of successful systems have been proposed in recent years, but apples-to-apples comparisons are difficult due to different base feature extractors (e.g., VGG, Residual Networks), different default image resolutions, and different hardware and software platforms. We present a unified implementation of the Faster R-CNN [Ren et al., 2015], R-FCN [Dai et al., 2016] and SSD [Liu et al., 2015] systems, which we view as "meta-architectures", and trace out the speed/accuracy trade-off curve created by using alternative feature extractors and varying other critical parameters, such as image size, within each of these meta-architectures. At one extreme of this spectrum, where speed and memory are critical, we present a detector that achieves real-time speeds and can be deployed on a mobile device. At the opposite extreme, where accuracy is critical, we present a detector that achieves state-of-the-art performance measured on the COCO detection task.
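As a rough illustration of what tracing a speed/accuracy trade-off curve can mean in practice, the sketch below keeps only the configurations that no faster configuration matches or beats on accuracy (a Pareto frontier). It is a minimal sketch under stated assumptions, not the paper's procedure: the `DetectorConfig` type, the `pareto_frontier` helper, the configuration names, and every number are hypothetical placeholders, not measurements from the paper.

```python
from typing import List, NamedTuple


class DetectorConfig(NamedTuple):
    """One (meta-architecture, feature extractor, image size) choice.

    All fields are hypothetical; the values used below are placeholders,
    not results reported in the paper.
    """
    name: str
    latency_ms: float  # lower is better
    accuracy: float    # mAP-like score, higher is better


def pareto_frontier(configs: List[DetectorConfig]) -> List[DetectorConfig]:
    """Return the configs that are not dominated on (latency, accuracy)."""
    frontier = []
    best_accuracy = float("-inf")
    # Visit fastest first; on equal latency, visit the more accurate one first.
    for cfg in sorted(configs, key=lambda c: (c.latency_ms, -c.accuracy)):
        # Keep a config only if it beats every faster config on accuracy.
        if cfg.accuracy > best_accuracy:
            frontier.append(cfg)
            best_accuracy = cfg.accuracy
    return frontier


# Placeholder values chosen only to exercise the function.
configs = [
    DetectorConfig("config_a", 30.0, 0.20),
    DetectorConfig("config_b", 80.0, 0.28),
    DetectorConfig("config_c", 90.0, 0.25),  # slower and less accurate than config_b
    DetectorConfig("config_d", 250.0, 0.35),
]
frontier_names = [c.name for c in pareto_frontier(configs)]
print(frontier_names)
```

With real measurements in place of the placeholders, the surviving configurations are the candidates worth choosing between for a given latency budget.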




