Optimizing Video Object Detection via a Scale-Time Lattice

04/16/2018
by   Kai Chen, et al.
0

High-performance object detection relies on expensive convolutional networks to compute features, often leading to significant challenges in applications, e.g. those that require detecting objects from video streams in real time. The key to this problem is to trade accuracy for efficiency in an effective way, i.e. reducing the computing cost while maintaining competitive performance. To seek a good balance, previous efforts usually focus on optimizing the model architectures. This paper explores an alternative approach, that is, to reallocate the computation over a scale-time space. The basic idea is to perform expensive detection sparsely and propagate the results across both scales and time with substantially cheaper networks, by exploiting the strong correlations among them. Specifically, we present a unified framework that integrates detection, temporal propagation, and across-scale refinement on a Scale-Time Lattice. On this framework, one can explore various strategies to balance performance and cost. Taking advantage of this flexibility, we further develop an adaptive scheme with the detector invoked on demand and thus obtain improved tradeoff. On ImageNet VID dataset, the proposed method can achieve a competitive mAP 79.6 tradeoff.

READ FULL TEXT

page 3

page 7

research
11/30/2017

Towards High Performance Video Object Detection

There has been significant progresses for image object detection in rece...
research
07/22/2022

QueryProp: Object Query Propagation for High-Performance Video Object Detection

Video object detection has been an important yet challenging topic in co...
research
03/01/2018

TSSD: Temporal Single-Shot Object Detection Based on Attention-Aware LSTM

Temporal object detection has attracted significant attention, but most ...
research
01/17/2023

Rethinking Lightweight Salient Object Detection via Network Depth-Width Tradeoff

Existing salient object detection methods often adopt deeper and wider n...
research
03/26/2021

MultiScope: Efficient Video Pre-processing for Exploratory Video Analytics

Performing analytics tasks over large-scale video datasets is increasing...
research
11/30/2012

Viewpoint Invariant Object Detector

Object Detection is the task of identifying the existence of an object c...
research
12/12/2022

Optimizing ship detection efficiency in SAR images

The detection and prevention of illegal fishing is critical to maintaini...

Please sign up or login with your details

Forgot password? Click here to reset