Plug & Play Convolutional Regression Tracker for Video Object Detection

03/02/2020
by Ye Lyu, et al.

Video object detection aims to simultaneously localize the bounding boxes of objects and identify their classes in a given video. One challenge is to detect all objects consistently across the whole video. Because the appearance of objects may deteriorate in some frames, features or detections from other frames are commonly used to enhance the prediction. In this paper, we propose a Plug & Play scale-adaptive convolutional regression tracker for the video object detection task, which can be easily and compatibly integrated into current state-of-the-art detection networks. Because the tracker reuses the features from the detector, it is a very lightweight addition to the detection network. The whole network runs at a speed close to that of a standard object detector. With our new video object detection pipeline design, image object detectors can be easily turned into efficient video object detectors without modifying any parameters. The performance is evaluated on the large-scale ImageNet VID dataset. Our Plug & Play design improves the mAP score of the image detector by around 5% with only a small drop in speed.
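The feature-sharing idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function `run_video_pipeline`, the callables `extract_features`, `detect_head`, and `track_head`, and the rule for merging tracked boxes with detections are all hypothetical stand-ins chosen for illustration. The sketch only shows that backbone features are computed once per frame and passed to both the detector head and the tracker, which is why the tracker adds little cost.

```python
from typing import Callable, List, Sequence, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2)


def run_video_pipeline(
    frames: Sequence[object],
    extract_features: Callable[[object], object],
    detect_head: Callable[[object], List[Box]],
    track_head: Callable[[object, object, List[Box]], List[Box]],
) -> List[List[Box]]:
    """Hypothetical detect-and-track loop; every callable is a stand-in."""
    outputs: List[List[Box]] = []
    prev_features = None
    prev_boxes: List[Box] = []
    for frame in frames:
        # Backbone features are computed once per frame and shared.
        features = extract_features(frame)
        boxes = detect_head(features)
        if prev_features is not None and prev_boxes:
            # The tracker consumes the detector's features instead of
            # running its own backbone, so the extra cost stays small.
            tracked = track_head(prev_features, features, prev_boxes)
            # Naive merge rule (an assumption): fall back to tracked boxes
            # only when the detector found nothing in this frame.
            if not boxes:
                boxes = tracked
        outputs.append(boxes)
        prev_features, prev_boxes = features, boxes
    return outputs


# Toy usage: the "detector" misses the object in the middle frame.
calls = {"backbone": 0}


def toy_features(frame):
    calls["backbone"] += 1
    return frame


def toy_detect(features):
    return [] if features == 1 else [(0.0, 0.0, 10.0, 10.0)]


def toy_track(prev_features, cur_features, prev_boxes):
    # Stand-in for a regression tracker: carries boxes over unchanged.
    return list(prev_boxes)


result = run_video_pipeline([0, 1, 2], toy_features, toy_detect, toy_track)
```

In this toy run the backbone is called once per frame, and the missed detection in the middle frame is filled in from the tracked box. A real system would replace the merge rule with something stronger, such as score-based association across frames.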


