Fast Object Detection in Compressed Video

11/27/2018
by   Shiyao Wang, et al.
0

Object detection in videos has drawn increasing attention recently since it is more important in real scenarios. Most of the deep learning methods for video analysis use convolutional neural networks designed for image-wise parsing in a video stream. But they usually ignore the fact that a video is generally stored and transmitted in a compressed data format. In this paper, we propose a fast object detection model that incorporates light-weight motion-aided memory network (MMNet), which can be directly used for H.264 compressed video. MMNet has two major advantages: 1) For a group of successive pictures (GOP) in a compressed video stream, it runs the heavy computational network for I-frames, i.e. a few reference frames in videos, while a light-weight memory network is designed to generate features for prediction frames called P-frames; 2) Unlike establishing an additional network to explicitly model motion among frames, we directly take full advantage of both motion vectors and residual errors that are all encoded in a compressed video. Such signals maintain spatial variations and are freely available. To our best knowledge, the MMNet is the first work that explores a convolutional detector on a compressed video and a motion-based memory in order to achieve significant speedup. Our model is evaluated on the large-scale ImageNet VID dataset, and the results show that it is about 3x times faster than single image detector R-FCN and 10x times faster than high performance detectors like FGFA and MANet.

READ FULL TEXT

page 3

page 6

page 7

research
04/16/2018

Towards High Performance Video Object Detection for Mobiles

Despite the recent success of video object detection on Desktop GPUs, it...
research
04/21/2000

Assisted Video Sequences Indexing : Motion Analysis Based on Interest Points

This work deals with content-based video indexing. Our viewpoint is semi...
research
07/07/2020

Single Shot Video Object Detector

Single shot detectors that are potentially faster and simpler than two-s...
research
04/16/2019

Fast object detection in compressed JPEG Images

Object detection in still images has drawn a lot of attention over past ...
research
09/29/2022

Speeding Up Action Recognition Using Dynamic Accumulation of Residuals in Compressed Domain

With the widespread use of installed cameras, video-based monitoring app...
research
01/06/2018

ReMotENet: Efficient Relevant Motion Event Detection for Large-scale Home Surveillance Videos

This paper addresses the problem of detecting relevant motion caused by ...
research
03/30/2020

TapLab: A Fast Framework for Semantic Video Segmentation Tapping into Compressed-Domain Knowledge

Real-time semantic video segmentation is a challenging task due to the s...

Please sign up or login with your details

Forgot password? Click here to reset