ReMotENet: Efficient Relevant Motion Event Detection for Large-scale Home Surveillance Videos

01/06/2018
by   Ruichi Yu, et al.
0

This paper addresses the problem of detecting relevant motion caused by objects of interest (e.g., person and vehicles) in large scale home surveillance videos. The traditional method usually consists of two separate steps, i.e., detecting moving objects with background subtraction running on the camera, and filtering out nuisance motion events (e.g., trees, cloud, shadow, rain/snow, flag) with deep learning based object detection and tracking running on cloud. The method is extremely slow and therefore not cost effective, and does not fully leverage the spatial-temporal redundancies with a pre-trained off-the-shelf object detector. To dramatically speedup relevant motion event detection and improve its performance, we propose a novel network for relevant motion event detection, ReMotENet, which is a unified, end-to-end data-driven method using spatial-temporal attention-based 3D ConvNets to jointly model the appearance and motion of objects-of-interest in a video. ReMotENet parses an entire video clip in one forward pass of a neural network to achieve significant speedup. Meanwhile, it exploits the properties of home surveillance videos, e.g., relevant motion is sparse both spatially and temporally, and enhances 3D ConvNets with a spatial-temporal attention model and reference-frame subtraction to encourage the network to focus on the relevant moving objects. Experiments demonstrate that our method can achieve comparable or event better performance than the object detection based method but with three to four orders of magnitude speedup (up to 20k times) on GPU devices. Our network is efficient, compact and light-weight. It can detect relevant motion on a 15s surveillance video clip within 4-8 milliseconds on a GPU and a fraction of second (0.17-0.39) on a CPU with a model size of less than 1MB.

READ FULL TEXT

page 6

page 8

page 10

research
08/11/2016

Automatic detection of moving objects in video surveillance

This work is in the field of video surveillance including motion detecti...
research
12/18/2017

Spatial-Temporal Memory Networks for Video Object Detection

We introduce Spatial-Temporal Memory Networks (STMN) for video object de...
research
05/30/2022

Time3D: End-to-End Joint Monocular 3D Object Detection and Tracking for Autonomous Driving

While separately leveraging monocular 3D object detection and 2D multi-o...
research
11/17/2022

TempNet: Temporal Attention Towards the Detection of Animal Behaviour in Videos

Recent advancements in cabled ocean observatories have increased the qua...
research
11/27/2018

Fast Object Detection in Compressed Video

Object detection in videos has drawn increasing attention recently since...
research
08/29/2019

Great Ape Detection in Challenging Jungle Camera Trap Footage via Attention-Based Spatial and Temporal Feature Blending

We propose the first multi-frame video object detection framework traine...
research
09/03/2019

Online Pedestrian Group Walking Event Detection Using Spectral Analysis of Motion Similarity Graph

A method for online identification of group of moving objects in the vid...

Please sign up or login with your details

Forgot password? Click here to reset