Long Term Temporal Context for Per-Camera Object Detection

12/07/2019
by   Sara Beery, et al.
1

In static monitoring cameras, useful contextual information can stretch far beyond the few seconds typical video understanding models might see: subjects may exhibit similar behavior over multiple days, and background objects remain static. However, due to power and storage constraints, sampling frequencies are low, often no faster than one frame per second, and sometimes are irregular due to the use of a motion trigger. In order to perform well in this setting, models must be robust to irregular sampling rates. In this paper we propose an attention-based approach that allows our model to index into a long term memory bank constructed on a per-camera basis and aggregate contextual features from other frames to boost object detection performance on the current frame. We apply our models to two settings: (1) species detection using camera trap data, which is sampled at a low, variable frame rate based on a motion trigger and used to study biodiversity, and (2) vehicle detection in traffic cameras, which have similarly low frame rate. We show that our model leads to performance gains over strong baselines in all settings. Moreover, we show that increasing the time horizon for our memory bank leads to improved results. When applied to camera trap data from the Snapshot Serengeti dataset, our best model which leverages context from up to a month of images outperforms the single-frame baseline by 17.9 baseline) by 11.2

READ FULL TEXT

page 1

page 2

page 4

page 6

page 7

research
06/14/2023

Predict to Detect: Prediction-guided 3D Object Detection using Sequential Images

Recent camera-based 3D object detection methods have introduced sequenti...
research
06/09/2023

DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds

Existing offboard 3D detectors always follow a modular pipeline design t...
research
05/21/2020

Joint Detection and Tracking in Videos with Identification Features

Recent works have shown that combining object detection and tracking tas...
research
10/13/2021

Fast Hand Detection in Collaborative Learning Environments

Long-term object detection requires the integration of frame-based resul...
research
02/10/2023

Virtually increasing the measurement frequency of LIDAR sensor utilizing a single RGB camera

The frame rates of most 3D LIDAR sensors used in intelligent vehicles ar...
research
09/16/2018

CADP: A Novel Dataset for CCTV Traffic Camera based Accident Analysis

This paper presents a novel dataset for traffic accidents analysis. Our ...
research
05/25/2016

Design and Implementation of a Novel Compatible Encoding Scheme in the Time Domain for Image Sensor Communication

This paper presents a modulation scheme in the time domain based on On-O...

Please sign up or login with your details

Forgot password? Click here to reset