Spatiotemporal Filtering for Event-Based Action Recognition

03/17/2019
by   Rohan Ghosh, et al.
0

In this paper, we address the challenging problem of action recognition, using event-based cameras. To recognise most gestural actions, often higher temporal precision is required for sampling visual information. Actions are defined by motion, and therefore, when using event-based cameras it is often unnecessary to re-sample the entire scene. Neuromorphic, event-based cameras have presented an alternative to visual information acquisition by asynchronously time-encoding pixel intensity changes, through temporally precise spikes (10 micro-second resolution), making them well equipped for action recognition. However, other challenges exist, which are intrinsic to event-based imagers, such as higher signal-to-noise ratio, and a spatiotemporally sparse information. One option is to convert event-data into frames, but this could result in significant temporal precision loss. In this work we introduce spatiotemporal filtering in the spike-event domain, as an alternative way of channeling spatiotemporal information through to a convolutional neural network. The filters are local spatiotemporal weight matrices, learned from the spike-event data, in an unsupervised manner. We find that appropriate spatiotemporal filtering significantly improves CNN performance beyond state-of-the-art on the event-based DVS Gesture dataset. On our newly recorded action recognition dataset, our method shows significant improvement when compared with other, standard ways of generating the spatiotemporal filters.

READ FULL TEXT

page 4

page 6

page 8

research
03/07/2023

Event Voxel Set Transformer for Spatiotemporal Representation Learning on Event Streams

Event cameras are neuromorphic vision sensors representing visual inform...
research
12/07/2021

E^2(GO)MOTION: Motion Augmented Event Stream for Egocentric Action Recognition

Event cameras are novel bio-inspired sensors, which asynchronously captu...
research
07/21/2019

Attention Filtering for Multi-person Spatiotemporal Action Detection on Deep Two-Stream CNN Architectures

Action detection and recognition tasks have been the target of much focu...
research
09/28/2020

Event-based Action Recognition Using Timestamp Image Encoding Network

Event camera is an asynchronous, high frequency vision sensor with low p...
research
04/12/2021

Event-based Timestamp Image Encoding Network for Human Action Recognition and Anticipation

Event camera is an asynchronous, high frequency vision sensor with low p...
research
03/16/2019

Spatiotemporal Feature Learning for Event-Based Vision

Unlike conventional frame-based sensors, event-based visual sensors outp...
research
10/18/2020

Temporal Binary Representation for Event-Based Action Recognition

In this paper we present an event aggregation strategy to convert the ou...

Please sign up or login with your details

Forgot password? Click here to reset