HARDVS: Revisiting Human Activity Recognition with Dynamic Vision Sensors

11/17/2022
by   Xiao Wang, et al.
0

The main streams of human activity recognition (HAR) algorithms are developed based on RGB cameras which are suffered from illumination, fast motion, privacy-preserving, and large energy consumption. Meanwhile, the biologically inspired event cameras attracted great interest due to their unique features, such as high dynamic range, dense temporal but sparse spatial resolution, low latency, low power, etc. As it is a newly arising sensor, even there is no realistic large-scale dataset for HAR. Considering its great practical value, in this paper, we propose a large-scale benchmark dataset to bridge this gap, termed HARDVS, which contains 300 categories and more than 100K event sequences. We evaluate and report the performance of multiple popular HAR algorithms, which provide extensive baselines for future works to compare. More importantly, we propose a novel spatial-temporal feature learning and fusion framework, termed ESTF, for event stream based human activity recognition. It first projects the event streams into spatial and temporal embeddings using StemNet, then, encodes and fuses the dual-view representations using Transformer networks. Finally, the dual features are concatenated and fed into a classification head for activity prediction. Extensive experiments on multiple datasets fully validated the effectiveness of our model. Both the dataset and source code will be released on <https://github.com/Event-AHU/HARDVS>.

READ FULL TEXT
research
08/28/2021

GroupFormer: Group Activity Recognition with Clustered Spatial-Temporal Transformer

Group activity recognition is a crucial yet challenging problem, whose c...
research
11/20/2022

Revisiting Color-Event based Tracking: A Unified Network, Dataset, and Metric

Combining the Color and Event cameras (also called Dynamic Vision Sensor...
research
08/08/2023

SSTFormer: Bridging Spiking Neural Network and Memory Support Transformer for Frame-Event based Recognition

Event camera-based pattern recognition is a newly arising research topic...
research
03/13/2018

Dynamic Vision Sensors for Human Activity Recognition

Unlike conventional cameras which capture video at a fixed frame rate, D...
research
03/21/2023

E-MLB: Multilevel Benchmark for Event-Based Camera Denoising

Event cameras, such as dynamic vision sensors (DVS), are biologically in...
research
05/29/2023

Hierarchical Neural Memory Network for Low Latency Event Processing

This paper proposes a low latency neural network architecture for event-...
research
08/23/2023

Learning Bottleneck Transformer for Event Image-Voxel Feature Fusion based Classification

Recognizing target objects using an event-based camera draws more and mo...

Please sign up or login with your details

Forgot password? Click here to reset