Masked Event Modeling: Self-Supervised Pretraining for Event Cameras

12/20/2022
by   Simon Klenk, et al.
0

Event cameras offer the capacity to asynchronously capture brightness changes with low latency, high temporal resolution, and high dynamic range. Deploying deep learning methods for classification or other tasks to these sensors typically requires large labeled datasets. Since the amount of labeled event data is tiny compared to the bulk of labeled RGB imagery, the progress of event-based vision has remained limited. To reduce the dependency on labeled event data, we introduce Masked Event Modeling (MEM), a self-supervised pretraining framework for events. Our method pretrains a neural network on unlabeled events, which can originate from any event camera recording. Subsequently, the pretrained model is finetuned on a downstream task leading to an overall better performance while requiring fewer labels. Our method outperforms the state-of-the-art on N-ImageNet, N-Cars, and N-Caltech101, increasing the object classification accuracy on N-ImageNet by 7.96 demonstrate that Masked Event Modeling is superior to RGB-based pretraining on a real world dataset.

READ FULL TEXT

page 3

page 5

page 13

page 14

research
01/28/2022

3D-FlowNet: Event-based optical flow estimation with 3D representation

Event-based cameras can overpass frame-based cameras limitations for imp...
research
01/09/2023

Self-Supervised Time-to-Event Modeling with Structured Medical Records

Time-to-event models (also known as survival models) are used in medicin...
research
01/05/2023

Event Camera Data Pre-training

This paper proposes a pre-trained neural network for handling event came...
research
03/24/2020

Exploiting Event Cameras by Using a Network Grafting Algorithm

Novel vision sensors such as event cameras provide information that is n...
research
12/03/2019

EventGAN: Leveraging Large Scale Image Datasets for Event Cameras

Event cameras provide a number of benefits over traditional cameras, suc...
research
12/02/2021

N-ImageNet: Towards Robust, Fine-Grained Object Recognition with Event Cameras

We introduce N-ImageNet, a large-scale dataset targeted for robust, fine...
research
02/02/2023

Energy-Inspired Self-Supervised Pretraining for Vision Models

Motivated by the fact that forward and backward passes of a deep network...

Please sign up or login with your details

Forgot password? Click here to reset