Learning Bottleneck Transformer for Event Image-Voxel Feature Fusion based Classification

08/23/2023
by   Chengguo Yuan, et al.

Recognizing target objects with event-based cameras has drawn increasing attention in recent years. Existing works usually represent event streams as point clouds, voxels, images, etc., and learn feature representations using various deep neural networks. Their final results may be limited by two factors: monotonous modal expressions (reliance on a single representation) and the design of the network structure. To address these challenges, this paper proposes a novel dual-stream framework for event representation, extraction, and fusion. The framework simultaneously models two common representations: event images and event voxels. Using Transformer and Structured Graph Neural Network (GNN) architectures, spatial information and three-dimensional stereo information are learned separately. Additionally, a bottleneck Transformer is introduced to facilitate the fusion of the dual-stream information. Extensive experiments demonstrate that the proposed framework achieves state-of-the-art performance on two widely used event-based classification datasets. The source code of this work is available at: <https://github.com/Event-AHU/EFV_event_classification>
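
To make the two representations named in the abstract concrete, the following is a minimal sketch of how a raw event stream might be turned into an event image and an event voxel grid. It is not taken from the paper or its repository: the `Event` record, the function names `events_to_image` and `events_to_voxels`, the two-channel polarity layout, and the 4x4x4 grid size are all assumptions chosen for illustration.

```python
from collections import namedtuple

# Hypothetical event record: pixel coordinates, timestamp, polarity (+1 / -1).
Event = namedtuple("Event", ["x", "y", "t", "p"])


def events_to_image(events, width, height):
    """Accumulate events into a 2-channel count image.

    Channel 0 counts positive-polarity events, channel 1 negative ones.
    The result is indexed as image[channel][y][x].
    """
    image = [[[0] * width for _ in range(height)] for _ in range(2)]
    for ev in events:
        channel = 0 if ev.p > 0 else 1
        image[channel][ev.y][ev.x] += 1
    return image


def events_to_voxels(events, width, height, duration, grid=(4, 4, 4)):
    """Bin events into a sparse (x, y, t) voxel grid.

    Returns a dict mapping voxel index (ix, iy, it) to the number of
    events that fall inside that voxel. Empty voxels are not stored.
    """
    gx, gy, gt = grid
    voxels = {}
    for ev in events:
        # Scale each coordinate to its grid axis and clamp to the last bin.
        ix = min(ev.x * gx // width, gx - 1)
        iy = min(ev.y * gy // height, gy - 1)
        it = min(int(ev.t * gt / duration), gt - 1)
        key = (ix, iy, it)
        voxels[key] = voxels.get(key, 0) + 1
    return voxels


if __name__ == "__main__":
    stream = [Event(0, 0, 0.0, 1), Event(0, 0, 0.1, 1), Event(7, 7, 0.99, -1)]
    print(events_to_image(stream, width=8, height=8)[0][0][0])
    print(events_to_voxels(stream, width=8, height=8, duration=1.0))
```

In a dual-stream design of the kind the abstract describes, a dense structure like the count image would typically feed an image backbone, while the non-empty voxels could serve as graph nodes. How the actual framework builds and connects these inputs is not stated in the abstract.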

